Gene Athe_2050 details


Gene Information

Locus tag: Athe_2050
Symbol:
ID: 7408263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2163389
End bp: 2164870
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 41%
IMG OID: 643716417
Product: Orn/Lys/Arg decarboxylase major region
Protein accession: YP_002573900
Protein GI: 222530018
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1982] Arginine/lysine/ornithine decarboxylases
TIGRFAM ID:
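
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2163389
end_bp = 2164870
reported_gene_length = 1482      # bp
reported_protein_length = 493    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1482 0 493
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # prints: True True
```

Under these assumptions the reported gene length (1482 bp) and protein length (493 aa) agree with the coordinates.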


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0371444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGCAG GGCAGAATAA GGTGACAAAG GAGGACCAGA GCAAAACACC CTTATTTGAT 
GCTGTAAAAA GACATATTGA AAAGAACATA ATACCGTTCC ACGTTCCAGG GCACAAATAT
GGAAGAGGTC TTAAAGAGTT TACCGATTTT GTTGGACAAA ACGTCATGCT AATGGACTTA
AACGGTATGG AAGACTTAGA CAACGCAAAT AACCCAATAG GAGTCATCTA TGAAGCCGAA
AAACTTTTTG CAAGCGCGTT TGGTGCCCAG TATGCATATT TTTTGGTAAA CGGTACAACA
TCGGGTGTTC AAACAATGAT AATGTCGGCG TGCGAACCTG GAGATGAGAT AATACTGCCT
CGAAACGCAC ACAAAAGTGC ATTTGGCGGG ATAATTCTAA GCGGGGCTAT ACCTGTGTAT
GTGCAACCAG AGGTCAATGA AGAGCTTGGG ATTACAATGG GTGTTACAAT TGAGAATGTA
AAAAAGGCAA TCCTGAAACA CCCTCATGCC AAAGCAGTTT TTGTTATAAA CCCCACATAT
TATGGAATTG CAAGTGATTT GAAGTCCATA ACAAGGACAG CGCACAAGTT TGGAATGGCT
GTTTTGGTAG ATGAAGCGCA TGGTGCACAT ATGGGATTTC ATAACGATTT TCCGCTCACT
GCAATGGAAG TTGGAGCAGA TATGAGCGCA GTTTCAACAC ACAAAACAGG TGGGTCGCTA
ACGCAAAGTT CAGTACTTCT TCTTAGAGGG CACAGGATTC AACCAGAAAC TGTAAAGCAG
GTACTAAATC TTACTATGAC AACAAGTTCA TCTTACATTT TGATGTGTTC TATAGACGTT
GCGAGAAAAC AGCTTGCAAT GTATGGTGAA GAGATGTTAG AAGAAACTTT GCGACTTGCC
AGAATGGCAA GAGAAGAGAT TAACAAGATT GAAGGGCTTT ATGCATTTGG TAAAGAGTTG
ATTGGAACAC CGGGAGTTTA TGATTTTGAT GAGACAAAAC TTGGGATAAA TGTCAGAAGA
CTTGGTATAA CTGGATATGA AGCTGAAAGA ATTTTGAGAG ATGAATATAA CATCCAAGTG
GAGATGTCTG ACCTTTACAA TATACTGGCT ATAATCTCTT TGGGAGATAC ACAGGAGAGT
GTGGAAAAGC TAATTGAAGC TCTTCGCGAT ATGGCTAAAA AACTTGGTGT CAAGGATGTA
AAGACACCAA CAATAGTTTT GCACTCACCA CAGGTGATTG TGTCGCCAAG AGATGCCTTT
TACAGCTCTA AAAAGGTTGT TGATCTTGAC AATGCAGTTG GTGAAATTTC GGGTGAGATG
GTCATGGCGT ATCCGCCTGG AATACCACTT ATTTTGCCGG GTGAGAGAAT TACAAAGGAC
CTTGTTGATT ATATAAAACT TTTGAAAGAA GAGGACTGCC AGCTTCAGGG CACAGCCGAC
CCTTATGTCA ATACAATAAG GGTACTTGGA ACAGCTGATT AA
 
Protein sequence
MEAGQNKVTK EDQSKTPLFD AVKRHIEKNI IPFHVPGHKY GRGLKEFTDF VGQNVMLMDL 
NGMEDLDNAN NPIGVIYEAE KLFASAFGAQ YAYFLVNGTT SGVQTMIMSA CEPGDEIILP
RNAHKSAFGG IILSGAIPVY VQPEVNEELG ITMGVTIENV KKAILKHPHA KAVFVINPTY
YGIASDLKSI TRTAHKFGMA VLVDEAHGAH MGFHNDFPLT AMEVGADMSA VSTHKTGGSL
TQSSVLLLRG HRIQPETVKQ VLNLTMTTSS SYILMCSIDV ARKQLAMYGE EMLEETLRLA
RMAREEINKI EGLYAFGKEL IGTPGVYDFD ETKLGINVRR LGITGYEAER ILRDEYNIQV
EMSDLYNILA IISLGDTQES VEKLIEALRD MAKKLGVKDV KTPTIVLHSP QVIVSPRDAF
YSSKKVVDLD NAVGEISGEM VMAYPPGIPL ILPGERITKD LVDYIKLLKE EDCQLQGTAD
PYVNTIRVLG TAD
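
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two could be compared, the Python below strips the block spacing, computes GC content, and translates codons with the amino acid assignments of translation table 11 (which match the standard code; table 11 differs in which codons may act as start codons, and that is not modeled here). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove block spacing and line breaks; return upper-case letters."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence.
first_blocks = clean_sequence("ATGGAGGCAG GGCAGAATAA GGTGACAAAG")
print(translate(first_blocks))  # prints: MEAGQNKVTK
```

The result matches the first block of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the whole protein and the reported GC content.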