Gene Athe_1182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1182 
Symbol 
ID7408764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1276024 
End bp1277052 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content34% 
IMG OID643715547 
Productaliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein 
Protein accessionYP_002573055 
Protein GI222529173 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.794723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATAA AAAGATATTA TAAAGTAATT ACCTTCGTTA TATTGCCAAT TGCTTTTCTA 
TTTCTCCTTT CAGGATGCTA TGCAAGAAAA AAAGACAAAA ATCTAAAGGT AAGAATTGCA
TTTTTCCCGA ACATAACTCA TGCCCAAGCA TTGGTAGGGA AAGAACTTGG TATTTTCCAA
AAGAGAATAG GCAAGGATGT AAAAGTTGAA TATAAGGTTT TCAATGCAGG TCCGGCTGAG
ATAGAAGCGT TTTTAGCAGA TGAGGTTGAC ATAGGCTATA TTGGACCTAT ACCAGCGATA
AATGGATTTG CAAAGACAAA TGGAGAAATA AAGATTATTG CAGGAGCTAC AAACGGAGGA
ATGATGCTGG TTTCAAGGCA GGATTTGAAT ATAAAGAATT TAGATGACTT AAAAGGCAAG
AAAATTGCAG TTCCTCAATA TGGGAATACC CAAGATATTG TATTAAGGTT TTTGCTAAGC
AAAGCTGGGC TAAAAGATAC TACCAAAGGT GGAGATGTTG AGATTATTCA AGCTGAAAAT
CCAGACATTA AAACTTTGCT TGATAGAAAC CAGATAGATG CTGCGTTGGT TCCTGAGCCT
TGGGGAACAA GGTTGAAAAA AGAAGTAAAT AGCAATGTTG TGCTTGACAG TAGCCAAATA
AGGCAATACA TAGATATTCC TACAACAGTA ATTATTACTA CCACAAAGTT TTTAAAAGAG
TATTCTGATA TTGTAGAAAA ATTTCTCATA GCGCATCTTG AGGTAACAGA CTTTATTGAA
AAAAATCCTG AAAAATCATA TGAAATAATA AATAACCAAA TTTCTGAGAT AACTTCTAAG
CCGCTGCCGG CAGACATCCT AAAAGACTCC TTCAAAAATA TCAAACTTTC AAGCGAAATA
CAAAGGAAAT CCTTAGAAAA AGCAATTGAG TCATATTTTG AGTTGGGATA CTTAAGAGAA
AAGCCAAATA TTGAAAAATT AGTTAACACA GAAATTTTAG ATAGAATCAA AAACAAAGAG
GTGTACTAA
 
Protein sequence
MEIKRYYKVI TFVILPIAFL FLLSGCYARK KDKNLKVRIA FFPNITHAQA LVGKELGIFQ 
KRIGKDVKVE YKVFNAGPAE IEAFLADEVD IGYIGPIPAI NGFAKTNGEI KIIAGATNGG
MMLVSRQDLN IKNLDDLKGK KIAVPQYGNT QDIVLRFLLS KAGLKDTTKG GDVEIIQAEN
PDIKTLLDRN QIDAALVPEP WGTRLKKEVN SNVVLDSSQI RQYIDIPTTV IITTTKFLKE
YSDIVEKFLI AHLEVTDFIE KNPEKSYEII NNQISEITSK PLPADILKDS FKNIKLSSEI
QRKSLEKAIE SYFELGYLRE KPNIEKLVNT EILDRIKNKE VY