Gene Athe_2554 details


Gene Information

Locus tag: Athe_2554
Symbol:
ID: 7409505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2672878
End bp: 2674524
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 40%
IMG OID: 643716918
Product: extracellular solute-binding protein family 1
Protein accession: YP_002574395
Protein GI: 222530513
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
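
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2672878
end_bp = 2674524
gene_length_bp = 1647
protein_length_aa = 548


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks hold for this record: 2674524 - 2672878 + 1 = 1647 bp, and 1647 / 3 - 1 = 548 aa.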


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00103966
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAAA GAAGAGTTGC TTTATTGGTT GCTATTGCCT TTTTGATAAC CATCATTGTT 
CCAGGGTTTT TAAGTACTCC AACAAAGGCA GTAGCAGCAT CTAAAAACCC AATAGTTTTT
AAGATTTATT GGGGTGATTC TAATGCAGAG CCGGTTGATG TGTGGAAGAC ACCAATTGGT
AAAAAGGTTG AACAAATAAC AGGTGTAAGG CTTCAATTTG AATTTATTGT TGGCAGTGAC
GAGGAAACAA AGGCAGGTAT CATGCTTGCA AGTGGCGACT TGCCAGACTT AATCAATGCA
CACAATGTTG TAAACAAGTT TATTGAAGCA GGAGCTTTAG TTCCACTGGA TAATTACATT
GCTAAGTACG GCAAGAACAT CAAAAAGTGG TATGATACAA AAGCGCTTAA AAAACTCAAA
TATCCAAAGG ATGGGCACAT TTACTATCTC ACACCTTTCA GAGAAGAGTC TGACCCGCTC
TATTCGTTTG CTGGTTTTTG GCTGCCTATA TACGTTCTGA AAGAAAACAA ATGGCCAGTT
GTAAGAGATA TTGACACATA TTTTAAGATT GTAAAAGATA CTGTCAAGAA ACATCCAACA
TACAATGGAA AGCCAACGAT AGGTTTTACA GCGCTGACTG ATAGCTGGAG AATTTATGTA
CTGATGCAAC AGCCGCTGAG GCTTGAAGGC TATCCGAACG ATGGTGGCTG GCTGATCGAC
GAAAAGACTG GAGTTGTAAA AGACAGCTAT ACAATGCCAT ATGCAAAGAC ATATTACAAA
ATACTCAACC AGATGTGGAA CGAAGGTCTT CTTGACAAAG AGATGTTCTC ACAAAACTAT
GATCAGTACT TAGCAAAGAT TTCGTCAGGT AGGGTTGTTG GTTTTTATGA TGAAAGATGG
CAGATACAGT CTGCAATAGA CTCTCTTGAA AAACAAGGAC TTTATGACAG AATTCCAATT
GCAATGCCAG TGCTCAAGAA AGGCGTAAAG AGAGATAGAT ACAACGTGGT TACAATGGGA
ACAGGTGCCG GAATATCAAT TACAAAAAAG TGCAAGGACC CGGTTGCAGC CTTCAAGTTC
TTGGACAGAA TGGCTGGCGA GGATATCTTG AAACTCATCA ACTGGGGTAT CCAAGGCCAG
GATTACTATG TAAAGAATGG TAAAATGTAC AAGGATGCAA AACAGATTCA AAACTACATG
AACCCAGATT ACAGAAAGAA ACAGGGCATT GGCGGAAATA TCTGGTTTGC ATTCCCAAGA
CCACCGTTTG ACTGGACATA TTCAGACAAG AGCGGAAAGA TTTCTTGGGA CTATTCAGAC
CAGGCATTAG AGCAGAGGTA CAAACCATAT GAAAAGGAAG TTTTGAAAGC TTATAAGATT
AAGTCGTTCA AAGACTTGTT CTCACCAACA TGGAACTCAC CGTACGGATA TGGCTGGGAT
ATCAAGCTCC CAGACGACCT GCAGGCAATC CAGAACCAGG CTGATGACTT GCAGAGAAGG
TACATCACAA AAGCTATAAT GGCAAAACCG GGTGATTACG ATAAGATTTG GAATGAATAC
CTTAGCAAGA TGAAGAACAT TCCTATCAAG AAGGTAATTG ACTTTAGACA AAAAGAGATC
CAGAGAAGAC TCAAAGAGTG GAACTAA
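
The gene sequence is laid out in blocks of 10 bases separated by spaces, so it has to be joined before any analysis. As a rough sketch, the helpers below strip that layout and compute a GC fraction of the kind reported in the GC content field. `clean_sequence` and `gc_fraction` are hypothetical names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Drop spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, in the 10-base block layout used above.
first_line = "ATGTTTAAAA GAAGAGTTGC TTTATTGGTT GCTATTGCCT TTTTGATAAC CATCATTGTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the reported GC content.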
 
Protein sequence
MFKRRVALLV AIAFLITIIV PGFLSTPTKA VAASKNPIVF KIYWGDSNAE PVDVWKTPIG 
KKVEQITGVR LQFEFIVGSD EETKAGIMLA SGDLPDLINA HNVVNKFIEA GALVPLDNYI
AKYGKNIKKW YDTKALKKLK YPKDGHIYYL TPFREESDPL YSFAGFWLPI YVLKENKWPV
VRDIDTYFKI VKDTVKKHPT YNGKPTIGFT ALTDSWRIYV LMQQPLRLEG YPNDGGWLID
EKTGVVKDSY TMPYAKTYYK ILNQMWNEGL LDKEMFSQNY DQYLAKISSG RVVGFYDERW
QIQSAIDSLE KQGLYDRIPI AMPVLKKGVK RDRYNVVTMG TGAGISITKK CKDPVAAFKF
LDRMAGEDIL KLINWGIQGQ DYYVKNGKMY KDAKQIQNYM NPDYRKKQGI GGNIWFAFPR
PPFDWTYSDK SGKISWDYSD QALEQRYKPY EKEVLKAYKI KSFKDLFSPT WNSPYGYGWD
IKLPDDLQAI QNQADDLQRR YITKAIMAKP GDYDKIWNEY LSKMKNIPIK KVIDFRQKEI
QRRLKEWN
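
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is an illustration under stated assumptions, not the database's own method. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs only in its permitted start codons, and this gene starts with ATG). `CODON_TABLE` and `translate` are names chosen for the example.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the standard amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGTTTAAAA GAAGAGTTGC TTTATTGGTT"))   # MFKRRVALLV
print(translate("CAGAGAAGAC TCAAAGAGTG GAACTAA"))      # QRRLKEWN
```

The two outputs match the first block and the last line of the protein sequence listed above. The final TAA codon is the stop and contributes no residue.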