Gene Athe_2330 details

Gene Information

Locus tag: Athe_2330
Symbol:
ID: 7407749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2471380
End bp: 2472981
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 38%
IMG OID: 643716694
Product: extracellular solute-binding protein family 1
Protein accession: YP_002574173
Protein GI: 222530291
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
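
The coordinate and length fields in this record agree with each other, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2471380, 2472981)
print(length_bp)                  # 1602
print(protein_length(length_bp))  # 533
```

Under those assumptions the computed values match the reported gene length of 1602 bp and protein length of 533 aa.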


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTGGT TCAAATCTTC TAAAAAGGTT TTAAGTATTA TCGTAGTTAT TGCATTTGCC 
TTATCGCTTG TAATTCCAGC ATTTATTTCA TCAACCTCAA TAGCTTATGC AAAAACAATA
CCAACACTTA CCTATTTTGT TCGTCTTGAC CCCAAGGTTG CAACATCTTA CAACAGCTAC
TCTTCAATTG CTGCTTACCA GCTTTTGCAG AAAAAACTTG GGGTAAAGAT TGTGTTCAAG
CACCCACCGG TTGGCGGAGA GACAGACCAG TTCAACTTAA TGGTTGCATC AAGACAGCTG
ACAGACATCA TTGAGTGGAA CTGGGTTGAT AACTATCCTG GTGGACCTGT AAAAGCAATG
CTTGATAAGG TAATCATAAG GCTCAATGAC TATATGCCAA AATATGCTCC AAATTTATCA
AAATACTTAC AGCAACATCC TGACATCAAA AAACTTATTG TTACAGACGA TGGTGACATT
TACGGATTCC CAGCTCTTCG TGGAACAAAC CCAAAAATTG CATGCGTATA CTATGGACCT
CAAATACGAA ATGATTGGTT GAAAAAGCTC GGATTAAAAG AACCAGAAAC AGTTGATGAC
TGGTACAAGG TTTTGAAAGC ATTTGTAACA AAAGACCCAA ACGGCAATGG CAAAAAGGAT
GAAAGAGGAT TTACAATTCT GCGAAATGCT TCAAACCCGA GATATGCATT TGATTATTCT
TCCTTCTTAG TTGGTGCATG GGGAATAAAA ACAGATTTCT TCCAGATAAA TGGAAAGGTC
AAATATGGTC CGTTAGAACC ACAATACAAA CAGTTTATAG CAACACTTCA GAAGTGGTGG
AAAGAGGGCC TCATAGACCC GGATATCCTA ACAATGAACC AGAAGGTTAT AAGAGCAAAT
GTTCAAAACG ATGTAATTGG TGCGTGGATA GGACTTCTTT CTGGCGATAT GGGCTTCTTC
TTGAACCTGA AGAAAGATAT AATAGCTACC AAGTTCCCTG TGCTCAAGAA AGGTGAACAG
CCACTTTTAG GACAGGCTGA GTTCTTGTTC GCTCGAACAA GCGCGGCTAT AACTACTGCA
TGTAAAGACA TACCAACTGC TATGAAGGTG CTTGACTGGG GTTACAGCAA AGAAGGATAT
GAAGCGTTCA ACTATGGTGT ACTTGGAAAG TCTTATATTA AGAAAGATGG CAAGGTTTAC
TATACAGATG AAATCTTGAA AAACCCACAG GGACTTTCTG CAGCAGAAGC TTTAGCAAAA
TATGCTCGTG CATCAATCAG CGGTCCTTTT GCTCAAGCTG ATGAGTATTA TCTACAGATT
CAAATGATGT ATCCACAACA AAAAGAAGCT GTCATGCAAA AATGGTCTGA TGTCAAAAAT
GACAGAATTT TGCCACCACT TTCGTTTACA GACGAAGAAT CCAAGAGACT TGCAAATATT
ATGAATACAG TCAACACGTA CTATGATGAA ATGTTCTTAA GACTTATGAC TGGAAAAGCA
ACAAATGTTG ATGCATTTGT AAAAACTCTT AAACAAATGA AGATTGATGA GGCTATTAAG
ATTTATCAAG CTGCATATGA CAGATGGAAA AAGAGAAAAT AA
 
Protein sequence
MDWFKSSKKV LSIIVVIAFA LSLVIPAFIS STSIAYAKTI PTLTYFVRLD PKVATSYNSY 
SSIAAYQLLQ KKLGVKIVFK HPPVGGETDQ FNLMVASRQL TDIIEWNWVD NYPGGPVKAM
LDKVIIRLND YMPKYAPNLS KYLQQHPDIK KLIVTDDGDI YGFPALRGTN PKIACVYYGP
QIRNDWLKKL GLKEPETVDD WYKVLKAFVT KDPNGNGKKD ERGFTILRNA SNPRYAFDYS
SFLVGAWGIK TDFFQINGKV KYGPLEPQYK QFIATLQKWW KEGLIDPDIL TMNQKVIRAN
VQNDVIGAWI GLLSGDMGFF LNLKKDIIAT KFPVLKKGEQ PLLGQAEFLF ARTSAAITTA
CKDIPTAMKV LDWGYSKEGY EAFNYGVLGK SYIKKDGKVY YTDEILKNPQ GLSAAEALAK
YARASISGPF AQADEYYLQI QMMYPQQKEA VMQKWSDVKN DRILPPLSFT DEESKRLANI
MNTVNTYYDE MFLRLMTGKA TNVDAFVKTL KQMKIDEAIK IYQAAYDRWK KRK
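
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it builds the codon assignments of NCBI translation table 11 (whose amino acid assignments are the same as the standard code) in the usual TCAG order, and the helper name `translate` is hypothetical. It is applied here only to the first and last lines of the gene sequence.

```python
# Codon table in NCBI TCAG order; amino acid assignments for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGGATTGGT TCAAATCTTC TAAAAAGGTT TTAAGTATTA TCGTAGTTAT TGCATTTGCC"
last_line = "ATTTATCAAG CTGCATATGA CAGATGGAAA AAGAGAAAAT AA"
print(translate(first_line))  # MDWFKSSKKVLSIIVVIAFA
print(translate(last_line))   # IYQAAYDRWKKRK
```

Both results agree with the start and end of the protein sequence listed above. The last line can be translated on its own because the 26 preceding lines hold 60 bases each, so it begins in frame.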