Gene Athe_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2029 
Symbol 
ID7408242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2141072 
End bp2142652 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content34% 
IMG OID643716396 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_002573879 
Protein GI222529997 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGTA AAAAAATATT TGCGATTTTT CTGACAATTA CTTTTTTAGT ATCTATTGTA 
ACTCTGGGAT TTTCATTTGG AAAAGCAGCT ACATCTAAAA AGCTAATTAC AATTACTACT
CATACAACTT CGAGCCAACC ACCAGCAGTT ACAGATTTGT ATAAAAAGAA ATTGAGAGAA
AAATTTGGCA TTGATTTAAA AACAATATAT ATTCCTCAAA GTGACTATGT TACAAAAATG
TCTTTGCTTT TTGCAAGTAA TATGGCACCA GACTGGATAC GTGCTTTAAG GCCTGAGTAT
AATTTAAATG AATGGATTGC AGCAGGATAT TTGATTGGTT TTACCACGGA TGAAATAAAA
AAGAAATGGC CTAATTACCT AAAAATATGG ACAAAGGAAG AATGGGATTA CCTCTATAAA
ATAGTACGTT ACAGTGATGG CAAGGTTTAT TCTTTCCATG GTAGGCGCAT AGCACCAGTG
GATATGGCTT TCTTGTACAG AAAAGAGATA TTTGATAGAT ATAATTTAAA GTTTCCAACC
ACAGTTGATG AGTTTTATAA AACATGTATA TTCTTAAGAC AGAAAACAGG TAAAGTTGTT
TATCTGCACG CAAATGCAGT TTCTGGTAAT TTAAGTCTTT GGGCTTTTAC TGGGATATTC
TTAATGTATG GTTTGCCTGA ACTTGCACCA AGACAGATTT CTTATGTAGA CCCACTTACC
AAGAAATTTG TACCTTTTGC GTTCAATCAA AATAATTACC GTCAAGCTTT AATTTTAATA
AATAAACTTT ATAAAGCAGG TTGTATATGG AAGGAATATG CAACAGCAAC TCGTGATCAG
TTAGATAAGT TCAGAACCCA AGGGCAAGGA ATAATTATGT GGGCATATCC TGCAAATATA
GGAACTTACA ACAATCTGTA TAGAAATACA GATAAGGATA CAAACTGGGT ATGGTCTAAG
GATACACCAA CAGCATATCC TGGAAAAGCG TACTTTTTCA AGAGAAATCC TTTGCACTTT
GCAGATGGTC ATGGGTTTAA TTCAAGCATC AGCAAAGAAA AGTTAGATAG ACTTCTCCAA
TATTTGAACT GGGCTCTTAG TGAAGAAGGT CAGATATTCC ATACTTATGG TGAATATGGT
GTTACTTATA AAAAGGAAGG AAACAAATAC GTATATATGG ACCATATTCA AACTCCAACC
AATCCATCGG GTAAATATAG CTTACAAGAC TATGGATTTC CATTTGCAGC ACCGAACGGG
TTTATGGTGG CATATCCTCA GGCTGTAGAA ACATATGCTC CTATATATGC AGAACTTGCA
AAAACGTTTA TGAATAGGCC AAAGTATTAC TACATCAGGG AAGAACCTAT GATGTATACA
AAAGAAGAAA TGGCAGAAAG AGCTGAGTTA GAATCAAATA TTATGGCAGT TGTGGATGAA
TATTGCATGA AGTTTGTAAC AGGTCAGTTA GACCCAAGTA ATAACAAAGA TTGGCAACAA
TATCTGAATG TTCTCAATAA AGTTGGTCTA CAACGCTTGA TAACAATTAG GATAAATGCA
TACAATAGAG CTAAGAAGTA A
 
Protein sequence
MKGKKIFAIF LTITFLVSIV TLGFSFGKAA TSKKLITITT HTTSSQPPAV TDLYKKKLRE 
KFGIDLKTIY IPQSDYVTKM SLLFASNMAP DWIRALRPEY NLNEWIAAGY LIGFTTDEIK
KKWPNYLKIW TKEEWDYLYK IVRYSDGKVY SFHGRRIAPV DMAFLYRKEI FDRYNLKFPT
TVDEFYKTCI FLRQKTGKVV YLHANAVSGN LSLWAFTGIF LMYGLPELAP RQISYVDPLT
KKFVPFAFNQ NNYRQALILI NKLYKAGCIW KEYATATRDQ LDKFRTQGQG IIMWAYPANI
GTYNNLYRNT DKDTNWVWSK DTPTAYPGKA YFFKRNPLHF ADGHGFNSSI SKEKLDRLLQ
YLNWALSEEG QIFHTYGEYG VTYKKEGNKY VYMDHIQTPT NPSGKYSLQD YGFPFAAPNG
FMVAYPQAVE TYAPIYAELA KTFMNRPKYY YIREEPMMYT KEEMAERAEL ESNIMAVVDE
YCMKFVTGQL DPSNNKDWQQ YLNVLNKVGL QRLITIRINA YNRAKK