Gene Athe_1753 details


Gene Information

Locus tag: Athe_1753
Symbol:
ID: 7408540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1827609
End bp: 1828790
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 26%
IMG OID: 643716131
Product: major facilitator superfamily MFS_1
Protein accession: YP_002573620
Protein GI: 222529738
COG category:
COG ID:
TIGRFAM ID:
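
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based and inclusive, and if the protein length excludes a single terminal stop codon. Both readings are assumptions about this database's conventions; the record does not state them. A minimal Python sketch of that check follows, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Athe_1753.
length_bp = gene_length(1827609, 1828790)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

The two results can be compared against the Gene Length and Protein Length fields of the record.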


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA ATAGAAAGGC TATTAACTTG TTATGCTCTC AATTTATATC AGAGATTGGA 
AATTGGATTG ATAGGGTTGC TTTACTTACA TTAGTGTATA GTGTTAGTAA GTCAAATTTA
CAAATGTCCA TTCTTTCTAT TTTAATATTA TTACCTGCTG TTATATTTGG TATTCCATTT
GGAAAAATAA TTGATTTATC AAACAAAAAA ACAATTTTGG TATTTGGAGA TATTTCGAGA
GCTTTATTAG TAATATTAGT GCCATATTTC ACCAATTATG TGTTTTTGAT TGTGTTTATT
ATATCTTCTA TCACCGCTAT TTATGAAAAT ACAAGAAATA GTATAATTCC AGAACTAATC
ACCAAAGAAG AAATACGTAA AATTAACAGT CTTAGTAGCT CATTAAATTC TGTTATGATG
GTTGTTGGTC CCTTAATAGG TGGGTTATTA ACTTCTTATC TTAACTTGAA ATATTGTTTT
TTTATTGATT CATTTACTTT TCTTGTTTCT GCTATTTTTA TTTGTCAAAT TTCATCCCAT
AAACACAAAA GTACGGAGAA CCAAAATGAA AATATAAGAT ATTTGGAATT CTTAGAATAT
TTGAAATCTA ACTTTATCAT AAAAAGTTTA ATAATTCTCA ATGGTTTAAT TGGTTTATTT
GCAGGAATTC TGAACGGACT GTTAATTGTT TATATGATTA ACTATTTACA TACTGATTCT
AAAGGATATG GTTTTATTCT GACATCAAAA GGAATTGCAA TGGTTATTAC CTCGCTTTTT
GTTTATAAAT ATATAAAAAC AATAAAAAAC GAAACTCTTC TTTTAACAGG AGTAATAGGT
CTCGGAATTT CTATCGCTTT GTTTTCATTA AATAAGATTT TTGTATTTGC TCTTATCATC
TATTTTGTAA ATGGAATATG TAATTCATTT TATGCTATTG CTCGAACTAC TATAATCCAA
GAAAACTGTA ACAAGAAACT ATTAGGAAGA GTTTTTAGCT TTAATTCAAT AGTGGGAAAT
ATCTCCTCAA TTATTTCATT ATTAATTGGA GGAATTATAT CTAATACTAT ATCTGTTAAG
ACAATCTTTT TAACCAGCGG AGTTAGCATT ACACTAATAG GAATGGTTTA TTTTATCAAT
TTATCAAAAA ACTATCATGA AATACTTCAG CACAGTAGAT AA
 
Protein sequence
MKNNRKAINL LCSQFISEIG NWIDRVALLT LVYSVSKSNL QMSILSILIL LPAVIFGIPF 
GKIIDLSNKK TILVFGDISR ALLVILVPYF TNYVFLIVFI ISSITAIYEN TRNSIIPELI
TKEEIRKINS LSSSLNSVMM VVGPLIGGLL TSYLNLKYCF FIDSFTFLVS AIFICQISSH
KHKSTENQNE NIRYLEFLEY LKSNFIIKSL IILNGLIGLF AGILNGLLIV YMINYLHTDS
KGYGFILTSK GIAMVITSLF VYKYIKTIKN ETLLLTGVIG LGISIALFSL NKIFVFALII
YFVNGICNSF YAIARTTIIQ ENCNKKLLGR VFSFNSIVGN ISSIISLLIG GIISNTISVK
TIFLTSGVSI TLIGMVYFIN LSKNYHEILQ HSR
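
The sequences above are displayed in space-separated groups of ten, so the whitespace has to be removed before computing a length or a GC fraction. The sketch below shows one way to do that in Python. The function names are hypothetical, and only the first displayed line of the gene sequence is used as sample input, so the GC value it prints describes that line rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence for Athe_1753.
first_line = "ATGAAAAATA ATAGAAAGGC TATTAACTTG TTATGCTCTC AATTTATATC AGAGATTGGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence block through `clean_sequence` instead would allow the result to be compared with the Gene Length and GC content fields of the record.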