Gene Athe_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1648 
Symbol 
ID7409478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1745124 
End bp1746311 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content35% 
IMG OID643716017 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002573515 
Protein GI222529633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGAACA ATAACTCAAA GCAGAACATT AAAGTTATTT TTTCAAGTAT TTTTTTATTT 
ATTTTTGCCA ACGCTTTTAT GGGAATTGGC GGTGGAATAA ACGATACAAT ATTTAACAAC
TACATCGCTG CAACATACAA GATATCTCCC ATGGCAAGAG GAATTTTGGA GTTTCCCCGT
GAAACTCCTG GTTTTTTGAT TATATTCTTA ATAGGCTTTT TATATTTCCT CGGTGATTTG
AGGGTAAGTA TAATTGCGAC ATCTTTGTGT TCATTTGCCT TGCTTGGTCT TGGATTTTTT
GCCCCAACTT TTCTGCTATT GATTGTATGG ACTGCTATTT ACAATACCGG AACACACCTG
AACATGGTCT TGAGCTCAAG TATTGGAATG GAGCTTTCTA AAGAAGAGGA ATACGGAAAA
ACTTTGGGTC TTATCAGTTC TGTAGCAACA GCTGCATCTA TAATAGGTTA TTTTATAGTT
ATGGTAGGAT TTAAATTTCT GAATTTTTCT TTCAAGACAG CATATGTTAT TGCTGCTCTG
ATGTATCTAT TTGCAGCATT ATTTTTACTG CCTGTTAAAC TCCCAAGAAA ACCTCAGCAC
AAAGGATTCA AGTTTGTCAT CAAAAAAGAT TACTGGCTTT ATTATGTACT TTCAATCTTC
TTTGGAGCAA GAAAGCAAAT ATTTATCACC TTTGCACCCT GGGTCTTGAT TAAGATTTTC
AAACAGCCTG TTGAAAATTT TGCACTTGTA GGAATCATTT GTTCGTTTTT AGGAATTGGT
TTTAGAAACA TAATCGGAAG GCTCATTGAC AGACTTGGCG AAAAGACCAT ACTTACATTT
GATGCGTTAG TAATCTTTTT TATATGCCTT GGATATGCAG CAACTGAGAA CATAAAAATA
AAATGGGTTG CACTTTCGGT TGCATATGGT TGTTACATCA TTGACAACTT GATGTTTGCA
ACATCAATGG CAAGGTCAAC ATACATAAAG AAGATTATAA AACATCCTGA TGACCTAACT
CCTACTCTTT CAACAGGTAC AAGTATGGAT CACGCTGTTT CTATGAGCCT CCCCATGCTT
TCTGGTTTTT TGTGGAATAA GTTTGGGTAT GAATATGTGT TCTTACTTGC AGCTTTCTTT
GCGCTGGGGA ATTTGTATTT TGTGAGGAAG ATTGAAATTG AAAGTTAA
 
Protein sequence
MLNNNSKQNI KVIFSSIFLF IFANAFMGIG GGINDTIFNN YIAATYKISP MARGILEFPR 
ETPGFLIIFL IGFLYFLGDL RVSIIATSLC SFALLGLGFF APTFLLLIVW TAIYNTGTHL
NMVLSSSIGM ELSKEEEYGK TLGLISSVAT AASIIGYFIV MVGFKFLNFS FKTAYVIAAL
MYLFAALFLL PVKLPRKPQH KGFKFVIKKD YWLYYVLSIF FGARKQIFIT FAPWVLIKIF
KQPVENFALV GIICSFLGIG FRNIIGRLID RLGEKTILTF DALVIFFICL GYAATENIKI
KWVALSVAYG CYIIDNLMFA TSMARSTYIK KIIKHPDDLT PTLSTGTSMD HAVSMSLPML
SGFLWNKFGY EYVFLLAAFF ALGNLYFVRK IEIES