Gene Athe_0342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0342 
Symbol 
ID7409272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp390273 
End bp391538 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content37% 
IMG OID643714728 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002572251 
Protein GI222528369 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGGT TCAAGGCACA GCTGCAGCAG ATTGTTTTAG CATTCAAAGA GATAAATCCA 
AACGCTAAAA AGATCCTATT TTTTGAGCCC GTTTTTACTA TTCCATATGC TATGTTTATC
ATCTACTCCT CACTTTACAT GACAAGAGTT GGAGTAAAGG ATTACCAGAT AGGACTGCTG
TCAACAGTGT TGAACTTGGT GATGCTTATC ACATCACCCT TTGCAGGAAT GCTTGTGAAC
AGGTTTGGAC GCAAAAAGGT TCTCCTCATT GGCGATTTTC TGTCGTGGTG CGTGTATGCA
TATATATTTT TCTTTGCCAA AGATTTTGCA TGGTTTTTAA TTGCTACCAT CTTTAACGGA
CTTATGAGAA TTCCTGAACT TGCATGGCGG CTTCTTTTAA TGGAAGATGC AACCGAAAAC
GAAAGAGTTG CAATTTATTC TGTAACTGTA TTTGTGTGGA ATATGGGTAA CCTTTTTGCG
CCTGTGATGG GCGTCCTTGT TGCAAGGTTT GGCTTGATAC CTGCAACTAA ATGGACAGTC
CTTGCGTTTG GGATTTTAGT AAATATACTA ATTGTTGTAA GACATCTTGT TACATCTGAG
AGTTCTGTGG GGCAAAAGCT TGTACAGGAA AATTCTGACA AAAATAATAA TGGTTTTTCT
GAGTGGTTTG ACAGTTTAAA GTATATGTTC AGAAACGGAC AGCTTCTTTT GATTGTGCTT
GTAACAATAT TTGGCAACGT TGCCCTGATA TTCAGAGACA CATATAAAAA TATATATTTA
AGCGAAGCTT TGCATTATCC AGATAGCATA ATTTCGGTAT TTCCAACACT GTGGAGTGCG
GTAGCTCTCA TATTTGTAAT ATTTTTAATT CCAAATTTAA AAGAACAAAA ACATGATACT
GTCCTTTTTT GGGGAATGTT TTCAATTACA GTTTCCAATG CATTGATTTT AGTTGCACCT
CCTGGGACAT TTGGCTTTAT TTTGATGATA ATTGTAACAG TGCTTGGCAG CATAGGGGCT
GCAGTATATT ATTCATTTGT TGATGCTATC TTGGCAAATT CTGTTGATGA TGAAAGAAGA
GCACATGTCC TGTCAATTAC AATGTTTTTG ATTTCTCTTT TTTCAATGCC AGTTGGTGCA
ATAGCCGGAC AGTGTTATAC CTTTTCAAAG AGTTTGCCTT TTGTGCTTGC TACAATCTTT
ACTCTCTTGT GCACAATTTT GATTTTTTTC AAGATAAGAA TAAGAAGAGC CCAAAAAGAA
AAGTAG
 
Protein sequence
MTRFKAQLQQ IVLAFKEINP NAKKILFFEP VFTIPYAMFI IYSSLYMTRV GVKDYQIGLL 
STVLNLVMLI TSPFAGMLVN RFGRKKVLLI GDFLSWCVYA YIFFFAKDFA WFLIATIFNG
LMRIPELAWR LLLMEDATEN ERVAIYSVTV FVWNMGNLFA PVMGVLVARF GLIPATKWTV
LAFGILVNIL IVVRHLVTSE SSVGQKLVQE NSDKNNNGFS EWFDSLKYMF RNGQLLLIVL
VTIFGNVALI FRDTYKNIYL SEALHYPDSI ISVFPTLWSA VALIFVIFLI PNLKEQKHDT
VLFWGMFSIT VSNALILVAP PGTFGFILMI IVTVLGSIGA AVYYSFVDAI LANSVDDERR
AHVLSITMFL ISLFSMPVGA IAGQCYTFSK SLPFVLATIF TLLCTILIFF KIRIRRAQKE
K