Gene Athe_0601 details

Gene Information

Locus tag: Athe_0601
Symbol:
ID: 7406942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 679679
End bp: 680899
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 38%
IMG OID: 643714984
Product: major facilitator superfamily MFS_1
Protein accession: YP_002572500
Protein GI: 222528618
COG category: [R] General function prediction only
COG ID: [COG2270] Permeases of the major facilitator superfamily
TIGRFAM ID:
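
The two length fields in this record can be derived from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the terminal stop codon is counted in the gene length but not in the protein length. Both assumptions agree with the values listed here, but the record does not state them; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(679679, 680899)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1221 406
```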


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGAATG GAAAGAACAA TCTTGAGGAA AAGAAAATTC TTGGTATACC GTGGAATGCT 
TTTATATTCG GTTTTGTAAG TTTTCTCAAT GACTTCTCAA GTGAACTGAC AATAAGGGCT
CTCCCTCTCT TTTTGAAAAA CGTTTTAAAT GCAAAAACCT CTGTAATAGG TCTTATTGAA
GGTGTGGCAG ATTCAACCGC AACAATCCTA AAAATCTTTT CAGGGTATCT TTCTGACAAG
CTAAATCAAC GAAAGTGGCT TGTAACCATA GGTTACGGCC TTTCTGCACT TTCAAAACCA
CTTTTATATT ATGCCAATAA CTGGGTATTC GTATTGATTA TAAGGTTTTT GGACAGAGTT
GGAAAAGGTA TTAGGACTTC TCCTCGTGAT GCCTTGATTG CGAACACAAC AAAAAAAGAA
GAACTTGGAA AGGCATTTGG ATTTAACAGA GCAATGGATC CGGCAGGGGC AATTTTAGCT
TTGATTGTGG GCAGTTTTAT AATATACTTT ACCTCTAAAA ACGCCTTAAA GCTAACGCAA
CACTTGTTTC AGATTCTTGT TTTAGTGTCA ATTTTTCCAG TCTTTGTTGC GCTTTTTTTA
ATAATTGCAT TTGCAGTAGA TACTAAAAAC CAAAACCCAT CGGCAGCAAA GGTCAACCTA
TCATTGAAAG GATTTGATAA AAAGTTTAAA CTATATCTTT TGACTATTTC AATCTTTACC
CTTGGAAATT CTTCAGATGC TTTTTTAATC CTGCAGGCTC AAAACAGAGG ATTGACAGTT
TTAGAGATAT TTTTGATGCT GGCCGCCTTC AATTTGATAA CAACTCTTAG CGCTTACCCT
GCTGGAATTT TGTCAGATCG AATAAAACGC CAATATTTGA TTGTGATGGG CTGGATTGTA
TATGCCTTGA TATACTTAGG CTTTGGACTT GCAACAAAGA CATATCAAAT AGTTGCTCTG
TACATTTTGT ACGGTCTTTA CTATGGACTT ACAGAGGGCG TTGAAAAGGC ACTTGTTGCA
GATTTAGTTC CCCCTGAAAA AAGAGGTACA GCTTACGGTC TTTACAACGG CGCTGTCGGA
ATTTTCGCAT TTCCAGCAAG TTTGGTTGCA GGATTTTTGT GGCAATATAT AAGTCCTTCT
GCACCTTTCA TCTTCGGCGC CATCCTTGCT ATTTTTGCCT CAGTGATGCT ACTGAAGGTA
GTGAACATGA AGCAAGAATA A
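
The GC content field can be checked against the gene sequence listed here. This is a hedged sketch: it assumes GC content means the percentage of G and C bases over the whole gene, and the helper names are made up for illustration. Only the first line of the listing is used as sample input; pasting the full listing would check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks that group the listing into 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only (sample input, not the full gene).
first_line = "GTGAAGAATG GAAAGAACAA TCTTGAGGAA AAGAAAATTC TTGGTATACC GTGGAATGCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```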
 
Protein sequence
MKNGKNNLEE KKILGIPWNA FIFGFVSFLN DFSSELTIRA LPLFLKNVLN AKTSVIGLIE 
GVADSTATIL KIFSGYLSDK LNQRKWLVTI GYGLSALSKP LLYYANNWVF VLIIRFLDRV
GKGIRTSPRD ALIANTTKKE ELGKAFGFNR AMDPAGAILA LIVGSFIIYF TSKNALKLTQ
HLFQILVLVS IFPVFVALFL IIAFAVDTKN QNPSAAKVNL SLKGFDKKFK LYLLTISIFT
LGNSSDAFLI LQAQNRGLTV LEIFLMLAAF NLITTLSAYP AGILSDRIKR QYLIVMGWIV
YALIYLGFGL ATKTYQIVAL YILYGLYYGL TEGVEKALVA DLVPPEKRGT AYGLYNGAVG
IFAFPASLVA GFLWQYISPS APFIFGAILA IFASVMLLKV VNMKQE
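
The protein sequence corresponds to the gene sequence read under translation table 11, which uses the same codon assignments as the standard code but accepts additional initiation codons such as GTG. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it assumes the input is a complete CDS on the coding strand, reads the first codon as Met, and stops at the first stop codon. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; translation table 11 shares
# these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # Assumption: the first codon is a valid start (here GTG) and is read as Met.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAGAATG GAAAGAACAA TCTTGAGGAA"))  # MKNGKNNLEE
```

The output matches the first ten residues of the protein sequence. Codons containing ambiguity codes such as N are not handled and would raise a `KeyError`.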