Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0601 |
Symbol | |
ID | 7406942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 679679 |
End bp | 680899 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714984 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002572500 |
Protein GI | 222528618 |
COG category | [R] General function prediction only |
COG ID | [COG2270] Permeases of the major facilitator superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAATG GAAAGAACAA TCTTGAGGAA AAGAAAATTC TTGGTATACC GTGGAATGCT TTTATATTCG GTTTTGTAAG TTTTCTCAAT GACTTCTCAA GTGAACTGAC AATAAGGGCT CTCCCTCTCT TTTTGAAAAA CGTTTTAAAT GCAAAAACCT CTGTAATAGG TCTTATTGAA GGTGTGGCAG ATTCAACCGC AACAATCCTA AAAATCTTTT CAGGGTATCT TTCTGACAAG CTAAATCAAC GAAAGTGGCT TGTAACCATA GGTTACGGCC TTTCTGCACT TTCAAAACCA CTTTTATATT ATGCCAATAA CTGGGTATTC GTATTGATTA TAAGGTTTTT GGACAGAGTT GGAAAAGGTA TTAGGACTTC TCCTCGTGAT GCCTTGATTG CGAACACAAC AAAAAAAGAA GAACTTGGAA AGGCATTTGG ATTTAACAGA GCAATGGATC CGGCAGGGGC AATTTTAGCT TTGATTGTGG GCAGTTTTAT AATATACTTT ACCTCTAAAA ACGCCTTAAA GCTAACGCAA CACTTGTTTC AGATTCTTGT TTTAGTGTCA ATTTTTCCAG TCTTTGTTGC GCTTTTTTTA ATAATTGCAT TTGCAGTAGA TACTAAAAAC CAAAACCCAT CGGCAGCAAA GGTCAACCTA TCATTGAAAG GATTTGATAA AAAGTTTAAA CTATATCTTT TGACTATTTC AATCTTTACC CTTGGAAATT CTTCAGATGC TTTTTTAATC CTGCAGGCTC AAAACAGAGG ATTGACAGTT TTAGAGATAT TTTTGATGCT GGCCGCCTTC AATTTGATAA CAACTCTTAG CGCTTACCCT GCTGGAATTT TGTCAGATCG AATAAAACGC CAATATTTGA TTGTGATGGG CTGGATTGTA TATGCCTTGA TATACTTAGG CTTTGGACTT GCAACAAAGA CATATCAAAT AGTTGCTCTG TACATTTTGT ACGGTCTTTA CTATGGACTT ACAGAGGGCG TTGAAAAGGC ACTTGTTGCA GATTTAGTTC CCCCTGAAAA AAGAGGTACA GCTTACGGTC TTTACAACGG CGCTGTCGGA ATTTTCGCAT TTCCAGCAAG TTTGGTTGCA GGATTTTTGT GGCAATATAT AAGTCCTTCT GCACCTTTCA TCTTCGGCGC CATCCTTGCT ATTTTTGCCT CAGTGATGCT ACTGAAGGTA GTGAACATGA AGCAAGAATA A
|
Protein sequence | MKNGKNNLEE KKILGIPWNA FIFGFVSFLN DFSSELTIRA LPLFLKNVLN AKTSVIGLIE GVADSTATIL KIFSGYLSDK LNQRKWLVTI GYGLSALSKP LLYYANNWVF VLIIRFLDRV GKGIRTSPRD ALIANTTKKE ELGKAFGFNR AMDPAGAILA LIVGSFIIYF TSKNALKLTQ HLFQILVLVS IFPVFVALFL IIAFAVDTKN QNPSAAKVNL SLKGFDKKFK LYLLTISIFT LGNSSDAFLI LQAQNRGLTV LEIFLMLAAF NLITTLSAYP AGILSDRIKR QYLIVMGWIV YALIYLGFGL ATKTYQIVAL YILYGLYYGL TEGVEKALVA DLVPPEKRGT AYGLYNGAVG IFAFPASLVA GFLWQYISPS APFIFGAILA IFASVMLLKV VNMKQE
|
| |