Gene Information |
Locus tag | Moth_1257 |
Symbol | |
ID | 3833052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1299608 |
End bp | 1300996 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637829193 |
Product | major facilitator transporter |
Protein accession | YP_430114 |
Protein GI | 83590105 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
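The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information table for Moth_1257.
START_BP = 1299608
END_BP = 1300996
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length agree")
```

The strand field is not used here: because the gene lies on the minus strand, the gene sequence listed under Sequence would correspond to the reverse complement of the replicon at these coordinates, but the length arithmetic is the same on either strand.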
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.375741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00856521 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAGTAG AAACATTGGG ACGCAGGGTT GAGAAAGAAG GCGACCTTAA GAAACACGTC CAGAGTTCCT GGATACCAAG ATCTGTGCAC CCATACTCCT GGGTTTCCCT GCTTGTCTGC TGGGGTATCT GGGTAATCAA CGCCTTTGAT CGCGAGATTA TCTTACGCCT CGGGCCCAGT ATTACCGAAG AGTTTCACCT TTCCCCGGAA CAATGGGGCA ATATGGTGGC CCTCATTATG CTGGCCCTGG CTGTGTTGGA TATACCGGGC AGTATTATGA GCGATCGCTA CGGTTCCGGC TGGAAACGGG CCCGTTTTCA GGTGCCAATT GTAATCGGGT ATACAGTTTT GTCGTTCCTA TCAGGTTTAC GTGCCCTTAG TGCCCAGTTG AGCCATTTTA TTGCCTGGCG GGTTGGAGTC AACCTGGGTG CCGGCTGGGG CGAACCGGTT GGCGTCAGTA ATACGGCCGA GTGGTGGCCG GTAGAAAACC GCGGTTTTGC GCTGGGTGTC CATCATACAG GTTATCCCAT TGGTGCTTTA TTAAGTGGCG TTGTCGCCAG CTATGTACTG AGTACCTTTG GAGCCGAAAA TTGGCGTTAT AGCTTCTTTT TCGCCATTAT TGCCGTCCCA ATTATGCTCT TTTGGCTGTG GTATTCAACC CCGGAACGGG TAGATACCCT ATATAAGGAT ATTGAAGCTA AAGGTCTAAC TAAACCGGAA CTCGATGTAG GGGTCAACGT AGGTAAAGGA CAGGGTATGA ATGTTTTTAT TAAAACCCTT AAAAATAAAA ATGTCTCTTT AACTGCCGGG AATACCCTTC TAACCCAGAT TGTTTATATG GGCATTAATG TAGTCTTAAC TCCTTATCTC CATTATGTAG TCGGGTTCTC CGTAGCCGCT TCGGCAGGAT TGAGTATTAT CTTTACCCTG ACTGGCGCCT TCGGGCAGAT CCTCTGGCCC TGGCTGTCAG ACTACCTGGG TCGAAAATGG ACCCTGGTTG TCTGCGGCTT ATGGATGAGC GCCGGTATCG CTGCCTTCTA TTTTGCCACC AATATGAGTA AACTCGTATT AATCCAGTTA CTTTTCGGTG TTGTTTCCAA TGCTGTTTGG CCAATTTACT ATGCCATGGC CTCCGACTCG GCCGAAAAGG CTGCTACCTC TACTGCCAAT GGCATTATCA CTACGGCCAT GTTTATTGGT GGCGGCATCT CCCCGGTATT AATGGGTTGG TTGATTGGCC TCGGCGGTGG TTGGAATAGC CCGACAGGTT ATATCTATAC CTTCTTTGCC ATGGCCGCCT GCGCCCTTAT GGGAGTAGTA TTACAATTGT TTACAGTTGA AAAAGCAGGT ATTTTTGCCA AATCGGATGA ATCTATCTTC GTTAAGTAA
Protein sequence | MAVETLGRRV EKEGDLKKHV QSSWIPRSVH PYSWVSLLVC WGIWVINAFD REIILRLGPS ITEEFHLSPE QWGNMVALIM LALAVLDIPG SIMSDRYGSG WKRARFQVPI VIGYTVLSFL SGLRALSAQL SHFIAWRVGV NLGAGWGEPV GVSNTAEWWP VENRGFALGV HHTGYPIGAL LSGVVASYVL STFGAENWRY SFFFAIIAVP IMLFWLWYST PERVDTLYKD IEAKGLTKPE LDVGVNVGKG QGMNVFIKTL KNKNVSLTAG NTLLTQIVYM GINVVLTPYL HYVVGFSVAA SAGLSIIFTL TGAFGQILWP WLSDYLGRKW TLVVCGLWMS AGIAAFYFAT NMSKLVLIQL LFGVVSNAVW PIYYAMASDS AEKAATSTAN GIITTAMFIG GGISPVLMGW LIGLGGGWNS PTGYIYTFFA MAACALMGVV LQLFTVEKAG IFAKSDESIF VK