Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1590 |
Symbol | |
ID | 3832736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1624610 |
End bp | 1625905 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637829519 |
Product | major facilitator transporter |
Protein accession | YP_430439 |
Protein GI | 83590430 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0171149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.72938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATCC AGAACCCTGA GACCCCGTTC GGGCCCCGGG AAGTACCCAA ACCTTTTTTC GAGAATAAAT GGGTGCAATT GATTATTGCC ATGATCGGCA TGATCATGAT TGCCAACCTC CAGTATGCCT GGACTCTCTT TGTGCCGGAA GTAGTAAAAG GTCTTAACTC AACGAAAGCT GCAGTCCAGA TGGGGTTTTC CCTCTTTATC GCCTTTGAGA GCTGGGGCCA GCCTATTGCC GGATACTTCA TGGATCGCTA CAGCCCCCGG GCTTTGCTGA CTGTAGCCGC CTTGATGATC GGTATTGGAT GGGCCGGTAT GGGATTAGTC AAATCCCTTG GTGCTTTGTA TTTCTTATAT AGTATGGCGG GTGTGGGTGC AGCCCTGATC TATAGCGGCT CCATTGCCTC GGGCGTGCGC TGGTTTGAGG CCTCCAAGAG GGGAATGGCC TCCGGACTGG TAGCTGCCGC TTTCGGTTCC GGGGCCGCCC TCTTTATCCC CTTTATCGCT ATCATATTGA AAAAGCAGGG TTATAATTCA GCATTCGTAA CTACAGGCGT TATCCAGGGA ATTATCGCCT TGATTGCCGC CCAGTTCATG CGTTTCCCTG CTAAACCTAA GACGAGTAGC TCCAGTACGG CGAAAGCCCA GGTTTCCGCC AGCCAGCGTG ATTTTACTAC AGTGGAAATG TTTAAAACAG CTCATTTCTG GATTATTTAT TTAATGTTCT TGTTTATTTG TACCGGCGGC ATGATTGTAA CGGCCCAGAC AAAACCCTTT GGTACGGAGG CGGGTATAGC TGCCAGCATC ATTGTGACAG CAGCCACAAT CAATACCATT GCCAATGGTG CCGGGCGGAT AATCTGGGGT ATGATTTCCG ACAAACTGGG GCGTTACCAG ACAATGTTTG TGGCCTTTAC CATTAACGCC ATCGCCATGG CCCTGGTTCC TTTTATCGGC CATAACGCCT TTATGTTTGT CTTTATCTTT GCCCTGATTA TGTTCACCTG GGGTGAACTA TATTCCTTGT TCCCGGCCGT CAACGCCGAT ATTTTCGGAA CTACCTACGC CGCAACAAAT TATGGTTTCA TTTACAGTGC CAAGGGTTTG AGCGGTATTG TTGGCGGCTT TGTGGCCGCC CTGGTGGCCC AGATGAGCGG CTGGACACCG GTATTTTTAA CTGGTGCCGT AATGTCCCTG CTGGCCGGTT TGGGTGCCCT TTTATTGCGG AGTATTCCTA AACCAGTCCC CTCAGGGATT AAGAGCGGAA GCAGTGGTTT TTCCAGCCAG GCTTAA
|
Protein sequence | MSIQNPETPF GPREVPKPFF ENKWVQLIIA MIGMIMIANL QYAWTLFVPE VVKGLNSTKA AVQMGFSLFI AFESWGQPIA GYFMDRYSPR ALLTVAALMI GIGWAGMGLV KSLGALYFLY SMAGVGAALI YSGSIASGVR WFEASKRGMA SGLVAAAFGS GAALFIPFIA IILKKQGYNS AFVTTGVIQG IIALIAAQFM RFPAKPKTSS SSTAKAQVSA SQRDFTTVEM FKTAHFWIIY LMFLFICTGG MIVTAQTKPF GTEAGIAASI IVTAATINTI ANGAGRIIWG MISDKLGRYQ TMFVAFTINA IAMALVPFIG HNAFMFVFIF ALIMFTWGEL YSLFPAVNAD IFGTTYAATN YGFIYSAKGL SGIVGGFVAA LVAQMSGWTP VFLTGAVMSL LAGLGALLLR SIPKPVPSGI KSGSSGFSSQ A
|
| |