Gene Information |
Locus tag | Moth_0632 |
Symbol | |
ID | 3832530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 655898 |
End bp | 657313 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637828574 |
Product | general substrate transporter |
Protein accession | YP_429504 |
Protein GI | 83589495 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
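The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; both assumptions are consistent with the values in the table but are not stated by the record itself. The function names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 655898
end_bp = 657313

def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1416, matches "Gene Length"
print(expected_protein_length(length_bp))  # 471, matches "Protein Length"
```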
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000142658 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTAG GTGAATTATG GGGGAATTTT TATATGGACC AAGTAAGAAA TAAAATTTTG GATTCCGGCA TGCGCGGAAG TTCTATAACC AATCAATCCT CACAACTCAT TACACGAATA GAAAGCGTTC CTTTTTCGCG CTGGCACATC AAGCCGCGTG TGATTATGGG CAGCGCTACC TTTTTTGATG CATTTGACAC CTTATCCCTC TCATATGCTA TGCCGGTATT AATAGGGCTA TGGCATTTGA ATCCAGGCCA AATCGGAATA CTTATTGGCA TTGGATATCT TGGACAGGCT ATAGGTGCGC TATTGTTCGG ATCGATTGCC GAGCGTTTTG GCCGCGTTTT TAGTGCGAAA TGGGCCACTT TAGTGATGTC TATAATGGCT ATTGCTTGCG CCTTTGCAGG AAATTACAAT GAGTTGGTGG CACTGCGTTT TATACAGGGA ATCGGTGTTG GTGGCGAAGT TCCTGTAGCA GCCGCCTATA TTAATGAAAT TTCTCGTGCT TCTGGTCGGG GTCGTTTCTT CATGCTTTAT GAGATGGTTT ATCCTATTGG ATTGATGGTA ACTGCCCAGC TTGGGACCAT TATTGTACCA AGCCTGGGGT GGAAATGGAT GTTCTTCATA GGCGGTGGAA CAGGCATAAT CATTGTTCTA CTTATGAATT TGCTGAAGGA ATCACCTCGC TGGCTTATTT CCAAAGGACG GTTCGAGGAG GCCGAGCGCA TAATTGAAGA GATTGAGGCA AGCACCGACC AACGCATACC TGTCAATATT AAGGGAACTC AGGAGGCAGT TAAAGGTAAC TGGAAGGAGT TATTCTCACC ATTCTACCGG GGGAGGACAA TAGTCGTTTG GATGTTATGG TTTTCAACAT ATTTTGTTTC AAACGGCCTG AATAACTGGT TACCCAGTCT GTACAAGACA GTCTATAAAC TTCCCCTACA GACTTCTTTG CGGGTAGCAT CGCTTACAAA CCTTATCCAA ATAGTTGCTG TATTTGCATG TGCGATGCTA ATTGATAAGG TAGGCCGTAA ATTATGGGCA ACTATAGCAT TTCTCGTGGC TAGTTTGCTT CTTGGAATAC TTTGGATAAA CGGTGCAGCG ACTGCCTACA GCGTCATGTA CCTTGGGTCG TTAGCTTATG GCGTCATTGG CACGGTAACG GTTCTGCTTT ATTTGTATAC TCCGGAAATT TATCCAACCA GGATGCGAGC AGTTGGAACA GCATTTGCTA CTACATGGTT GCGTCTCGCA TCAGCAATTG CTCCTACCAT AGTAGGATTT ATTTTAGGGA CTAGAGGGAT TTCCAAGGTT TTTGCACTAT TTGCATGTGT TAGCGTTATT GGTGCTTTTA TGGCTATCCG GATGGTTGAA ACGAGGGAAA AGATGTTAGA AGAGATTGCA CCCTAA
|
Protein sequence | MNLGELWGNF YMDQVRNKIL DSGMRGSSIT NQSSQLITRI ESVPFSRWHI KPRVIMGSAT FFDAFDTLSL SYAMPVLIGL WHLNPGQIGI LIGIGYLGQA IGALLFGSIA ERFGRVFSAK WATLVMSIMA IACAFAGNYN ELVALRFIQG IGVGGEVPVA AAYINEISRA SGRGRFFMLY EMVYPIGLMV TAQLGTIIVP SLGWKWMFFI GGGTGIIIVL LMNLLKESPR WLISKGRFEE AERIIEEIEA STDQRIPVNI KGTQEAVKGN WKELFSPFYR GRTIVVWMLW FSTYFVSNGL NNWLPSLYKT VYKLPLQTSL RVASLTNLIQ IVAVFACAML IDKVGRKLWA TIAFLVASLL LGILWINGAA TAYSVMYLGS LAYGVIGTVT VLLYLYTPEI YPTRMRAVGT AFATTWLRLA SAIAPTIVGF ILGTRGISKV FALFACVSVI GAFMAIRMVE TREKMLEEIA P
|
| |
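As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide string codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; since this gene begins with ATG, the sketch does not handle alternative starts. The helper names `translate` and `gc_percent` are hypothetical, and only the first 30 bases of the gene sequence are used here rather than the full record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(dna: str) -> str:
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons from position 0 up to the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

# First three blocks of the "Gene sequence" field above.
prefix = "ATGAACTTAG GTGAATTATG GGGGAATTTT"
print(translate(prefix))  # MNLGELWGNF, the first 10 residues of the protein
```

Passing the full "Gene sequence" field to `translate` and `gc_percent` would allow a comparison with the "Protein sequence" and "GC content" fields; that check is not performed here.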