Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0664 |
Symbol | |
ID | 3832151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 694102 |
End bp | 696930 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637828603 |
Product | hypothetical protein |
Protein accession | YP_429533 |
Protein GI | 83589524 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000150651 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000068678 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGACAAGA TGAAAACTAG AATCCTCGTT ATCAGTGAGT ACTTCTCACG TGGCGGTTTA GAAACTCATA TTGTCGGTCA AGCGCGCGTT CTATCGAAAC TTGGTGTGGA TCTTCTGTTA GCAACAGGTT CCTCAGCAGC CGATTGCCCC GATGGTGTGT TTGCTGCAGC CCTTACAGAT CTTCGAATGG GCGCCCAGGT ATCATATGAC GAATTATGTG TGACGCTTAA AAGGCTCAAG AAATTCATTT CAGACGAGCG CATAACGTTA ATCCATGCGC ATCCGTTTTA TAGCGCAATC GTCGGTCTCT TGGCAGCGCA ACAATCACGA TTGCCTTTTG TCGTTACGAT ACATAGTCCG TTATCATTAA GTAGTACGTT TGGGCAACTC TATGATTTTT TGCTCAAGTC TGTTGTGCTC CCAGTAGCAG GTCGAGTTTT CTGTGTATCG AAAGAGACAG AACTTTTGTG TCGCTCTTTG GCAGAATGTA GGACTGAACT TTTACTGAAC GCTGTTCAAA TACAGAACTC GAACCCACCA AACGTAGCCA AAGATGGCCC ATGGTTATGG GCTGGACGGC TTGATAAAGA TAAGTCTAAC GGCCTTCTCG ATTTGATAGA GAAAATTGAT CAGGCAACAG TCGGCGAACT CCATATATTT GGAGATGGGC CTCAAGTTCA TTTAATTGAA TCGTTTCTTA ATAGTCGGCC AGACAAGGCT GAGTTCGTGC GACTGATGGG ATGGCGTCAT AATATAACCA CGATAATGCC TGCTTATGCT GGCATTGCTG GGATGGGGCG CGTAATTCTT GAGGGTTCTG CCTTAAACAG ACCATGTTTG CTGGTTGGCT ATGATGGGGT TAAAGGCCTA CTCGATATTA ATAGGTTCGA ACGAGCCTCT TTTTGGAATT TCTCGGGTCG TGGATTACCA ACAATTACAG CGGATGCCCT TCATCAGGAA TTCTATAGAT TGTCCAAAGA TAAAGGCCCT TTTCTTCTTC GCCAATGGGT CGCCGATAAT CGGGACGAAA GAGTGATCTG GCAGCGCTAT GCGGAAAAGA TAAAAGATTT AGCTCCGCTT GATAATCCGT TGGCTCAAAA CATCTTGGAT GCGTTGCAAT ATCGCGGTTC CATTTCCGAA CCTGTGTGGT GGGACAAAGA GTTGATGCGG GTTATTCTGG GCTTGTTCTC CAATGAGCCA TACGGGAAGG AACAGGCGAA GAGTATGACC CATCAGATCT TGGTAGCGCA CCTCCACTCT AAATTAGAAG CGATAAAGTA TGAAACAACA ATGTTACGGG AGAAAATAGA TTTCCTTAAG AGTGCGCTGG CTGAGCGTGA CGAAAAGATC ACTTCTCTCA ATCAAGCTGT GGCTGAACGT GACGAAAAGA TCATCTCCTT AAATCAAGCT GTGATTGAGC GAGATGAAAA GATTGCTTCG CTACAAAAGC ACATTCAAGA TATTTGGGCC AGCACATCGT GGCGGATTAC ACGCCCTTTA AGATTTCTTA AGAAGCTTGT TAGTGATCCC GAGCCGACAA CTTATTTTAT ACTAAAGCGC ATTTATTGGG GACTTCCAGG AAATTTGCGG ATACGTCTAA ATGGACTTAG ATGTCTTATT ATTCGTTTTT TCTTATCAAA GCGCAAAAAC AATGTAGGTC TAATGGCTAA TCAAGAAGGT ATACATGGCC TTTCTTGGGA AGAGTTTCAA GATAAAGTAC TATCGAAACG GGAACAACAC AAGGGAATTT TTATTTTAGA GGCTACCCAT ATAGATTGGA ATATGAATTT ATTTCAGAGA CCTCAACATA TGGCAAACGC ACTTTCAAAG CTTGGTTACC TCGTAATTTT TAAGACTGCA AATTTTTATG ATAATGTATC TGGGTTTAAA AAAATATCCG ATAACCTTTG GTTGACAAAT AATGATAAAG TAGACAGTAT TTACGGAGCA GTAAGAAGCT TTTACAGTAC CTCTTCTGTT TACACTAAAG AAATCTATGA TGACCGCAGA AAATACGGCC TTGTTGTTTA CGAATACATA GATCATATTG ACCCCGCTAT ATCTGGAGAT GAGGAGAACA TCCGACGCCT GAATGCTCTC AAGAATTATG CCTTCAATGG TGGAGTAGAT TTTATCGTGG CTTCAGCTAA AGCATTGTAT AGAGAGGCTG TTCAAGCTGT AGGCGAAGAT AAAGTTATTT TAATTCCAAA CGGTGTGGAT GTTGAGCATT ATCGCGACCA ACGCCACAAG TACTGTACTA TTCCTAAGTC GTTGATTAAG TTCAAGAATG TACATAAAAT AATTGTGGGT TACTTTGGGG CACTTGCACC TTGGCTGTGG TATGAAGAGA TAGAGAAACT TGCAGCCCTT AGGCCAGAAG TAGGTTTTGT TTTCATTGGG CCTGATTATT ATGGCGGTTC ATCTCTGTTA CCGAAAGCGA AGAATATTTT TTGGATGGGG CCGGTTGATT ACAAAATTTT ACCCGGTTAT GCACTTCATT TTGATATATG TTTTATTCCA TTCCGACCAG GCGAGATAGC TCGGACCACA TCTCCATTAA AATTATTTGA ATATTTTGCT CTAGAGAAAC CTGTAATAGT AACTTCGAGT ATGTTAGAAT GTATACAATT TTGCGAGGTT TTGAGTGGAA GTTGTGCTAC AGAGCTATCA AAATGCATAG ATAAAGCTTT AGATTTATCT CGTGATGAAC ATTTTAAAAA GCGTTTGGCT GAATTAGCTG ATCAAAATTC TTGGATTGAA AGAGCGAAAA AATATGAGAT TATTTTTGAA CAGACTAAGA AATGGATTTG TCATAAGAAG GATATTTAA
|
Protein sequence | MDKMKTRILV ISEYFSRGGL ETHIVGQARV LSKLGVDLLL ATGSSAADCP DGVFAAALTD LRMGAQVSYD ELCVTLKRLK KFISDERITL IHAHPFYSAI VGLLAAQQSR LPFVVTIHSP LSLSSTFGQL YDFLLKSVVL PVAGRVFCVS KETELLCRSL AECRTELLLN AVQIQNSNPP NVAKDGPWLW AGRLDKDKSN GLLDLIEKID QATVGELHIF GDGPQVHLIE SFLNSRPDKA EFVRLMGWRH NITTIMPAYA GIAGMGRVIL EGSALNRPCL LVGYDGVKGL LDINRFERAS FWNFSGRGLP TITADALHQE FYRLSKDKGP FLLRQWVADN RDERVIWQRY AEKIKDLAPL DNPLAQNILD ALQYRGSISE PVWWDKELMR VILGLFSNEP YGKEQAKSMT HQILVAHLHS KLEAIKYETT MLREKIDFLK SALAERDEKI TSLNQAVAER DEKIISLNQA VIERDEKIAS LQKHIQDIWA STSWRITRPL RFLKKLVSDP EPTTYFILKR IYWGLPGNLR IRLNGLRCLI IRFFLSKRKN NVGLMANQEG IHGLSWEEFQ DKVLSKREQH KGIFILEATH IDWNMNLFQR PQHMANALSK LGYLVIFKTA NFYDNVSGFK KISDNLWLTN NDKVDSIYGA VRSFYSTSSV YTKEIYDDRR KYGLVVYEYI DHIDPAISGD EENIRRLNAL KNYAFNGGVD FIVASAKALY REAVQAVGED KVILIPNGVD VEHYRDQRHK YCTIPKSLIK FKNVHKIIVG YFGALAPWLW YEEIEKLAAL RPEVGFVFIG PDYYGGSSLL PKAKNIFWMG PVDYKILPGY ALHFDICFIP FRPGEIARTT SPLKLFEYFA LEKPVIVTSS MLECIQFCEV LSGSCATELS KCIDKALDLS RDEHFKKRLA ELADQNSWIE RAKKYEIIFE QTKKWICHKK DI
|
| |