Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2023 |
Symbol | |
ID | 3831398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2110872 |
End bp | 2112164 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637829952 |
Product | hypothetical protein |
Protein accession | YP_430862 |
Protein GI | 83590853 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCTT GCTGGTGGCG GGAAATAGAT TTCCGTGGCT GGAATGGGGC TGTCGAGTTC GGAAATGACC TGATCCGGGT GGTTATGGTC CCGAACCTGG GTGGAAGAAT TATGGCCTAC GATCTTGGTG ATTATTCCTT TCTTTACGTT AATAAAGAGT TAGAGGGTAA ACTTTTCACG CCCGAAGAAA ATTACGGTGA TGGTTCCATT GCCGCCTGGA AGAATTACGG TGGCGATAAA ACCTGGCCAG CACCCCAGGG GTGGGATACC GAGGAAGAAT GGCACGGGCC GCCGGACCCG GTACTTGACA GCGGGGTATA TACTGGGCGA TGGCTGGAAT GCAGCCCGGA AAAAGTAAGC TATGAAGTGG AGAGCCCGCC GGATCCCCGC ACGGGAATTA AGCTCTTTCG TAAGGTTACT ATCCGGCAGG ACAGCAGCAA GCTGTGGCTG GAGCTGAGGA TGAAGAATAT CAGCTCCCGG CCAGTGGCGT GGAGCATCTG GAATGTAACC CAACTGGATA CGCGGTTGAG GAATGGCAAA GGGTATGATC CTAATTGTCG CCTTTATATA CCATTGAACC CTGTGAGCAG GTTTGCAAAA GGATACCGGG TAATCTTCGG CGAAGAAGAT AACCCGCAAT GGGGCCAACG GGAAGGCAAT GACCTCTTAG TAATACCTTA CCTATTTTAC GTCGGTAAAA TTGGCGTCGA TTCACCGGTG GGTTGGATGG CTTTTGTCAA TGATACTGAA GGCTATACCT GGTGCCTGCG CTACCCCTAC TATCCGGAAG AGAAAGATGC ATATCCCGAC GGGGGTTGCT CGGTAGAATG CTGGACGGTG GGTCGCGGTG TAGTGACCGG TAAGGATTAT TCCCAGGAGA CCGGTTATCA TATAGAGGCA GAAGTACTGG GGCCAGTAAG AAAGCTTAAA CCAGGTGAAG AGCAGTTTTT AGAACTGGAA ATGGGGGTAG CGAAAGGGGG CGGTAGATTT AAAAAAGTCA CAGCAGGGGG TTATATTATC ATGGGAGGTG GTGCCAGGTT AGAAAAAGGG AAATTAATAA TAAATCTTTC TGGCGGTGTT TTTTATAAAG GAAGGTTGCA GGTAGTTGTA ACTGATGCCA GGCATAACGT TATTTTGCAA CAAGATTTAG GAGAAGTATC TCCCCACGAA GAGGTAAAAG TTAACCAAAA AATAGATCTT CCATTTTCCA GGGTAGTTTT TCCATCCCTG CAAGCTCATT TAATCATTGA CCATCCGGGG GGTATGGATG AATACTACCT AGCGTACCTC TAA
|
Protein sequence | MNPCWWREID FRGWNGAVEF GNDLIRVVMV PNLGGRIMAY DLGDYSFLYV NKELEGKLFT PEENYGDGSI AAWKNYGGDK TWPAPQGWDT EEEWHGPPDP VLDSGVYTGR WLECSPEKVS YEVESPPDPR TGIKLFRKVT IRQDSSKLWL ELRMKNISSR PVAWSIWNVT QLDTRLRNGK GYDPNCRLYI PLNPVSRFAK GYRVIFGEED NPQWGQREGN DLLVIPYLFY VGKIGVDSPV GWMAFVNDTE GYTWCLRYPY YPEEKDAYPD GGCSVECWTV GRGVVTGKDY SQETGYHIEA EVLGPVRKLK PGEEQFLELE MGVAKGGGRF KKVTAGGYII MGGGARLEKG KLIINLSGGV FYKGRLQVVV TDARHNVILQ QDLGEVSPHE EVKVNQKIDL PFSRVVFPSL QAHLIIDHPG GMDEYYLAYL
|
| |