Gene Information |
Locus tag | Moth_0511 |
Symbol | |
ID | 3831813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 530025 |
End bp | 531188 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828445 |
Product | hypothetical protein |
Protein accession | YP_429384 |
Protein GI | 83589375 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02877] sporulation protein YhbH |
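The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 530025
END_BP = 531188


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1164 387"
```

Under these assumptions the result agrees with the Gene Length (1164 bp) and Protein Length (387 aa) rows.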
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGTGG AGTACAACCT TTCCCGGGAA GACTGGTCCC TGCACCGCAA GGGTTACCTT GATCAGCAGC GGCACCAGGA AAAGGTCCGG GAAGCCATTA AAAAAAACCT GCCTCACATC ATAGCTGAAG AGAGTATCAT CATGGGCCGG GGTAAAAAGG TGGTCCGGGT GCCCATCCGC AGCCTGGAGG AGTATCACTT CCGCTTTAAC TACAACCAGG GGCAGCATGC CGGCCAGGGC AGCGGCGGTA CTCGGAAGGG GACCGTTATT GGCCGTGAGG TCATCGAAGG CGCTGGCGGG GGGGCCGGGG CCGGCGACGA ACCGGGGATG GATTACTATG AGGCCGAGGT CACCCTGGAA GAGGTCCAGG AGATGCTCTT CCGGGACCTG GAGCTTCCCA ACCTCCGGGA GAAAAAGAAA CCGGTGATGG CTTCCCCCGC TTATGAGTTT CGCGACGTGC GCCGCAAGGG CCTCATGGGC AACCTGGATA AAAAACGTAC CCTCCTGGAA AACCTCAAGC GTAACGCCAT GAAGGGCAAG CTGGCCATCG GCGGTATTAC GCCGGAGGAC CTGCGCTTTA AAACCTGGGA GGAGAAGATC CGCTACGAGA CCAGCGCCGT AGTCCTGGCC ATGATGGATA CCTCCGGCTC CATGGGGACA TATGAGAAGT ATATCGCCCG CACCTTCTTC TTCTGGATGG TGCGTTTCCT GCGCAGCAGG TACCAGCAGG TGGAACTGGT CTTTATCGCC CACCATACCC AGGCCCGCGA GGTAACGGAG GAAGAGTTCT TCGCCAAGGG TGAGAGCGGC GGTACCCGCT GCTCCTCGGC CTACCGCCTG GCCCTGGAGA TCATCGACCG GCGCTATCCA CCGGCGGATT ACAATATCTA CCCCTTCCAT TTTACCGACG GCGACAACCT CCCCAGCGAC AATGAGGCCT GCCTGGAGGC TGTCCAGGAA CTCCTGCCCA GGGTCAACCT CCTGGGTTAC GGGGAGATTG TCAATCCCTA TTACCGTACC AGCACCCTGA TGAATGTCCT CAAGCGCATC AAGGACGACC GCCTGGTGAC GGTGGCGGTC AAGGATAAGA GCGAGGTCTA CCAGGCTCTG CGGCAATTCT TTGCCGGGTC AAAAGGGGGA GAAGCTGGTG GAACAAGAAT TTAA
|
Protein sequence | MPVEYNLSRE DWSLHRKGYL DQQRHQEKVR EAIKKNLPHI IAEESIIMGR GKKVVRVPIR SLEEYHFRFN YNQGQHAGQG SGGTRKGTVI GREVIEGAGG GAGAGDEPGM DYYEAEVTLE EVQEMLFRDL ELPNLREKKK PVMASPAYEF RDVRRKGLMG NLDKKRTLLE NLKRNAMKGK LAIGGITPED LRFKTWEEKI RYETSAVVLA MMDTSGSMGT YEKYIARTFF FWMVRFLRSR YQQVELVFIA HHTQAREVTE EEFFAKGESG GTRCSSAYRL ALEIIDRRYP PADYNIYPFH FTDGDNLPSD NEACLEAVQE LLPRVNLLGY GEIVNPYYRT STLMNVLKRI KDDRLVTVAV KDKSEVYQAL RQFFAGSKGG EAGGTRI
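The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how such a field might be processed, the helpers below strip the grouping and compute the GC fraction. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input; paste the full field to compare against the GC content row.

```python
def ungroup(grouped: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence (sample input only).
SAMPLE = "ATGCCGGTGG AGTACAACCT TTCCCGGGAA"

dna = ungroup(SAMPLE)
print(len(dna), round(100 * gc_fraction(dna)))  # length in bp, GC percent
```

The same `ungroup` helper applies to the protein sequence field, where `len(ungroup(...))` gives the residue count.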