Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0690 |
Symbol | |
ID | 3832514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 720787 |
End bp | 721917 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637828623 |
Product | hypothetical protein |
Protein accession | YP_429553 |
Protein GI | 83589544 |
COG category | [S] Function unknown |
COG ID | [COG3287] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000847515 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCGGGTTG GGGTAGGTTT TAGCAGTGCA AATGACCCTA GCGCTGCTGG ACAGGTTGCT TCCGAGCAGG CTGTGCGGCA ATCCGGAAGT CCCGTAATAA CTTTGGTTCT GACCACTGAC AATTACGATC AAGAGCGGGT CTTGTCGGCC GTCAAAAGAG TTATCGGTAA TTCCAGGCTG GTAGGCGCCT GCGTTCCGGG GGTCATTGTC AACGCGAGGC TGTACAAGAG AGGGGTGGGT ATCTGTACAG TCAGTGGAGA AGGTGTGGAG GCGGTAACAC ACCTGCAGAG GAATATTTCC CAGCACTCGT ACAGAAAAGG CGAGAAAGCG GGCGAAGCTT TGCTGGAAAA GGGCGGTGAA ACACCGGGGA CAGTATTACT CTTTCCAGAT GGTTTTGCAG CAAACATTTC TGGCCTGCTC AGGGGATTAT ATAACGTTAT GGGGCCTGCC TTTGAATACA TTGGGGGCGG GAGTGGCGAC AATCTGCGGT TTTACAGAAC TTATCAGTTT ACTGAGGAAG GTATCAGCAG CGATGCCGTA GCGGCAGCAG TAATAAGGGG TATAAACTTT CAGATGTGCC TGAGTCATGG CTGGAGGCCG GTAGGAGAAC CGCTCATGGT GACGAAAGCG AAAGGGAGAA AGGTTTATGA AATCGATGGA CTTCCGGCAC TGGAGAGATA TTCGGCTCTG GTTGGCGCCT ACGACAAGAA CGATTTTTCC TGCTACAGCA TGAAGTATCC TTTGGGCTTA CCCTGTGCGG GGGGAGAATT TATTATCCGC GATCCACTCA AAGCCGAAGA AGATGGGGGC ATTTTATTCG TAACTGAAAT TCCTGAAAAC ACTATCGCCA CTCTGATGGA AGGGGATACC GCAAGCCTTC TTGCGGCTGC GGAAGAAGTA TCGAAAAAGG CGTTAAATAC GCCAGCTGCT CCCAAGACCT TTATGGTGTT TGATTGTGTT TCCCGCTATT TATTGATGGG AGAGGACTTC TCTCGCGAAA TGGAAGCAAT AGCCAAAAAC ATCAAAGCAG AAATTCCAGT TATAGGGATG CTATCCTTCG GCGAAATTAG CAGCATCTCA GGGACACCGC TATTTTACAA CAAGACCATT GTAGCTGCCG CGGGGTGGTA G
|
Protein sequence | MRVGVGFSSA NDPSAAGQVA SEQAVRQSGS PVITLVLTTD NYDQERVLSA VKRVIGNSRL VGACVPGVIV NARLYKRGVG ICTVSGEGVE AVTHLQRNIS QHSYRKGEKA GEALLEKGGE TPGTVLLFPD GFAANISGLL RGLYNVMGPA FEYIGGGSGD NLRFYRTYQF TEEGISSDAV AAAVIRGINF QMCLSHGWRP VGEPLMVTKA KGRKVYEIDG LPALERYSAL VGAYDKNDFS CYSMKYPLGL PCAGGEFIIR DPLKAEEDGG ILFVTEIPEN TIATLMEGDT ASLLAAAEEV SKKALNTPAA PKTFMVFDCV SRYLLMGEDF SREMEAIAKN IKAEIPVIGM LSFGEISSIS GTPLFYNKTI VAAAGW
|
| |