Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0039 |
Symbol | |
ID | 3830905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 39307 |
End bp | 40602 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637827971 |
Product | hypothetical protein |
Protein accession | YP_428921 |
Protein GI | 83588912 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000157521 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTAAAA CCTTTCGCCA GCGGGAAGAA GATATTGTCC GTTGTAATAG ATGCGGTTTT TGTGAGGAAG TTTGTCCCAC CTACAAGGCG ACGGGGGAAG AGTTTTCCCT GGCCCGGGGA CGTAACCGTT TAATGCGCCA GTCCATGGAG GGCAAACTGG ATTTAACGAA AGAGCCCGAG ATCAACCAGC ATATCTATTC CTGCCTTCTT TGCGGCGCCT GTGTAGCGGC CTGCCCCTCG TCCGTCATCA CCGACACCCT GATCAAGACC GCCCGGGCCG AAATTACCCG GGCCAAGGGC CAGCCCTTCC CCATCCGCAT GGCTTTGCGG GGGGTCCTGG CCAACCAGCG GCGCCTGACC CTGGGGGCCA AAGTCCTGCG CTTCTACCAG CGCAGCGGCG CACGCTGGCT GGCCCGGCAT ATCGGTTTTC TTAACTTGAT GGGTTCCCTG GGCAAGGCCG AGGGGCTGCT GCCGGCCATC CCCGAGAAAA CCCTGCGCGT CCAGTTACCC CAACTCTTAA AGAAGCCGAT GAAGCCCCGG CATAAAGTCG CCTACTTTGC CGGCTGCATG ATTAACAACT TTTTTACTGC TGTTGGCGAG GCCACCCTGC GGGTTTACCA GGAAAACGAT ATCGAAGTAG TAGTGCCGAC CAGCAACTGC TGCGGCATCC CCCATGAGGC CTATGGCGAT ATAGAGATGC AAATAAAACT GGCCAAAGAA AATCTGGACG CCTTCAGCCG CTATGAGGTT GAAGCAATTG TCACCGATTG CGCCAGCTGT GCCCACGGCC TTCACAGTTA CGCCGAACTC CTTCAGGACG ATCCCCATTA TGGTCCCCTG GCGGCGCAGC TAGCGGCTAA AGTAAAGGAT GCCTCTCAGT ACCTGGTCGA GATTGGCTTT AAAAAGGAGA TGGGGCCGGT CAACGCTACC GTAACTTACC ACGATCCCTG CCATGCAGCC CGGGGCCTGA AGGTCAAGGA GCAACCGCGG GAGATCTTGA AGAGTATCCC GGGGGTTAAA TTCGTCGAGA TGAATGAATC CGACTGGTGC TGTGGCGGTG CCGGTTCCTA TAACGTAACC CACTACGAAC TATCACGTAA GATCCTCGCC CGCAAGATGG ATAACTTTAA GAAGACCGGA GCCGAATACC TGGCAACCTC CTGCCCGGCC TGCCTCATGC AACTGGCCCA CGGCCTGGAT GTCTACCGCT TGTCTGGCAA AGCAATCCAT GTTATGCAAA TATTGGACCA GGCCTACCAG AACCGGGCCG TCCGGAGCAA GGCCAAGGCC GGCTGA
|
Protein sequence | MVKTFRQREE DIVRCNRCGF CEEVCPTYKA TGEEFSLARG RNRLMRQSME GKLDLTKEPE INQHIYSCLL CGACVAACPS SVITDTLIKT ARAEITRAKG QPFPIRMALR GVLANQRRLT LGAKVLRFYQ RSGARWLARH IGFLNLMGSL GKAEGLLPAI PEKTLRVQLP QLLKKPMKPR HKVAYFAGCM INNFFTAVGE ATLRVYQEND IEVVVPTSNC CGIPHEAYGD IEMQIKLAKE NLDAFSRYEV EAIVTDCASC AHGLHSYAEL LQDDPHYGPL AAQLAAKVKD ASQYLVEIGF KKEMGPVNAT VTYHDPCHAA RGLKVKEQPR EILKSIPGVK FVEMNESDWC CGGAGSYNVT HYELSRKILA RKMDNFKKTG AEYLATSCPA CLMQLAHGLD VYRLSGKAIH VMQILDQAYQ NRAVRSKAKA G
|
| |