Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2305 |
Symbol | |
ID | 3831419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2422717 |
End bp | 2424909 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637830229 |
Product | hypothetical protein |
Protein accession | YP_431135 |
Protein GI | 83591126 |
COG category | [C] Energy production and conversion |
COG ID | [COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain |
TIGRFAM ID | [TIGR00273] iron-sulfur cluster-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGGTA AAGAATTCAA GCAACGTATC CGGCAGGCCC TGAATAACGC CAGCCTGCGG GGAGCCCTTG GTCGCTTTGC CGATTCCTAT GTAGTTTCCC GGGAAGAGGT TTATGCCGGC CGGGATTTTG AATCCTTAAG GCAGAGGATT GCCGCTATCA AGGCTGATGC CGCCGGCCGT TACGAGGAAC TGGCCGACCG GTTCAGCCGG GCGGTGGAGG CCCGAGGCGG CAAGGTGTTC CGAGCTAAGG ACGCAGCGGC CGCCAGGGAA TATATCTACC AGGTAGCTAA AGAACACGGC GTTACAGAAA TCGTCAAGTC CAAGTCCTTT GCTTCGGAAG AGATCCACCT GAACGAATTC CTCCAGGAAC GGGGTATCAA TCCCTACGAA ACCGACCTGG CCGAGTGGAT CCTCCAGCTC ATGCCCGGGG AGAGGCCTTC CCATATGGTC ATGCCGGCCA TTCACCTCCC CAAGGAGGAG GTCGCCCGGG TCTTCAGCCG TTACCTGGGT GAACCGGTAG AACCTGATAT TAAAAATATG GTCCGTATCG CCCGCCGGGA GTTGAGGAAA AAATTTCTGA CTGCCGGCAT GGGCATCAGC GGCGCCAACA TTGCCGTGGC TGAAACGGGG ACCATTGTCC TCTGCACCAA TGAAGGCAAC GCCCGCTTGA CTACTACTGT ACCACCGGTT CACGTGGCCA TTGTCGGCTA CGAGAAGCTG GTGCCCAGCA TTAAAGATAT TGTTCCCATC CTTGAGGCCC TGCCCCGCAG CGGTACGGCT CAGCCCATTA CCAGTTACGT AACCATGATC ACCGGTCCGG TGCCGGCCTG GCGGGGCGAA GGCGAGGGTA TTAAGGAACT GCACGTCGTT CTTCTGGACA ACGGGCGCAC CAGGATGGCC GCCGACCCGG TCTTCAAGGA GGCCCTGCAG TGTATCCGCT GCGCCTCCTG CACCAACGTC TGCCCGGTCT TCCAGCTGGT CAGCGGCCAG GTCTATGGTT ATATTTACAA CGGCGGTATC GGCAGTGTCC TCACGGCCTT CTTCAATTCC CTGGAAGACG CCGTCGACCC CCAGAGCCTG TGTATCGGCT GCCGGCGCTG TGCCGAGGTC TGCCCGGCAA AGATTAATAT CCCCGATTTG GTGTTGAAGC TTCGGGAGCG GGTCGTCACG AAGCAGGGGC TTTCCAGCGG CTACCGGATC GCCCTCCACG GCATAGTGGC TAAACCGAAG CTGATGCACA CCCTCCTGCG GGCGGCCTCC CGCCTCCAGG GTCCGGTAAC CCACGGCCAG CCCCTGATCC GGCACCTGCC TCTCTTCTTC AGCAACTTGA CCTCCGGCCG CAGCCTGCCG GCCATTGCTA AGGAGCCCCT GCGGGACCGG GTCAAACGCC TGGAGGCCCG TACCGGCAGG CCGCGTCTCA AGGCCGCCTT TTACAGCGGT TGCGTAATTG ACTTTGCCTA CCCGGAGATC GGTGAGGCTG TTTATAAAGT GCTGGGGCGG GAAGGGGTGC AGGTAACATT TCCCCAGGGC CAGGCCTGCT GCGGTGCCCC GGCAGTTTAT GCCGGCGATC GGGAGACGGC GGTGAAGTTG GCCAAACAAA ATATTACCGC GCTGGAAGAG GCCCGGGCCG ATGTTGTGGT CACCGCCTGC CCTACCTGCG CCGTGGCCCT GAAAAAGGAC TTTCCCGAGC TCCTGGCCGG CGAACCGGCC TGGGAGGAGC GGGCCAGGGC CCTGGCGAAG AAGGTAAAAG ACTTTACCGA GCTGGTCCAT GAATTAACTG GCGGGCAGGG GAAAAAGGTC AAGCAGGCTA AAAAATCCGG CAGCGGGGCA GTAAAGGTTA CCTATCATGA CTCTTGTCAT TTCAAGCGGC ACCTGGGCCT GGACCAGGTG GCCAGGCAGG TTTTAAAGGA ACAGCCGGGG GTGGAACTGG TAGAGATGCA GGAAAGCGAC CGCTGCTGCG GTTTTGGTGG TTCTTATAGC ATCAAGTACC CGGAGATCAG CGCCCCTATC CTGGAACGCA AGCTGAAAAA TATTACCGAA AGTGGTGCGC AAGTGGTGGC TGCGGACTGC CCGGGCTGCG TCCTGCAGCT ACGCGGCGGC CTGGATCAGA AGGGCAGCTC TATCAAGGTC AAGCATACGG CTGAGGTGCT GGCGGCCTTG GAGAACTTGC AGGGTGGCGG GAACAGTAAA TAA
|
Protein sequence | MAGKEFKQRI RQALNNASLR GALGRFADSY VVSREEVYAG RDFESLRQRI AAIKADAAGR YEELADRFSR AVEARGGKVF RAKDAAAARE YIYQVAKEHG VTEIVKSKSF ASEEIHLNEF LQERGINPYE TDLAEWILQL MPGERPSHMV MPAIHLPKEE VARVFSRYLG EPVEPDIKNM VRIARRELRK KFLTAGMGIS GANIAVAETG TIVLCTNEGN ARLTTTVPPV HVAIVGYEKL VPSIKDIVPI LEALPRSGTA QPITSYVTMI TGPVPAWRGE GEGIKELHVV LLDNGRTRMA ADPVFKEALQ CIRCASCTNV CPVFQLVSGQ VYGYIYNGGI GSVLTAFFNS LEDAVDPQSL CIGCRRCAEV CPAKINIPDL VLKLRERVVT KQGLSSGYRI ALHGIVAKPK LMHTLLRAAS RLQGPVTHGQ PLIRHLPLFF SNLTSGRSLP AIAKEPLRDR VKRLEARTGR PRLKAAFYSG CVIDFAYPEI GEAVYKVLGR EGVQVTFPQG QACCGAPAVY AGDRETAVKL AKQNITALEE ARADVVVTAC PTCAVALKKD FPELLAGEPA WEERARALAK KVKDFTELVH ELTGGQGKKV KQAKKSGSGA VKVTYHDSCH FKRHLGLDQV ARQVLKEQPG VELVEMQESD RCCGFGGSYS IKYPEISAPI LERKLKNITE SGAQVVAADC PGCVLQLRGG LDQKGSSIKV KHTAEVLAAL ENLQGGGNSK
|
| |