Gene Information |
Locus tag | Moth_1371 |
Symbol | |
ID | 3832294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1416749 |
End bp | 1417831 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829307 |
Product | hypothetical protein |
Protein accession | YP_430227 |
Protein GI | 83590218 |
COG category | [S] Function unknown |
COG ID | [COG3581] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTAA CCTTTCCCCA TATGGGCCAT CTCTGGCTGG TGTTGAAGGC AGCCCTGACC GGCATCGGAC TGGAGGTAGT AGTACCACCC CCCTGTACCC GTCGCACCCT GGAACTGGGA GTGCGCCATG CCCCGGAATC GGCCTGTCTG CCCCTGAAAG TCAACCTGGG CAACTACCTG GAAGCCAAAG AGCTGGGGGC CGATACCATC GTCATGGCCG GCGGGGTAGG ACCCTGCCGG ATCGGCTATT ACAGCCAGGT GCAGAGGGAG ATTCTCCGGG ACCTGGGCTG TGCGTATGAA ATGGTCGTCT TTGAGCCTCC GGACGTCCAT TTTAACGAAG TCTGGGATAA GATTAAGTAT TTAAACCGCC GTCCCTGGCA GGATGGCGTC CACGGGGTCG TTATGGCCTG GCTGAAGGCC TGTGCCGTCG ACGCCCTGGA ACAGGAAGTC CAGCGCCTGC GACCGCGGGA AGCCCAGACC GGGCTGGCTG ATGCCGTTTT CCGGCGGGCC TTGCAGGAAC TGGATGCCGC CGGCAGTCGT AAGGAAGTAA ACCGGGTAGT TAAAGAAATA AAAGGTGAAC TGGCTGCCCT CCCCTTGAAA CCAGATGTCC GGGTGATCCG GGTAGGCATT GTGGGGGAGA TCTATACCGT CCTGGAACCC CTGGTCAACC TGGATATCGA GAAACGCCTG GGGGCCTTGG GAGTAGAAGT AGTCCGTGGT CTCTTCCTCA GCCGCTGGAT TAATGACAAC CTCTTCAAGG GCCTGCTGCC CCTACCGGGA CATCACCCGG AAAAGACGGC TCCACCTTTT TTAAACCACT TCGTCGGCGG CCATGGCTGG GAGTCGGTAG GAGATACGGT AACCTTTGCC CGCAGGGGAT GTGACGGCGT CATCCAGCTG GCCCCTCTAA CCTGCATGCC GGAGATCGTC GCCCACTCCG TCATGCCGGC CGTGCAGCAG GCTACCGGCA TTCCCGTGAT GACCATCTAC CTGGACGAGC AAACCGGCGA AGCCGGCCTG CAGACGCGCC TGGAAGCCTT TGTCGACATG CTCCGGCGGC AGAAGGGCGT GAGAGCGGGG TAA
|
Protein sequence | MKVTFPHMGH LWLVLKAALT GIGLEVVVPP PCTRRTLELG VRHAPESACL PLKVNLGNYL EAKELGADTI VMAGGVGPCR IGYYSQVQRE ILRDLGCAYE MVVFEPPDVH FNEVWDKIKY LNRRPWQDGV HGVVMAWLKA CAVDALEQEV QRLRPREAQT GLADAVFRRA LQELDAAGSR KEVNRVVKEI KGELAALPLK PDVRVIRVGI VGEIYTVLEP LVNLDIEKRL GALGVEVVRG LFLSRWINDN LFKGLLPLPG HHPEKTAPPF LNHFVGGHGW ESVGDTVTFA RRGCDGVIQL APLTCMPEIV AHSVMPAVQQ ATGIPVMTIY LDEQTGEAGL QTRLEAFVDM LRRQKGVRAG
|
| |
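The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database. It assumes the Start bp and End bp values are 1-based inclusive coordinates, and that the CDS ends in a single stop codon (the gene sequence above ends in TAA). The helper names span_length, expected_protein_length and gc_percent are hypothetical.

```python
# Values copied from the record above.
START_BP = 1416749
END_BP = 1417831
GENE_LENGTH_BP = 1083
PROTEIN_LENGTH_AA = 360


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Compare the derived values with the fields listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # Only the first two 10-base blocks of the gene sequence, as a small demo.
    print(gc_percent("ATGAAGGTAA CCTTTCCCCA"))
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the GC content field. The record appears to report that field as a rounded percentage.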