Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2415 |
Symbol | |
ID | 3832166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2537446 |
End bp | 2539083 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637830334 |
Product | hypothetical protein |
Protein accession | YP_431240 |
Protein GI | 83591231 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0766121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGAGGC ACTACCGTTA CCTGCAGCGC TATACTGAAG TGGCTACCGT ACTTCTCCGC CACGGCTGGC AGGCTTACTG CGAAACCCGC CCCCGCCGCG CTCCGGCCCG GCGCCCCCTG GTTACAGGTC GGGAGCCAGG CGATCCGGTT TACCGGCACC TGCGCCTGGC CTTTGAGGAA CTGGGACCGG TTTTTATCAA GCTGGGCCAG CTCCTGAGTA CCAGGCCCGA CCTGATACCG ACGGAGATGG CTGCGGAATT CAGCTACCTC CAGGACCGGG TACGGCCCCT GGCCCCGGAC GTCATCCGGC AGCAGGTCTT CCGGGAACTG GGGACTGCTC CGGAAAAGGC CTTCAGTTAC TTTGATTACC AGCCCCTCGC GGCGGCATCC ATCGCCCAGG TCCATAAAGC CCGGTTACCT GGTGGCCAGG AGGTGGCCGT CAAGGTGCAG CGCCCCCAGC TCGATGGGGT GGTCGTTACC GACCTGGCCG TCCTGGAGAA TCTCGGCCGG AGATTCAAGG GCACCGTCGT CGGCCGTATT TGTGCCTTGG AGGAGATCCT GGCCACCTTT CGCCGCCAGA TTGAACGGGA ACTCGATTTT ACCGTGGAAG CCCTGGCCAT GGAGAATTTT CGCCGCCTGT ACCGTGAGTT TCCGCAGATA GTGGTCCCCA GGGTTTACTG GGATTATACG ACCAGGGGCC TCCTGACCAT GGATTACCTG GCCGGGAAAA GGCTCAGCGA CTGGTACGGG AAGGGTACGG ACTGTCAGCG GGCAGCCCTG CTTATCAAGG CGCTCCTGGC GCCTTTTTTC CAGGAAGGCA TCTTCCACGG CGACCCCCAT CCGGGAAACA TTCTCTTTCT TCCCGGCGGT CGCCTGGGCT TAATTGATTT TGGCATCGTC GGTCGCCTGG ATGAGGATTA TCGTTACCAG GCTGCCAGGC TGATTCTAGG CCTCCAGGAA CGCGATTTAC AGGCCGTAAT GGAAGTAACC CTGAAACTGG GTAAGCCCAT GGCCGCAGTA GATTACCAGG CCCTCTATGA AGACACGGCA GAACTGGTTG ACCGGGTGAC CGGCATGGGC AAAGGGGATG TCAATCTGGC CGGTCTCCTG CTGGGAATGG TGGAACTGGC CCGCCGCCAT AGCATCCGTA TGCCCGGTAC CTTCTTCGTC CTGGGGCGGA CGATTATGGA AGGGGAGAGC CTGGCCCGCC GCCTGGATCC TTCCCTGGAT CTGGTGCAGG TAAGCGGGCC CCTGGCTGCC AGTTACCTGC GCAGCCGCCT GCGTCCCAAC CCCACGCCCG GGCGAACCTA CCACCGGGCG GCCTCAACCC TGCAGGATTT GCTGGAACTG CCGCGGGATA TCTCCCGCAG CCTGGATAAA CTTGCCCGGG GGCAGTTAAC TACCATTTTT GTCCACCGGG GCCTGGAAAC CCTTTACCAC AGACTGGATA TGGTTTCCGC CCGGCTTTCT GCCGCTCTCA TCGTGGCTGC CCTCATCGGC GCCGGGGCCC TAATCCTCCA CGCGGGTGCC GGTCCTAAAA CCGGTGGCCT TTCCCTCCTC GGTCTGGGAG TGCTGGGCGG CGCCCTTATC CTGGGTTGTC TCTGGGCTCT GCTCCTCAAG GTAGGACAGA AGGAATAG
|
Protein sequence | MTRHYRYLQR YTEVATVLLR HGWQAYCETR PRRAPARRPL VTGREPGDPV YRHLRLAFEE LGPVFIKLGQ LLSTRPDLIP TEMAAEFSYL QDRVRPLAPD VIRQQVFREL GTAPEKAFSY FDYQPLAAAS IAQVHKARLP GGQEVAVKVQ RPQLDGVVVT DLAVLENLGR RFKGTVVGRI CALEEILATF RRQIERELDF TVEALAMENF RRLYREFPQI VVPRVYWDYT TRGLLTMDYL AGKRLSDWYG KGTDCQRAAL LIKALLAPFF QEGIFHGDPH PGNILFLPGG RLGLIDFGIV GRLDEDYRYQ AARLILGLQE RDLQAVMEVT LKLGKPMAAV DYQALYEDTA ELVDRVTGMG KGDVNLAGLL LGMVELARRH SIRMPGTFFV LGRTIMEGES LARRLDPSLD LVQVSGPLAA SYLRSRLRPN PTPGRTYHRA ASTLQDLLEL PRDISRSLDK LARGQLTTIF VHRGLETLYH RLDMVSARLS AALIVAALIG AGALILHAGA GPKTGGLSLL GLGVLGGALI LGCLWALLLK VGQKE
|
| |