Gene Information |
Locus tag | Moth_0578 |
Symbol | |
ID | 3832491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 602425 |
End bp | 603462 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828519 |
Product | hypothetical protein |
Protein accession | YP_429451 |
Protein GI | 83589442 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02867] stage II sporulation protein P |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTATCC AGGTCATCAC CAGCCGGCAG GCCCGGGCCC TGGGAGCTTT CCTCAAAGGG GTGCTCCTGG CCGGGTTAAT TATTATGGGA CTGGTCACCG GGGCCTGGCA ATTGATCCGG ACCGGTAGCC AGGTGCTGTC CGGCTCCCCC GCCAGGCTAC CTATGCCCCT GCTGGAGGGC ATCCTGCGTG AGGGTTTGCC GGCCCTGGGC CTGCAGGCAG GAGAAGCCCC CCGGAATACT TCCGGTGAGG CCCTGACAAC GGCCCTGCAG GTCCTGGCCT CGCCCCTGGT AGTTTTACCT GGACAGGAGA AGACCCCGAG TCTCCTGACA ACTGAAGAGG AGTATGTCCA GCCGGAACCG CCGGTAGAAG AAAGCCCGCC CCCGGCGCCA TCAGAGGTCG AAACCACCAG CTCCAAAAAC CCCCTGGTGG CTATATACAA CACCCATAAC GCGGAATCCT ACCAGCCCAG TGAGGGTAGC GCCAAGTTTC CCGGCAAAAA CGGCGGCGTC AGCCAGGTGG CTGCCACCCT GGCCGACAGC CTGAGTAAAG ATTATGGAAT CCCGGTGGTT CGTTCAACCA CCATCCATGA TTATCCCGAT TTTACCCTGG CTTATGCCAA CTCCGAGAAA ACCTTGAAGC GAATGCTGGC CGCCAACCCC TCGGTCCTGG TGGCCCTGGA CGTCCATCGG GACGCCGGCC TGCCGTCACC GCCTGTTGTC GAAATTGACG GCCAGAAAGT CGCCCAGGTG CTCATCATCG TCGGCAGCAA CGCCCGGTTG GAACACCCCA ACTGGCGCCA GAATGAGGCC TTTGCCAGGC AACTGGCCAA AAAAATGGAT GAACTCTATC CCGGGCTGTG CCTGGGAGTC CGGGTCCAGG AGGGGCGCTA CAACCAGCAT CTCCTGCCCC GGGCTCTGCT CCTGGAATTC GGCAGCGACA ATAATACCCT CCAGGAGGCT GAAGGTTCCG CCCGCCTGGT AGCCAGGGTC CTGGCGGCGG TAATCAAAGA CCTGCGGCAG GAAAACCCGG CCAGTTGA
|
Protein sequence | MRIQVITSRQ ARALGAFLKG VLLAGLIIMG LVTGAWQLIR TGSQVLSGSP ARLPMPLLEG ILREGLPALG LQAGEAPRNT SGEALTTALQ VLASPLVVLP GQEKTPSLLT TEEEYVQPEP PVEESPPPAP SEVETTSSKN PLVAIYNTHN AESYQPSEGS AKFPGKNGGV SQVAATLADS LSKDYGIPVV RSTTIHDYPD FTLAYANSEK TLKRMLAANP SVLVALDVHR DAGLPSPPVV EIDGQKVAQV LIIVGSNARL EHPNWRQNEA FARQLAKKMD ELYPGLCLGV RVQEGRYNQH LLPRALLLEF GSDNNTLQEA EGSARLVARV LAAVIKDLRQ ENPAS
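The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration, not part of the source database: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 1038 bp) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(602425, 603462)
    print(length)                  # 1038, matching "Gene Length"
    print(protein_length(length))  # 345, matching "Protein Length"
    # Only the first two 10-base blocks of the gene sequence are used here;
    # pass the full sequence string to compare against the listed GC content.
    print(gc_fraction("GTGCGTATCC AGGTCATCAC"))
```

Because the gene sequence is listed in coding orientation (it begins with GTG, a valid start codon under translation table 11, and ends with TGA), no reverse complement step is needed before computing these values even though the strand is "-".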