Gene Information |
Locus tag | Moth_0622 |
Symbol | |
ID | 3832597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 644867 |
End bp | 645856 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637828563 |
Product | deoxyguanosinetriphosphate triphosphohydrolase-like protein |
Protein accession | YP_429495 |
Protein GI | 83589486 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
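The coordinate and length fields in the gene record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and do not come from the source database.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 644867
END_BP = 645856


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 990
print(protein_length(length_bp))  # 329
```

Both values agree with the Gene Length (990 bp) and Protein Length (329 aa) fields listed in the record.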
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.153363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCTGC GGGAAGAAGC CGAAAAGCGA GAAGAGATCC TTTTGAGCCC CCTGGCCAGC CTCAGCAGCC GCACCCGGGG CCGCCAGAAA CCGGAGGAGC CCTGCCCCAT CCGCACCGAA TACCAGCGCG ACCGCGACCG TATCATCCAC TGTAAGGCAT TCCGGCGCCT GAAGCATAAG ACCCAGGTTT TTATCGCCCC CGGCGGCGAT CACTACCGCA CCCGGCTGAC CCACACCCTG GAGGTAGCCC AGATCAGCCG CACCATCGCC CGGGCCCTGC GCCTCAATGA AGACCTTACC GAGGCCATCG CCCTGGGCCA CGACCTGGGC CATACGCCCT TCGGCCACTC CGGGGAAGAA GCCTTAAACG AGGTCGTGCC CGGCGGGTTC AAGCACAACC TCCAGAGCCT GCGAGTGGTG GAGGTCCTGG AGGGGGGGTC GGGCCTGAAC CTTACCTGGG AGGTCCGCGA CGGCATCGCC CACCATACCG GCCCGGTCAA ACCCGGCACC CTGGAAGGCC GCATCATCTG TTACGCCGAC CGCATAGCCT ATATCAACCA CGACATCGAC GACGCCATCC GGGCCGGGAT CATCGTGCCG GAACAGCTCC CGAAGGCCTG CCTGCGGGTC CTGGGCGACA GCCATCGCCA GCGCATCGAT ACCATGGTCA CCGACCTCAT CCACCACAGC TGGCGAATGG GCCAGATCGG CATGAGTCCG GAGGTCCAGC AGGCGACCGA CACCTTGCGC GCCTTCCTTT TCGAAAAAGT ATATATCGGT TCCCGGGCCA AGGCGGAAGA AGGCAAAGCC AAAAGGCTCC TCCAGCAGCT CTACCGTTAT TACCTGGAAA ACCCGGAGGA ACTGCCCCCG GCCGCCAATC CAGGCGACGA CCTGGCCCGG CGGGCCTGCG ACTACATCGC CGGCATGACG GACAGCTACG CCATCCTCCA GTACACCCGT ATTTTTGTCC CCAGGGGTTT TCCCACCTAA
|
Protein sequence | MLLREEAEKR EEILLSPLAS LSSRTRGRQK PEEPCPIRTE YQRDRDRIIH CKAFRRLKHK TQVFIAPGGD HYRTRLTHTL EVAQISRTIA RALRLNEDLT EAIALGHDLG HTPFGHSGEE ALNEVVPGGF KHNLQSLRVV EVLEGGSGLN LTWEVRDGIA HHTGPVKPGT LEGRIICYAD RIAYINHDID DAIRAGIIVP EQLPKACLRV LGDSHRQRID TMVTDLIHHS WRMGQIGMSP EVQQATDTLR AFLFEKVYIG SRAKAEEGKA KRLLQQLYRY YLENPEELPP AANPGDDLAR RACDYIAGMT DSYAILQYTR IFVPRGFPT
|
| |
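The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch, not the database's own pipeline, of how the GC content and the translation could be recomputed from the gene sequence. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). All function names are hypothetical.

```python
# Standard codon assignments, laid out in the conventional TCAG order.
# Assumption: table 11 shares these assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS in frame 1, dropping one terminal stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence field.
first_blocks = clean_sequence("ATGCTCCTGC GGGAAGAAGC CGAAAAGCGA")
print(translate(first_blocks))  # MLLREEAEKR
```

The ten residues printed match the first block of the protein sequence field. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.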