Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2032 |
Symbol | |
ID | 7083791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2294695 |
End bp | 2295882 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699058 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_002355676 |
Protein GI | 217970442 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.683986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTTCA CCATCGACTC CACCTTGCCC CTCCCGGCCG GCGCCGTTCT TCCCGACTAT GGCGACGGCG GGCTCTACGG CTTCGCGCGC GGGCTCCGCC ACTGGCTGCA CGACCGCAAG GCCGGGTGGC CTGCGGTCGA GGTCGCGCCG GGCGAGCGCG CGCTGGTCGT GCTGCTGGTC ATCGACGGGC TGGGCGAACG CTTTCTCGAC ACGGTCGGGT GGGGCTCCGC GCTGCATGCG GCCAAGCACG CCGGCCTGAG CTCGGTGTGC CCGAGCACCA CGGCGAGTGC GATCACCACG CTGGCGACCG GCGTCGCGCC GGTCGAGCAC GGCCTCAACG GCTGGTTCAT CCACGATCGC CGCTTCGGTG GCGTGATCGC GCCCTTGCCG CTGATCCGCC GCAGCGGCGA GCCGCTGGAG GCCTTCCGCC TGCTGCCGCG CCTGTTCCCG GTGGCGCCGA TGTATCGCCA CGCCTGCCGA CCGGTCACCC TGGTCTCCCC CGTGCAGATC GCATTCTCGC GCTTCTCGCT GCACCATGGG CGCGGGGCAC ACATCGAGCC TTACGAAGGG CTGCAGGACT ACGTGGCCGC CATCGTCGAC ATGGCCGATG CGCTCGCGCA CAGTGGCGGG CTGATCCACG CCTATTACCC GGTGTTCGAC ATGCTGAGCC ACCAGCACGG CTGCCGCTCG GCCGAGGCGG TCGCGTGCTT CACGCGCGTG GATGCCGCCT TCGTGTCGCT GCAGCAGGCG CTGGCGGGGC GCGACGTGCG TCTGCTGGTG ACGGCCGACC ACGGCTTCAT CGACGCGCCA CCCGAGCGCC GCATCGACCT CGCGCCCGAC GGCGAGGTCG CCGCCATGCT CGCCGCGCCG CTGTTCGGCG AGCGCCGGCT GGCTTTCTGC CGGGTGCGCG CCGGTGCGCA GGCCGAATTC GAAGCGTGGG CTGCGGACGA GCTGCGTGGC AAGGCGGTGG CGGTGCGCGG CGAAGACTTT CTCGCCGCCG GTCTGCTCGG CCCGGGTCAG GTGCATCCGC GGCTGTCCGA ACGCCTGGGC AGCCACGCGC TACTGATGGA GGCCGGGTGG ACGATCGTGG ATCACGTGGC GGGCGAGCAC GAGCACACCA TGATCGGCGT GCATGGTGGC CTCAGCGCGG ACGAGATGCG CGTGCCGCTG ATGCTGGCAC GTACCTGA
|
Protein sequence | MPFTIDSTLP LPAGAVLPDY GDGGLYGFAR GLRHWLHDRK AGWPAVEVAP GERALVVLLV IDGLGERFLD TVGWGSALHA AKHAGLSSVC PSTTASAITT LATGVAPVEH GLNGWFIHDR RFGGVIAPLP LIRRSGEPLE AFRLLPRLFP VAPMYRHACR PVTLVSPVQI AFSRFSLHHG RGAHIEPYEG LQDYVAAIVD MADALAHSGG LIHAYYPVFD MLSHQHGCRS AEAVACFTRV DAAFVSLQQA LAGRDVRLLV TADHGFIDAP PERRIDLAPD GEVAAMLAAP LFGERRLAFC RVRAGAQAEF EAWAADELRG KAVAVRGEDF LAAGLLGPGQ VHPRLSERLG SHALLMEAGW TIVDHVAGEH EHTMIGVHGG LSADEMRVPL MLART
|
| |