Gene Tmz1t_2032 details


Gene Information

Locus tag: Tmz1t_2032
Symbol:
ID: 7083791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2294695
End bp: 2295882
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 72%
IMG OID: 643699058
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_002355676
Protein GI: 217970442
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
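
The length fields follow from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Coordinates copied from the record above.
length = gene_length(2294695, 2295882)
print(length, protein_length(length))  # 1188 395
```

Both results agree with the Gene Length and Protein Length fields of the record.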


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.683986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTTCA CCATCGACTC CACCTTGCCC CTCCCGGCCG GCGCCGTTCT TCCCGACTAT 
GGCGACGGCG GGCTCTACGG CTTCGCGCGC GGGCTCCGCC ACTGGCTGCA CGACCGCAAG
GCCGGGTGGC CTGCGGTCGA GGTCGCGCCG GGCGAGCGCG CGCTGGTCGT GCTGCTGGTC
ATCGACGGGC TGGGCGAACG CTTTCTCGAC ACGGTCGGGT GGGGCTCCGC GCTGCATGCG
GCCAAGCACG CCGGCCTGAG CTCGGTGTGC CCGAGCACCA CGGCGAGTGC GATCACCACG
CTGGCGACCG GCGTCGCGCC GGTCGAGCAC GGCCTCAACG GCTGGTTCAT CCACGATCGC
CGCTTCGGTG GCGTGATCGC GCCCTTGCCG CTGATCCGCC GCAGCGGCGA GCCGCTGGAG
GCCTTCCGCC TGCTGCCGCG CCTGTTCCCG GTGGCGCCGA TGTATCGCCA CGCCTGCCGA
CCGGTCACCC TGGTCTCCCC CGTGCAGATC GCATTCTCGC GCTTCTCGCT GCACCATGGG
CGCGGGGCAC ACATCGAGCC TTACGAAGGG CTGCAGGACT ACGTGGCCGC CATCGTCGAC
ATGGCCGATG CGCTCGCGCA CAGTGGCGGG CTGATCCACG CCTATTACCC GGTGTTCGAC
ATGCTGAGCC ACCAGCACGG CTGCCGCTCG GCCGAGGCGG TCGCGTGCTT CACGCGCGTG
GATGCCGCCT TCGTGTCGCT GCAGCAGGCG CTGGCGGGGC GCGACGTGCG TCTGCTGGTG
ACGGCCGACC ACGGCTTCAT CGACGCGCCA CCCGAGCGCC GCATCGACCT CGCGCCCGAC
GGCGAGGTCG CCGCCATGCT CGCCGCGCCG CTGTTCGGCG AGCGCCGGCT GGCTTTCTGC
CGGGTGCGCG CCGGTGCGCA GGCCGAATTC GAAGCGTGGG CTGCGGACGA GCTGCGTGGC
AAGGCGGTGG CGGTGCGCGG CGAAGACTTT CTCGCCGCCG GTCTGCTCGG CCCGGGTCAG
GTGCATCCGC GGCTGTCCGA ACGCCTGGGC AGCCACGCGC TACTGATGGA GGCCGGGTGG
ACGATCGTGG ATCACGTGGC GGGCGAGCAC GAGCACACCA TGATCGGCGT GCATGGTGGC
CTCAGCGCGG ACGAGATGCG CGTGCCGCTG ATGCTGGCAC GTACCTGA
 
Protein sequence
MPFTIDSTLP LPAGAVLPDY GDGGLYGFAR GLRHWLHDRK AGWPAVEVAP GERALVVLLV 
IDGLGERFLD TVGWGSALHA AKHAGLSSVC PSTTASAITT LATGVAPVEH GLNGWFIHDR
RFGGVIAPLP LIRRSGEPLE AFRLLPRLFP VAPMYRHACR PVTLVSPVQI AFSRFSLHHG
RGAHIEPYEG LQDYVAAIVD MADALAHSGG LIHAYYPVFD MLSHQHGCRS AEAVACFTRV
DAAFVSLQQA LAGRDVRLLV TADHGFIDAP PERRIDLAPD GEVAAMLAAP LFGERRLAFC
RVRAGAQAEF EAWAADELRG KAVAVRGEDF LAAGLLGPGQ VHPRLSERLG SHALLMEAGW
TIVDHVAGEH EHTMIGVHGG LSADEMRVPL MLART
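
The two sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the method used to generate this record. It assumes the gene sequence is pasted as a string, with or without the spaces between 10-base blocks. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in the permitted start codons). The names `gc_content` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCCTTCA CCATCGACTC CACCTTGCCC"))  # MPFTIDSTLP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the rounded GC content field.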