Gene Tmz1t_1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1721 
Symbol 
ID7084141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1937779 
End bp1939098 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content69% 
IMG OID643698740 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_002355370 
Protein GI217970136 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.474668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCGAAG CCGTAGACCT GCTGATCAAC GCCCGCTGGA TCGTCCCCGT CGAACCGCGC 
GACGCTGTGC TCGCCGACCA CGCGGTGGCG ATCCGCGGCG GACGCATCGT CGATCTCCTC
GGCCAGGACG AGGCGCGCAC GCGCTACGTC GCGGACGAGG TGGTCGAGCT GCCGCGGCAC
CTGCTGATCC CCGGCCTGGT GAACCTGCAC ACCCACGCCG CGATGAGCCT GCTGCGCGGC
ATCGCCGACG ACCTGCCGCT GATGCGCTGG CTGGAGGAGG CGATCTGGCC GGCCGAGAGC
CGTCACGTTT CCGCCGCCTT CGTGCGCGAC GGCACCCTGC TCGCCGCGGC CGAGATGATC
CGTGGCGGCA TCACCACCTG CAGCGACATG TACTTCCACC CCGAGGCGGC GGCGGAAGCC
TTCGCCGCCG CGGGCATGCG CGCGGTGGTC GGCGCGGTCG TGCTGGAATT CCCCACCTCG
TACGCGAGCG ACCCCGAGGA CTACCTGCGC AAGGGCCTGG CCGCCCGCGA CCGCTGGCAG
GGCCACCCGC GGCTCGGCTT CTCGATCGCC CCGCACGCGC CCTACACCGT CTCGGACGAC
AGCTTCCACC AGGTGCAGAC CCTCGCGGAC GAACTCGGCC TGCCGATCCA TGTGCACATC
CACGAGACCG CACAAGAGAT CGCCGACTCC CTCGCCGTCC ACGGCTGCCG GCCGCTCGCT
CGCCTCGCGC GCCTCGGCGT GCTCGGCAGC AATCTGATCG GCGTCCATGC GGTGCATCTG
GACGAGGCCG ACATCGAACT GCTCGCCCGC CACGGCTGCA GCGTCGCGCA CTGCCCCACC
TCGAACATGA AGCTCGCCAG CGGCATCGCG CCGGTGCCGC GCCTGCTCGC CGCCGGCGTC
CCGGTCGGCC TCGGCACGGA CGGCGCGGCG AGCAACAACC GCCTCGACCT GCTCCAGGAG
ATGCGCCACG CCGCGCTGCT CGCCAAGGTC GGCAGCCTGG ATGCCACCGC GGTACCGGCG
CATGCCGCCT TGCGCATGGC CACGCTCGGG GGCGCGCGCG CATTGGGCAT GGACGATCGC
ATCGGCTCGA TCGAAAAAGG CAAATGCGCC GATCTTTGCG CACTCGACCT TTCCGCACCG
CAATGCCGGC CCTGTTTCGA TCCGGTGTCG CATCTCGTCT ACGTATGCGG TCGCGAAAAC
GTCTCCCACG TGTGGATCGA CGGCGAAACC CGCGTGGACA AAGGCGTCTC GCTGTTGCAT
ATTAACGACA CCGAATTGCT CCGGCTCGTG TCGATGTGGC AAACTAAGCT CGGTAATTGA
 
Protein sequence
MSEAVDLLIN ARWIVPVEPR DAVLADHAVA IRGGRIVDLL GQDEARTRYV ADEVVELPRH 
LLIPGLVNLH THAAMSLLRG IADDLPLMRW LEEAIWPAES RHVSAAFVRD GTLLAAAEMI
RGGITTCSDM YFHPEAAAEA FAAAGMRAVV GAVVLEFPTS YASDPEDYLR KGLAARDRWQ
GHPRLGFSIA PHAPYTVSDD SFHQVQTLAD ELGLPIHVHI HETAQEIADS LAVHGCRPLA
RLARLGVLGS NLIGVHAVHL DEADIELLAR HGCSVAHCPT SNMKLASGIA PVPRLLAAGV
PVGLGTDGAA SNNRLDLLQE MRHAALLAKV GSLDATAVPA HAALRMATLG GARALGMDDR
IGSIEKGKCA DLCALDLSAP QCRPCFDPVS HLVYVCGREN VSHVWIDGET RVDKGVSLLH
INDTELLRLV SMWQTKLGN