Gene Information |
Locus tag | Tmz1t_1721 |
Symbol | |
ID | 7084141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1937779 |
End bp | 1939098 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698740 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_002355370 |
Protein GI | 217970136 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.474668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGAAG CCGTAGACCT GCTGATCAAC GCCCGCTGGA TCGTCCCCGT CGAACCGCGC GACGCTGTGC TCGCCGACCA CGCGGTGGCG ATCCGCGGCG GACGCATCGT CGATCTCCTC GGCCAGGACG AGGCGCGCAC GCGCTACGTC GCGGACGAGG TGGTCGAGCT GCCGCGGCAC CTGCTGATCC CCGGCCTGGT GAACCTGCAC ACCCACGCCG CGATGAGCCT GCTGCGCGGC ATCGCCGACG ACCTGCCGCT GATGCGCTGG CTGGAGGAGG CGATCTGGCC GGCCGAGAGC CGTCACGTTT CCGCCGCCTT CGTGCGCGAC GGCACCCTGC TCGCCGCGGC CGAGATGATC CGTGGCGGCA TCACCACCTG CAGCGACATG TACTTCCACC CCGAGGCGGC GGCGGAAGCC TTCGCCGCCG CGGGCATGCG CGCGGTGGTC GGCGCGGTCG TGCTGGAATT CCCCACCTCG TACGCGAGCG ACCCCGAGGA CTACCTGCGC AAGGGCCTGG CCGCCCGCGA CCGCTGGCAG GGCCACCCGC GGCTCGGCTT CTCGATCGCC CCGCACGCGC CCTACACCGT CTCGGACGAC AGCTTCCACC AGGTGCAGAC CCTCGCGGAC GAACTCGGCC TGCCGATCCA TGTGCACATC CACGAGACCG CACAAGAGAT CGCCGACTCC CTCGCCGTCC ACGGCTGCCG GCCGCTCGCT CGCCTCGCGC GCCTCGGCGT GCTCGGCAGC AATCTGATCG GCGTCCATGC GGTGCATCTG GACGAGGCCG ACATCGAACT GCTCGCCCGC CACGGCTGCA GCGTCGCGCA CTGCCCCACC TCGAACATGA AGCTCGCCAG CGGCATCGCG CCGGTGCCGC GCCTGCTCGC CGCCGGCGTC CCGGTCGGCC TCGGCACGGA CGGCGCGGCG AGCAACAACC GCCTCGACCT GCTCCAGGAG ATGCGCCACG CCGCGCTGCT CGCCAAGGTC GGCAGCCTGG ATGCCACCGC GGTACCGGCG CATGCCGCCT TGCGCATGGC CACGCTCGGG GGCGCGCGCG CATTGGGCAT GGACGATCGC ATCGGCTCGA TCGAAAAAGG CAAATGCGCC GATCTTTGCG CACTCGACCT TTCCGCACCG CAATGCCGGC CCTGTTTCGA TCCGGTGTCG CATCTCGTCT ACGTATGCGG TCGCGAAAAC GTCTCCCACG TGTGGATCGA CGGCGAAACC CGCGTGGACA AAGGCGTCTC GCTGTTGCAT ATTAACGACA CCGAATTGCT CCGGCTCGTG TCGATGTGGC AAACTAAGCT CGGTAATTGA
|
Protein sequence | MSEAVDLLIN ARWIVPVEPR DAVLADHAVA IRGGRIVDLL GQDEARTRYV ADEVVELPRH LLIPGLVNLH THAAMSLLRG IADDLPLMRW LEEAIWPAES RHVSAAFVRD GTLLAAAEMI RGGITTCSDM YFHPEAAAEA FAAAGMRAVV GAVVLEFPTS YASDPEDYLR KGLAARDRWQ GHPRLGFSIA PHAPYTVSDD SFHQVQTLAD ELGLPIHVHI HETAQEIADS LAVHGCRPLA RLARLGVLGS NLIGVHAVHL DEADIELLAR HGCSVAHCPT SNMKLASGIA PVPRLLAAGV PVGLGTDGAA SNNRLDLLQE MRHAALLAKV GSLDATAVPA HAALRMATLG GARALGMDDR IGSIEKGKCA DLCALDLSAP QCRPCFDPVS HLVYVCGREN VSHVWIDGET RVDKGVSLLH INDTELLRLV SMWQTKLGN
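
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical and not part of any database API. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring the whitespace used to group the sequence."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table of this record.
length = gene_length(1937779, 1939098)
print(length)                  # 1320, matches "Gene Length"
print(protein_length(length))  # 439, matches "Protein Length"
```

Passing the full gene sequence string to `gc_percent` should give a value comparable to the GC content field, subject to how the database rounds it.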