Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2187 |
Symbol | |
ID | 3786212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2482949 |
End bp | 2484295 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812274 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_412871 |
Protein GI | 82703305 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.147574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATC CGGTAAAAAT CGATACGCTG CTGGAGGCAC GCTGGATAAT CCCGGTTGAG CCCGCGGGCG CGGTGCTTCA TGGTCATGCC ATTGCAATTG ACCAGGGCAT GATTCGCGCA ATCCTGCCCC GCGCCGAGGC ACACGTGCAA TTCGAGCCGC GCGAACGCGT CGTGATGAAC AGCCACGCGC TCATTCCCGG CCTGATAAAT CTCCACACGC ATGCGGCCAT GTCGCTGATG CGAGGAATGG CAGATGACCT GCCTTTGATG GAGTGGCTAA CCCATCATAT CTGGCCGGCG GAAGCGAAGC ACGTGGACCA GGGTTTTGTT TTCGATGGTA CGCGTCTGGC ATGCGCCGAG ATGCTGCAGG GTGGAGTTAC CTGTTTCAAT GACATGTACC TGTTTCCGGA AGCCGCTGCG CGCGCCGCAT TGGCCGCGGG CATGCGTGCC AGCATCGGCA TGATCGCAAT CGACTTTCCC ACTGCCTACG CCAGCGATCC TGACGACTAT CTGACCAAGG GTCTGGCGCT GCGGGACGAT TACAATCCGC ATTCCCTCCT GTCATTCTGT TTTGCCCCAC ATGCGCCTTA CACCGTAGGT GACAAGAATT TGTCCCGGGT TCTCACCTAT GCAGAGCAGT TGGATGTACC CATTCACATT CATCTGCACG AAACCGGGGA TGAAATCGAC AACAGCCTGA AAAGCTATGG AATGCGTCCC CTTGAGCGCA TTCACAAGCT CGGACTGCTC GGACCCAATC TGATTGCGGT ACACATGGTG CATCTTACCG GGGGAGAAAT CGAACTGCTG GCGCAGCAAG GCTGTTCCGT AGCCCATTGC CCTTCTTCCA ACCTTAAACA TGCGAGTGGG CTGGCACCCG TTGCCGCTCT TATAGAAGCA GGCGTCAACG TAGGATTGGG AACGGATAGC GCTGCAAGTA ACAGTCGGCT CAAGATGTTC GAGGAAATGC GGCTTGCGGC ACTCCTAGCA AAAGGACAAA GCGGGAGGGC AGAAGTGCTG CCTGCATGGC AGGTGCTGCA AATGGCCACG CTCAACGGGG CCAGGGCTTT AGGGTTGGGA GACCGTATCG GTTCCCTTGT TCCAGGCAAA GCTGCGGATA TTGCCGCCGT GGATTTCTCC AGTCTTGATA TGGCGCCCTG CTATGACCCT GTTTCCCATC TTGTCTATGC TGCCGGACGT GAACATGTGA GTCATGTGTG GGTAAATGGT AAAATGCTGC TGCGTGACTC GGAATTGACT ACGCTGGACC GGGAAGAATT GGTGCACAGG GCTGAATTCT GGCGGGAACA AATGACAACG GGCGTGATAG CAGTTCACGA AAAATGA
|
Protein sequence | MMDPVKIDTL LEARWIIPVE PAGAVLHGHA IAIDQGMIRA ILPRAEAHVQ FEPRERVVMN SHALIPGLIN LHTHAAMSLM RGMADDLPLM EWLTHHIWPA EAKHVDQGFV FDGTRLACAE MLQGGVTCFN DMYLFPEAAA RAALAAGMRA SIGMIAIDFP TAYASDPDDY LTKGLALRDD YNPHSLLSFC FAPHAPYTVG DKNLSRVLTY AEQLDVPIHI HLHETGDEID NSLKSYGMRP LERIHKLGLL GPNLIAVHMV HLTGGEIELL AQQGCSVAHC PSSNLKHASG LAPVAALIEA GVNVGLGTDS AASNSRLKMF EEMRLAALLA KGQSGRAEVL PAWQVLQMAT LNGARALGLG DRIGSLVPGK AADIAAVDFS SLDMAPCYDP VSHLVYAAGR EHVSHVWVNG KMLLRDSELT TLDREELVHR AEFWREQMTT GVIAVHEK
|
| |