Gene Nmul_A2187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2187 
Symbol 
ID3786212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2482949 
End bp2484295 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content56% 
IMG OID637812274 
ProductN-ethylammeline chlorohydrolase 
Protein accessionYP_412871 
Protein GI82703305 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.147574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGATC CGGTAAAAAT CGATACGCTG CTGGAGGCAC GCTGGATAAT CCCGGTTGAG 
CCCGCGGGCG CGGTGCTTCA TGGTCATGCC ATTGCAATTG ACCAGGGCAT GATTCGCGCA
ATCCTGCCCC GCGCCGAGGC ACACGTGCAA TTCGAGCCGC GCGAACGCGT CGTGATGAAC
AGCCACGCGC TCATTCCCGG CCTGATAAAT CTCCACACGC ATGCGGCCAT GTCGCTGATG
CGAGGAATGG CAGATGACCT GCCTTTGATG GAGTGGCTAA CCCATCATAT CTGGCCGGCG
GAAGCGAAGC ACGTGGACCA GGGTTTTGTT TTCGATGGTA CGCGTCTGGC ATGCGCCGAG
ATGCTGCAGG GTGGAGTTAC CTGTTTCAAT GACATGTACC TGTTTCCGGA AGCCGCTGCG
CGCGCCGCAT TGGCCGCGGG CATGCGTGCC AGCATCGGCA TGATCGCAAT CGACTTTCCC
ACTGCCTACG CCAGCGATCC TGACGACTAT CTGACCAAGG GTCTGGCGCT GCGGGACGAT
TACAATCCGC ATTCCCTCCT GTCATTCTGT TTTGCCCCAC ATGCGCCTTA CACCGTAGGT
GACAAGAATT TGTCCCGGGT TCTCACCTAT GCAGAGCAGT TGGATGTACC CATTCACATT
CATCTGCACG AAACCGGGGA TGAAATCGAC AACAGCCTGA AAAGCTATGG AATGCGTCCC
CTTGAGCGCA TTCACAAGCT CGGACTGCTC GGACCCAATC TGATTGCGGT ACACATGGTG
CATCTTACCG GGGGAGAAAT CGAACTGCTG GCGCAGCAAG GCTGTTCCGT AGCCCATTGC
CCTTCTTCCA ACCTTAAACA TGCGAGTGGG CTGGCACCCG TTGCCGCTCT TATAGAAGCA
GGCGTCAACG TAGGATTGGG AACGGATAGC GCTGCAAGTA ACAGTCGGCT CAAGATGTTC
GAGGAAATGC GGCTTGCGGC ACTCCTAGCA AAAGGACAAA GCGGGAGGGC AGAAGTGCTG
CCTGCATGGC AGGTGCTGCA AATGGCCACG CTCAACGGGG CCAGGGCTTT AGGGTTGGGA
GACCGTATCG GTTCCCTTGT TCCAGGCAAA GCTGCGGATA TTGCCGCCGT GGATTTCTCC
AGTCTTGATA TGGCGCCCTG CTATGACCCT GTTTCCCATC TTGTCTATGC TGCCGGACGT
GAACATGTGA GTCATGTGTG GGTAAATGGT AAAATGCTGC TGCGTGACTC GGAATTGACT
ACGCTGGACC GGGAAGAATT GGTGCACAGG GCTGAATTCT GGCGGGAACA AATGACAACG
GGCGTGATAG CAGTTCACGA AAAATGA
 
Protein sequence
MMDPVKIDTL LEARWIIPVE PAGAVLHGHA IAIDQGMIRA ILPRAEAHVQ FEPRERVVMN 
SHALIPGLIN LHTHAAMSLM RGMADDLPLM EWLTHHIWPA EAKHVDQGFV FDGTRLACAE
MLQGGVTCFN DMYLFPEAAA RAALAAGMRA SIGMIAIDFP TAYASDPDDY LTKGLALRDD
YNPHSLLSFC FAPHAPYTVG DKNLSRVLTY AEQLDVPIHI HLHETGDEID NSLKSYGMRP
LERIHKLGLL GPNLIAVHMV HLTGGEIELL AQQGCSVAHC PSSNLKHASG LAPVAALIEA
GVNVGLGTDS AASNSRLKMF EEMRLAALLA KGQSGRAEVL PAWQVLQMAT LNGARALGLG
DRIGSLVPGK AADIAAVDFS SLDMAPCYDP VSHLVYAAGR EHVSHVWVNG KMLLRDSELT
TLDREELVHR AEFWREQMTT GVIAVHEK