Gene Noc_2295 details

Gene Information

Locus tag: Noc_2295
Symbol:
ID: 3704501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2647658
End bp: 2648986
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 47%
IMG OID: 637738774
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_344283
Protein GI: 77165758
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
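
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon. Neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2647658
end_bp = 2648986

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last one being a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # the stop codon is not translated

print(gene_length, "bp")     # 1329 bp
print(protein_length, "aa")  # 442 aa
```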


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.853343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAACA TTAAAACGTT AATCCATGCC CGCTGGGTTA TTCCCGTTAT TCCAGAAGGA 
CAAATACTGG AAAACCATAG CCTTGCCATC TACCAAGGGC GTATTGTGGA TTGTCTCCCA
AGAATAGAAG CGGAAACCCG CTACCGAGAC GCTCACCAAA TTGAGCTTAC CCAGCATGCA
CTCATACCCG GTCTTATTAA TGCCCATATC CATAGTCCCA TGTCTCTCCT ACGCGGTCTC
GCTGACGACC TCCCCCTTAT GGAGTGGCTT GAAAAGCATA TCTGGCCAGC CGAAACCAAA
TGGGTAAGCG AAACCTTTGT GCGTGATGGC GCCCTTCTAG CGATTGCTGA AATGCTCCGT
GGAGGCATTA CTTGCTTTAA CGATATGTAT TTTTTCCCCG AAGTCGTAGC GCAGGCGGCA
GTCGAAGCTA ACATGCGCGC TGTTATTGGC ATGATTGTCA TTGATTTTCC CAGCAGATGG
GCAAAAACCC CCGAGGATTA TCTCCGCAAA GGGTTAGAGT TAAATGATAA CTATCAGAAT
CATCCTCTTA TCAAAACGGC CTTTGCTCCC CATGCCCCTT ATACTGTCAG CGATGAATCC
TTAACGCAAG TCGCTATTTT ATCTAAGGAA CTAAATATTC CAGTCCATAT GCATATCCAT
GAGACGGTGG AGGAAATTAA TAGGAGTATA ACCCAATATG GCATGCGACC CTTAGGGCGG
TTACAGCGCC TAGGACTACT TTCATCCCGG CTCTTAGCCG TCCATATGAC TCAACTTACC
GATCAAGAGT TTCAGACGAT AACGAAGCAT GGAATTCATA TCGTTCATTG TCCAGAGTCT
AATCTCAAAC TTGCCAGCGG CTTTTGTCCA GTAGCTAAAC TATATCAAGC GGGAATTAAT
ATTGCTTTAG GTACCGATAG TGCAGCAAGC AATAACGATT TGGATATGTT TGTGGAAATG
CGCCTTACCG CTTTGTTGGC TAAAGCATTA GCTAGCGACG CTAGCGCCAT TCCCGCAAAG
CAGGCGCTTC GCATGGCTAC CTTAAATGGT GCCCAAGCGT TAGGCTTGGA GCAAGAAATT
GGATCCCTGG AAATCGGTAA GATAGCTGAC ATAGTCGCCG TCGATTTGGG TGGGCTAGAA
ACACAGCCGC TCTATGATCC CATTTCCCAA TTAGTCTATA CGGCAGGCCG CGATAAGGTC
AGCGATGTTT GGATAGCCGG CCAACAGGTC CTTAAGAGGC GCCAATTTAC AACTTTGGAT
GAACGATTGC TTCTATCCCG AACCCAAGCC TGGGCGGAAA GAATAAAAGA ATCAAGGAAA
TTCACATGA
 
Protein sequence
MQNIKTLIHA RWVIPVIPEG QILENHSLAI YQGRIVDCLP RIEAETRYRD AHQIELTQHA 
LIPGLINAHI HSPMSLLRGL ADDLPLMEWL EKHIWPAETK WVSETFVRDG ALLAIAEMLR
GGITCFNDMY FFPEVVAQAA VEANMRAVIG MIVIDFPSRW AKTPEDYLRK GLELNDNYQN
HPLIKTAFAP HAPYTVSDES LTQVAILSKE LNIPVHMHIH ETVEEINRSI TQYGMRPLGR
LQRLGLLSSR LLAVHMTQLT DQEFQTITKH GIHIVHCPES NLKLASGFCP VAKLYQAGIN
IALGTDSAAS NNDLDMFVEM RLTALLAKAL ASDASAIPAK QALRMATLNG AQALGLEQEI
GSLEIGKIAD IVAVDLGGLE TQPLYDPISQ LVYTAGRDKV SDVWIAGQQV LKRRQFTTLD
ERLLLSRTQA WAERIKESRK FT
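
As a sanity check on the two sequences above, the following sketch translates short fragments of the gene sequence using the amino acid assignments of NCBI translation table 11 (these assignments match the standard code; the table differs in its permitted start codons). The function and variable names are illustrative and not part of the database record. Only the first 60 bases and the last 9 bases of the gene sequence are copied in, with the spacing between 10-base blocks removed.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence (60 bases) and its final 9 bases.
first_60 = "ATGCAGAACA TTAAAACGTT AATCCATGCC CGCTGGGTTA TTCCCGTTAT TCCAGAAGGA"
last_9 = "TTCACATGA"

print(translate(first_60))  # MQNIKTLIHARWVIPVIPEG
print(translate(last_9))    # FT*
```

The two outputs agree with the first 20 residues and the last 2 residues of the protein sequence listed above, with the final TGA codon acting as the stop.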