Gene Noc_2821 details


Gene Information

Locus tag: Noc_2821
Symbol:
ID: 3705570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3197544
End bp: 3198593
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 52%
IMG OID: 637739297
Product: membrane-bound metal-dependent hydrolase
Protein accession: YP_344798
Protein GI: 77166273
COG category: [R] General function prediction only
COG ID: [COG1988] Predicted membrane-bound metal-dependent hydrolases
TIGRFAM ID:
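
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3197544
end_bp = 3198593
gene_length_bp = 1050
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends with one
# stop codon that is not counted in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span (bp):", span_bp)
print("codons:", codon_count)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus one stop codon matches the listed protein length.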


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0774774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACAC TCACCCATGC TTTAAGTGGT GCGCTGCTAG CTCGCGCCAC TGCCTCTTCC 
AAGCAACCAA CCGGCCAGCT CAGGGTAGGA GAGCGAGTGC TCCTGGGCGC CTTGGCTGCG
ACTTTTCCGG ATAGCGATTT TATTCTCCGT TGGACTACCG ACCTGCTTAC TTATCTTAAT
CTGCATCGAG GAATTACCCA TTCCGTAGTC ATGCTTCCCA TATGGGGAGC GCTGCTAGCT
ACCCTATTTT GGCGGCTTCG GGGAAAGCAA AAACCTTGGC AAGTATATTT CGGCGTGTCC
CTATTAGGAA TAAGCATTCA TATCGCCGGA GATGTTATCA CCGCTTATGG CACCCAAATT
TTCGCCCCTC TCTCCAATTA CAAAGCAGCT TGGCCCACTA CTTTCGTCAT TGACCCTTGG
TTTACCGGTA TTATTGTCAT TGGATTATTG GGCTGCTGGT ATTGGCGCCA TTCCCGTCTA
CCCGCAGTAA TAGGCTTAGC TATATTGGCA GTCTATGTGG GATTTCAGGG GATGCTTCGA
GCGCAAGCGC TAGCATTGGG TCATGAGTAT GCCCGCCAAC AATCACTAGC CAACATCCGA
GTTCATGCTC TACCTCAGCC CCTGTCCCCT TTTAATTGGA AAATTGTGGT AGCGGCGCCT
CAAAAATATT ACGTTAGCCA GGTTAACCTT CTGCGCAAAC AGATCCCGGT TCCTGCGCTG
CCCAATTCCC CTTTTTGGGT TGGCCTTTAT ACAGCCTATC CGCCGCTAAC TGCTATGAAG
TGGACGCAGT ACCAACGGTA CGGGAATTCC AATGAAGAAT CTTTGGCCCG CACCGTTTGG
CAGCAGGAGA TTTTGCAAGG TTACCGCCAA TTTGCGCAGC TCCCAACTCT TTATGCCATC
GACCGGAAGG CTGGCCGCCT GTGCGTCTGG TTTGTAGATT TACGCTTTAT TCTCCCTAAC
CTTATTCCAC CGTTCCGTTA TGGAGGTTGC CGCCAAGCGC AAGACTCTGG TTGGGAGCTA
CGCCAATTAC CCGGAACTCC TGGCGCTTAG
 
Protein sequence
MDTLTHALSG ALLARATASS KQPTGQLRVG ERVLLGALAA TFPDSDFILR WTTDLLTYLN 
LHRGITHSVV MLPIWGALLA TLFWRLRGKQ KPWQVYFGVS LLGISIHIAG DVITAYGTQI
FAPLSNYKAA WPTTFVIDPW FTGIIVIGLL GCWYWRHSRL PAVIGLAILA VYVGFQGMLR
AQALALGHEY ARQQSLANIR VHALPQPLSP FNWKIVVAAP QKYYVSQVNL LRKQIPVPAL
PNSPFWVGLY TAYPPLTAMK WTQYQRYGNS NEESLARTVW QQEILQGYRQ FAQLPTLYAI
DRKAGRLCVW FVDLRFILPN LIPPFRYGGC RQAQDSGWEL RQLPGTPGA
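
The gene and protein sequences above can be related to each other, and to the listed GC content, with a short script. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). Only the first three 10-base blocks of the gene sequence are used as a demonstration; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Translation table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
fragment = "ATGGACACAC TCACCCATGC TTTAAGTGGT"
print(translate(fragment))
print(round(gc_percent(fragment), 1))
```

The translated fragment should correspond to the first block of the protein sequence. Running `gc_percent` on the full gene sequence gives a value to compare against the GC content listed in the record.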