Gene Noc_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2501 
Symbol 
ID3704386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2845776 
End bp2846966 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content51% 
IMG OID637738980 
Productmonooxygenase subunit B protein 
Protein accessionYP_344484 
Protein GI77165959 
COG category 
COG ID 
TIGRFAM ID[TIGR03079] methane monooxygenase/ammonia monooxygenase, subunit B 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000158949 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATAGCCT CAAGCGTTTT CTATATTCCG ACAGTAGCTG CCCATGGCGA GAAGGCGCAG 
GCAGCTTTCC TGCGCATGCG GACAATCCAT TGGTATGACA TGGTATGGTC CAAGGATACC
ATTGCGGTTA ATGAGACCTA TACCATAAGC GGGAAGTTCC GGGTTTTTGA GGATTGGCCG
GAAGCAGTCG AAAAACCCCA TGTATCCTTT TTAAATGCGG GTCAACCTGG TCCAGTCACG
GCTCGGCTTA CTTCCTACGT CAATGGTATG TTCGTCCCTC GTTCGATAGG TCTTGAATTG
GGCGGCGATT ACGAGTTTGA GATGACGATG CAAGGGCGCC GTCCTGGGAC GTGGCATGTT
CATACCTTGC TAAATGTCCA AGGAGGGGGG CCGCTCATCG GTCCAGGTAA ATACATCACC
ATTACCGGAG ATATGGCTGA TTTTGAGAGC AAAATCACGG ATCTGACCGG TAATACGGTC
AACCTGGAAA CCATGGCCAC GGGCACGGTT ATTGGTTGGC ATCTGTTCTG GTACGTTCTT
GGTATCGCCT GGATTTGGTG GTGGGCCCGC CGTCCCATGT TCTTGCCCCG CTACATGAGA
ATAGAGGCGG GCGAGGCTAA TGATCTAGTA ACTGCCCAGG ACAAAAAATT GACTATAGGC
GTTCTTGTGG GCGTCCTGCT CATTATTTTG TTCGGCTTCA AGAGTGCTGA GGATAAATTC
CCAGTCACCA TTCCGTTGCA GGCTGGGCTG CTGGGCACTA TTGACTCCTT GCCGGTGGAT
TATAATTCGA TGGTAAGCGC TAACGTGCTT AAGGCTAACT ATCGGGTGCC GGGGCGGACT
ATCAGCATGA CGGTTGAAAT CACTAACCAT ACTGACCAGG TGATTTCTAT TGGCGAGTTC
AATACTGGGG GCATTCGATT CATGAATGCA AATGTGCGGG TTGATGAGAC GGATTATCCT
GAGGAGTTGT TGGCACCGGA AGGGTTGGAA GTGAGTCAAC AGGATATCGC TCCAGGTGAA
ACCGTAGTTG TTGACATCTC CGCCACCGAT GCCGCCTGGG AAGTTCAGCG TATGGCCGAC
GTCATTTATG ATCCAGACAG CCGCTTTGCG GGCTTGATCT TCTTCGTTGA TCCAGAGGGG
AATGAGATTC CGATACCTAT CGGCGGTCCA TTAGTTCCCA CGTTTGTCTA G
 
Protein sequence
MIASSVFYIP TVAAHGEKAQ AAFLRMRTIH WYDMVWSKDT IAVNETYTIS GKFRVFEDWP 
EAVEKPHVSF LNAGQPGPVT ARLTSYVNGM FVPRSIGLEL GGDYEFEMTM QGRRPGTWHV
HTLLNVQGGG PLIGPGKYIT ITGDMADFES KITDLTGNTV NLETMATGTV IGWHLFWYVL
GIAWIWWWAR RPMFLPRYMR IEAGEANDLV TAQDKKLTIG VLVGVLLIIL FGFKSAEDKF
PVTIPLQAGL LGTIDSLPVD YNSMVSANVL KANYRVPGRT ISMTVEITNH TDQVISIGEF
NTGGIRFMNA NVRVDETDYP EELLAPEGLE VSQQDIAPGE TVVVDISATD AAWEVQRMAD
VIYDPDSRFA GLIFFVDPEG NEIPIPIGGP LVPTFV