Gene Noc_2904 details


Gene Information

Locus tag: Noc_2904
Symbol:
ID: 3707421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3284539
End bp: 3285816
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 46%
IMG OID: 637739381
Product: restriction modification system DNA specificity subunit
Protein accession: YP_344880
Protein GI: 77166355
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
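
The length fields in this record follow from its coordinates. As a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length (both assumptions agree with the values listed here, but the record does not state them), the arithmetic can be checked as follows. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3284539, 3285816)
print(length_bp)                  # 1278
print(protein_length(length_bp))  # 425
```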


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATA CGGTTCCTGA GGGGTGGGAA GTTAAGCCGC TAGGAAAACT CGTAGACGTT 
CGATCTAGTA ATATTGACAA GAAGACTGAA ACGTCGGAAA TCCCGGTTCG TTTGTGTAAC
TACACTGATG TGTATTACAA CAACAGGATC ACGTCTGCAA TTGATTTTAT GGCGGCGAGT
GCGAAACAGC GGGAAATAGA CCGCTTCTCG CTAGAAAAAG GAGATGTGAT AATCACGAAG
GATTCTGAAA CTCCTGATGA CATAGCAGTC CCATCGTATG TGAGTGATGA TCTTTCTGGG
GTGGTTTGTG GCTATCATTT AACCTTATTG AAGCCAGATC AAGATGAATC CGACGGTGAA
TTCCTTTCCC ATCTATTCCA GTTGCCAAGC GTTCAGCACT ACTTTTACAT ACTGGCAAAT
GGAATAACTC GCTTTGGTCT GACTGCGGAT GCTATCAATG AGGCCCCACT TCTCACGCCC
CCTCTCCCCG AACAACAAAA AATCGCCGCC ATCCTGTCCT CCGTCGATGA CGTGATTGAA
AAAACACGCG CCCAGATCCA CAAGCTGAAA GATCTGAAAA CCGCCATGAT GCAGGAATTG
TTGACCAAAG GGATTGGGCA CACGGAATTC AAGGACTCGC CGGTGGGAAG GATTCCGGTG
GGGTGGAGTA TTTGCAGCGC GGGGGAAGTC GCTGTTGCCA TAATGGTTGG GGTCGTCGTT
AAACCAGCGC AATACTATGT TGAATCAGGC GTTCCTGCAT TGCGCTCCGC AAATGTTCGT
GAAAACGGTT TAACCATGGA TAACTTGAAA TATTTTTCAG AAGACTCAAA TGAAATACTC
AAAAAAAGCC GGCTAATAAA GGGTGACCTT TTGACAGTCA GAACAGGTTA TCCCGGCACG
ACAGCGGTAG TTACTGATGA ATTTGAAGGC TGTAACTGCA TAGATGTTGT CATTACTCGT
CCATCTTCGC GTATTGACTC AGACTTTTTT TGTTTATGGG TGAATTCTGA CCACGGAAAA
GGGCAAGTCT TGAAGGCACA AGGTGGACTT GCTCAGCAGC ACTTTAACGT CAGTGATATG
AAAAACCTTA CAGTGGTAGT TCCTTCACTA ACTGAGCAAA AAGCTATCTT CAATGCTGTT
AATTCAGTAA CTAAGAAAAT AGCCTTAACT GAAAAACGCC TTACTCTCTT GCTCGATACC
AAAAAAGCCC TGATGCAAGA CCTGCTCACC GGCAAAGTCC GCGTCAACGT CGAACAAGAG
GAACCAGTGA TCGCCTGA
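
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the record. It assumes the sequence contains only A, C, G and T; the helper names are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGAGTGATA CGGTTCCTGA GGGGTGGGAA GTTAAGCCGC TAGGAAAACT CGTAGACGTT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this first line only
```

Passing the full gene sequence to `clean_sequence` and then to `gc_percent` gives the figure for the whole gene.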
 
Protein sequence
MSDTVPEGWE VKPLGKLVDV RSSNIDKKTE TSEIPVRLCN YTDVYYNNRI TSAIDFMAAS 
AKQREIDRFS LEKGDVIITK DSETPDDIAV PSYVSDDLSG VVCGYHLTLL KPDQDESDGE
FLSHLFQLPS VQHYFYILAN GITRFGLTAD AINEAPLLTP PLPEQQKIAA ILSSVDDVIE
KTRAQIHKLK DLKTAMMQEL LTKGIGHTEF KDSPVGRIPV GWSICSAGEV AVAIMVGVVV
KPAQYYVESG VPALRSANVR ENGLTMDNLK YFSEDSNEIL KKSRLIKGDL LTVRTGYPGT
TAVVTDEFEG CNCIDVVITR PSSRIDSDFF CLWVNSDHGK GQVLKAQGGL AQQHFNVSDM
KNLTVVVPSL TEQKAIFNAV NSVTKKIALT EKRLTLLLDT KKALMQDLLT GKVRVNVEQE
EPVIA
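
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the pipeline that produced this record. It assumes that translation table 11 assigns the same amino acids as the standard genetic code (the two differ in their permitted start codons), that the CDS starts in frame, and that the sequence contains only A, C, G and T. It does not handle alternative start codons, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (first base varies slowest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code and differs only in which codons may act as start codons.
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks and the last display line of the gene sequence.
print(translate("ATGAGTGATA CGGTTCCTGA GGGGTGGGAA"))  # MSDTVPEGWE
print(translate("GAACCAGTGA TCGCCTGA"))               # EPVIA
```

The two outputs match the first and last blocks of the protein sequence listed in the record.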