Gene Noc_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1204 
Symbol 
ID3706703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1310715 
End bp1312358 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content47% 
IMG OID637737706 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_343235 
Protein GI77164710 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.340549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACCT CACAGCCAAC CGGGTGGGTG TTTTGTCGAT TCGGAGATAT AGCGCGAATT 
CGAAATGGGT ATGCATTCAG GAGCTCTGCC TTCAAGAAAA CCAAGACGCA TGATTGCGAC
GTTCCATTGA TCAGGCAATC TCAGCTAATT GGGACTGCAG TCAATATTGG CGAGGCGGTG
TATCTTCCAG CCGAATATCT AGAACGATTC GCTCAGTATG TAATCAACAA GGGAGATATT
CTTATCGGAA TGTCTGGTGC CATTGGGAAA GTGTGCCGTT ATAAAAACGG TTTTCCAGCA
CTCCAAAATC AGCGGACAGG GAAGATTGAA GTATTTGACG AATCTCAAAT GGATTCACGT
TTTTTTGGCC TTTATCTGTC AAGTATTGAA GGTGAGCTAA TTCGCCAGGC AAAAGGCATG
GCCGTTCAAA ATATCAGTGC AAAAGATATT GAAGCACTGC CGCTAGGATT ACCGCCGTAC
AACGAACAAC AGCGCATCGT CGCTAAAATC GAGGAACTCT TCTCCGAGCT GGATAAAGGC
ATCGAAAGCC TCAAGACCGC CCGCGAGCAG TTGAAGGTCT ATCGCCAAGC AGTGCTCAAA
CATGCCTTCG AGGGCAAGCT CACTGCCCAA TGGCGCGAGG AGAACAAGGA CAAACTGGAG
TCGCCCGAGC AGCTCCTTGC CCGCATCCAG CAGGAGCGTG AAGCCCGCTA CCAGCAGCAG
CTGGAGGAAT GGAAGGCGGC TGTTAAAGCG TGGGAGGCAA CGGGGAAAGA GGGGAAGAAG
CCGGGGAAGC CTAAGAAATC TTTAGCTATA AAAATTAACA GTTTCAAAAT ACCTAAAAAT
TTCCCAAATG GCTGGATCAG TATTCAACTC CGAGAGTTGT TTGAATCAAC TCAAAATGGA
TTAGCGAAGC GACAAGGTAC CTCGGGTAAA CCGATTCCTG TAATTAGGTT GGCTGATATT
AAAAATCAAG AAGTAGACAG TTCAGATTTG AGGTCTATAA AACTCGATGC TACAGAAATC
CAGAAGTATG AGCTTAGCAG GAATGATCTT TTATGTATCC GAGTAAATGG CAGCCCGAAC
CTAGTGGGGC GAATGATACT GTTTAAACAC GATAATGTAA TGGCCTACTG CGATCATTTC
ATTCGGTTTC GGTTCCCACA GGGAATAGTG CTACCAAGCT ATATTCAAAT GCTTTTCGAT
ACCCAGACTG TTCGTCGCTA CATTGAGTTA AACAAAGTAT CAAGTGCGGG ACAGAATACG
GTAAGCCAAA CGACCATTAG CGCTCTTGCA ATTCCTTATT GTTCTCTAAT GGAGCAAAAA
ATAATTGTAT CGAGGTTGGA AGAACAGCTT ACGTCAATAT CAGCAGTCAA GGTAGAAATA
GAGGAGAATT TTCAAAGATT AAAATCCCTC CGCCAATCCA TCCTCAAAAA AGCTTTTTCC
GGCCAGCTTG TTCCCCAGGA CCCCAAAGAC GAACCCGCCT CCAAGCTGCT GGAACGTATT
CGCGCCGAGA AGGAAAAAAT ACCCCACCCA ACGCGGCGTA CCCGGAAGCC GACGGCAAGC
CGGAATCCCA GCAGGAAAAC CGGGATCAAG GAACGTTCAG ACGAGGCGCT CATGGGCAGA
TCGAGCATAA AGGAAAGGCG ATGA
 
Protein sequence
MQTSQPTGWV FCRFGDIARI RNGYAFRSSA FKKTKTHDCD VPLIRQSQLI GTAVNIGEAV 
YLPAEYLERF AQYVINKGDI LIGMSGAIGK VCRYKNGFPA LQNQRTGKIE VFDESQMDSR
FFGLYLSSIE GELIRQAKGM AVQNISAKDI EALPLGLPPY NEQQRIVAKI EELFSELDKG
IESLKTAREQ LKVYRQAVLK HAFEGKLTAQ WREENKDKLE SPEQLLARIQ QEREARYQQQ
LEEWKAAVKA WEATGKEGKK PGKPKKSLAI KINSFKIPKN FPNGWISIQL RELFESTQNG
LAKRQGTSGK PIPVIRLADI KNQEVDSSDL RSIKLDATEI QKYELSRNDL LCIRVNGSPN
LVGRMILFKH DNVMAYCDHF IRFRFPQGIV LPSYIQMLFD TQTVRRYIEL NKVSSAGQNT
VSQTTISALA IPYCSLMEQK IIVSRLEEQL TSISAVKVEI EENFQRLKSL RQSILKKAFS
GQLVPQDPKD EPASKLLERI RAEKEKIPHP TRRTRKPTAS RNPSRKTGIK ERSDEALMGR
SSIKERR