Gene Noc_3028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3028 
Symbol 
ID3705775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3422932 
End bp3425322 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content50% 
IMG OID637739502 
Producthypothetical protein 
Protein accessionYP_345000 
Protein GI77166475 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.194288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AACAACTCAC GGAGCGAGAC ATCTGCACCA AGTTCATCAC ACCTGCGCTT 
GAACAGTCCG GATGGGATAT TGCTACTCAA ATACGTGAAG AATTCCCGCT GACCAAAGGG
CGAATTATCG TCCGTGGCAA ACTGCATACC CGTGCTAAAC ACAAACGTGC CGATTACGTA
CTTTTCTACA AGCCCAATAT TCCTATTGCT GTGATTGAAG CGAAAGACAA TAACCACAGC
CTGGGTGACG GTATGCAACA GGGCTTGGGT TATGCGGAGA TGCTTCAGGT TCCGTTTGTT
TTCAGTTCAA ACGGTGACGG CTTTCTGTTT CACAACAAAA TCGCCAAAGA CGGCATCATT
GAGCGCGAGT TAGCACTACA CGAATTCCCT TCGGCAGAGA CACTCTGGCA ATGGTGGGCA
GAGCACAAAG GGCTTGATCA ACAACAAAAC AAACTGGTGA CGCAAGATTA CTACAGTGAT
GGCAGTAACA AAACGCCGCG CTACTACCAG CTCTTGGCCA TTAACAAAAC CATCGAAGCC
ATTGCCAATG GCCAGAATCG CATACTACTG GTTATGGCGA CGGGCACAGG AAAAACTTTT
ACCGCCTTCC AAATCATCTG GCGCTTGTGG CGATCCAAAG CCAAAAAGCG CATCCTGTTT
TTGGCGGATC GCAATATTCT GGTCGATCAG ACCATGACCA ATGACTTCAA GCCCTTCGGT
TCGGCCATGA CCAAAATCCA GAAGCGCCAG GCTAACAAGT CATACGAAAT CTATCTCTCG
CTTTATCAGG CGGTTACCGG CAACGAAGAA AAGAAAAACA TTTACAAACA ATTCAGCCCG
GATTTTTTCG ACCTAATCGT CATTGATGAA TGCCACAGAG GCAGTGCCGC AGCCGACTCC
GCTTGGCGCG AAATCCTGGA GTATTTCTCC TCTGCCACCC AAATCGGCCT AACCGCCACA
CCGAAAGAAA CCAAAGAAGT GTCGAATATC GACTATTTTG GCGACCCGAT TTACACCTAC
AGCCTGCGTC AAGGCATTGA TGACGGTTTT TTGGCCCCCT ACAAGGTGGT GCGTATAGAT
CTCGACCGAG ATTTAACCGG CTGGCGGCCT GACAAAGGGA TGACCGATAA ACACGGTAAC
GAGATCGAAG ATCGTATCTA CAACCAGAAA GACTTTGATA AAACGCTGGT ACTAGAACAG
CGCACTCAAT TAGTCGCCAA GAAAATTACG GAATTTCTCA AGCAAACCAA TCGCTTTGAT
AAAGCCATCG TGTTTTGCGA AAACATCGAC CATGCCGAGC GTATGCGGCA GGCTTTGGTC
AACGAAAATG CCGATTTGGT CGCGCAAAAC AGCAAGTACA TCATGCGCAT AACCGGCGAC
AACGAAGAAG GCAAGGCCGA GCTAGACAAC TTTATTTTTC CTGAGAGCAA ATACCCGGTG
ATTGCCACGA CGTCCAAGTT AATGACCACA GGCGTCGATG CGCAAACCTG CAAATTGATC
GTGCTCGACC AACGCATCCA GTCCATGACG GAATTTAAGC AAATTATCGG GCGTGGCGCC
CGTATCAATG AAGATTACGG CAAGTTCTAC TTCACCATCA TCGACTTCAA AAAAGCCACC
GAACTCTTTG CTGATCCAGA CTTTGATGGC GACCCGGTAC AAGTGTATGA ACCCTCTGGA
TCTGAATCCC CCGTGCCGCC AGATATACCT TTAGAGGAAG TTGCAGGCGA GGCGGATAAA
GAAGGAATGA CATACCCCAA ACCCGGTGAT GATCGCGAGT GGAGTGGCAT TGCGGAGCCT
GATGATGAAG GCGGCGGTGT GCGTCGCTAT GTCGTGGCCA ATGTACCGGT CACAGTTGCG
GCTGAACGTG TACAGTATTT TGATGCTAAT GGCAAGCTCA TCACCGAATC ACTTAAAGAC
TACACCCGCA AGGCGGTGGC GAAAGAATAC GCCACACTCG ACGATTTCCT GCGCCGCTGG
AGCAGTGCCG AGAAGAAGCA GGCCATTATT AAAAAGCTGG CCGAGCACGG TGTGTTTTTT
GAAGCGCTGG CCGATGAAGT GGGCGAAAAA TCAGGCAAAG CCTTTGATCC TTTTGATCTG
GTGTGTCATA TCGCCTGGGA TATGCCGCCG CTAACCCGTA AAGAGCGAGC CGAGCAGGTT
AAAAAGCGCA ACTACTTTAC CCAGTACGGT GAGCAGGCTC GAAAGGTGTT AGAAGCTTTA
TTGGATAAAT ACGCCGATGA AGGCGTGGCG CAAATTGAAG AAACCCGAAT TCTTACTATC
GCGCCCTTTA CCCAATTTGG CACACCGCTG GAAATCATCC GCGCCTTCGG TGGTCCGGAT
CAGTACCAGC AAGCCGTAAA CGAACTCGAA CAGGCGCTTT ATAACGCCTG A
 
Protein sequence
MNKKQLTERD ICTKFITPAL EQSGWDIATQ IREEFPLTKG RIIVRGKLHT RAKHKRADYV 
LFYKPNIPIA VIEAKDNNHS LGDGMQQGLG YAEMLQVPFV FSSNGDGFLF HNKIAKDGII
ERELALHEFP SAETLWQWWA EHKGLDQQQN KLVTQDYYSD GSNKTPRYYQ LLAINKTIEA
IANGQNRILL VMATGTGKTF TAFQIIWRLW RSKAKKRILF LADRNILVDQ TMTNDFKPFG
SAMTKIQKRQ ANKSYEIYLS LYQAVTGNEE KKNIYKQFSP DFFDLIVIDE CHRGSAAADS
AWREILEYFS SATQIGLTAT PKETKEVSNI DYFGDPIYTY SLRQGIDDGF LAPYKVVRID
LDRDLTGWRP DKGMTDKHGN EIEDRIYNQK DFDKTLVLEQ RTQLVAKKIT EFLKQTNRFD
KAIVFCENID HAERMRQALV NENADLVAQN SKYIMRITGD NEEGKAELDN FIFPESKYPV
IATTSKLMTT GVDAQTCKLI VLDQRIQSMT EFKQIIGRGA RINEDYGKFY FTIIDFKKAT
ELFADPDFDG DPVQVYEPSG SESPVPPDIP LEEVAGEADK EGMTYPKPGD DREWSGIAEP
DDEGGGVRRY VVANVPVTVA AERVQYFDAN GKLITESLKD YTRKAVAKEY ATLDDFLRRW
SSAEKKQAII KKLAEHGVFF EALADEVGEK SGKAFDPFDL VCHIAWDMPP LTRKERAEQV
KKRNYFTQYG EQARKVLEAL LDKYADEGVA QIEETRILTI APFTQFGTPL EIIRAFGGPD
QYQQAVNELE QALYNA