Gene Noc_1808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1808 
Symbol 
ID3705325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2045515 
End bp2046966 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content48% 
IMG OID637738291 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_343808 
Protein GI77165283 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.300643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTGTG AGTGGCCTCT CGTGACATTA TCGAAGTTGA TAGAGATTAA GCATGGCTGG 
GCCTTCAAAG GAAAGCACAT GGCTGAGAGT GTGATCAAAG GACCTATCGT TGTAGCAATT
GGAAACTTTG ATTATTCCGG TGGTTTTCGT TTTTCAAGCA CTAGAATAAA GCGGTACACA
GAGGACTACC CCAAAGAGTA TCAACTTCAA CCTGGCGATG TTCTTCTTGC AATGACGTGC
CAAACCCCAG GTGGTGAAAT TCTAGGACTG CCGGGAATAA TTCCAGAAGA CGATGAAGTG
TACCTTCACA ACCAACGGTT GGGAAAGCTC ATAGTAAAAG AGCCCAAAAA AGTTTGGGCG
CCTTTTTTAT ATTGGGTCTT TCTTTCTTAT GACTTCAACC GTTATTTGGC GGGAAGTGCC
ACTGGCACAA AAATATTGCA CACATCGCCA AACAAAATAA CTTCATACGA AACTAGAATA
CCGCCAATTA ACCTGCAACA GTCCATTGCA AATATCCTTT GGAGCATTAG CGACAAAATA
AGTCTAAACC ACCAAATCAA CCAAATCCTC GAACAAATGG CCCAAGCCAT CTTCAAAAGT
TGGTTTGTGG ATTTCGAGCC GGTCAAAGCC AAAATCGCCG CACTGAAAGC CGGCGGCAGC
CAGGAAGACG CCCTGCTTGC CGCCATGCAG GCCATTTCCG GCAAATCCTC GGAACAACTC
ACCCGCCTTC AGGCCGAACA GCCCGAGCAA TACGCCCAAC TCCGCACCAC CGCCGAGCTA
TTTCCGTCGG CCATGCAAGA TAGTGAGTTG GGGGAGATTC CGGAGGGGTG GAGTTGCCGC
GCACTCGATG ACATTGCCAA ATACAAAAAC GGCTTAGCTC TTCAAAAATT CAGGCCAGAG
AATGAAAATG ACTACCTGCC CGTGGTTAAA ATCGCACAAC TGAAAAAAGG CTATGCAGAT
GGTGAAGAAA AAGCGTCTCC AAATATCAAC CCCGAGTGCA TCATAGATAA TGGCGACGTA
GTTTTTTCGT GGTCAGGCTC ACTACTCGTC GATACATGGT GTGGAGGCAG AGCAGCCTTA
AACCAGCATC TATTTAAGGT TACCTCCGAA ACACACCCCA AATGGCTTTA TTATCATTTC
ACCCAGCACC ATCTTGAAGA TTTTCAGCGT ATTGCAGCTG ACAAAGCCGT CACCATGGGG
CACATTAAAA GAGAACACCT GAAGCGTGCT CTATGCGCTA TTCCCTGTGA GCAACTTATC
TCGGACGCGG GTAACTCTTT ACGTAACATC CTGGAAAAGC AGATTGAGCT TCGGCTTGAA
TCAATCACAT TATCTACATT ACGCGACACC CTCCTACCCA AACTCCTCTC CGGTGAGCTT
TCGATATCGG ACGCCGAAAG CCGGGTATCC GAAATGGATG TCAGCGCTTG CACAGAGGAC
AGCCTCCTCT GA
 
Protein sequence
MSCEWPLVTL SKLIEIKHGW AFKGKHMAES VIKGPIVVAI GNFDYSGGFR FSSTRIKRYT 
EDYPKEYQLQ PGDVLLAMTC QTPGGEILGL PGIIPEDDEV YLHNQRLGKL IVKEPKKVWA
PFLYWVFLSY DFNRYLAGSA TGTKILHTSP NKITSYETRI PPINLQQSIA NILWSISDKI
SLNHQINQIL EQMAQAIFKS WFVDFEPVKA KIAALKAGGS QEDALLAAMQ AISGKSSEQL
TRLQAEQPEQ YAQLRTTAEL FPSAMQDSEL GEIPEGWSCR ALDDIAKYKN GLALQKFRPE
NENDYLPVVK IAQLKKGYAD GEEKASPNIN PECIIDNGDV VFSWSGSLLV DTWCGGRAAL
NQHLFKVTSE THPKWLYYHF TQHHLEDFQR IAADKAVTMG HIKREHLKRA LCAIPCEQLI
SDAGNSLRNI LEKQIELRLE SITLSTLRDT LLPKLLSGEL SISDAESRVS EMDVSACTED
SLL