Gene Noc_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0446 
Symbol 
ID3706617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp480607 
End bp481911 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content47% 
IMG OID637736956 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_342500 
Protein GI77163975 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000807768 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGG CAATGAAGAA TGACAAACAA ATGCGGGTGC CCAAGCTGCG CTTTCCTGAG 
TTTCGGGATG CGGGAGAGTG GGAGAAAGTT GCGCTTTCAA CGCAAGTTGA ACTCCTCTCG
GGGCTTCATC TTTCACCGGA CGGATATACA GACACGGGAG ACATCCCGTA TTTCACAGGG
CCATCAGATT ACACAAATGA CTTAGCTTTA GTGAGTAAAT GGACAACTCG TAGTGCAAAC
GTAGGGCGCG CGGGGGATAC ACTAATAACT GTAAAGGGGA GCGGTGTGGG CGAGCTGCTT
AACTTAGAGC TCGATGAAGT GGCTATGGGT CGTCAGTTGA TGGCAGTCAG AGCACGTACA
GCACACGGAG AGTTTATTTT CCATTTTCTG ATAACGCAGC GCCTGCGGCT GATTGCTCTG
GCCTCTGGGA ACCTTATTCC GGGACTCTCA CGGGGCGACA TCTTAAGCCT CAAAGTGCCA
GTGCCAAGCC ATGAAGAACA ACAAAAAATC GCCGATTGTC TCTCCTCCCT CGATGCCCTG
ATTGCCGCCC AGACAGAAAA ACTCGACGCC CTCAAAACCC ACAAAAAAGG ACTGATGCAG
CAACTCTTCC CCCGGGCCGG CGAAACCGTC CCCCGGCTGC GCTTTCCCAA GTTTCGGGAT
GGGGGGCGTT GGACAAGTAA AAAGATGAGT GACGTGTACC GATTCCTCTC AACAAATACG
TATTCAAGAG ACAAGTTGAA TTACGAAAAA GGGGAAGTAA AAAATATTCA TTACGGAGAC
ATCCATACAA AATTTTCTAC GTTGTTCGAT GTAACACAAG AATACGTTCC ATATATTAAT
AGGACTGAAT CGCTAGAACG GATAAAAGAT GACAGCTATT GCTTAGAGGG CGATATCGTA
TTCGCAGATG CTTCAGAGGA CGTCGAAGAT GTAGGGAAAA GCATTGAAAT CGTAAACACT
GGTAACGAAA AAATACTATC TGGACTGCAT ACATTGCTGG CGCGACAAAA AAATAATGAC
TTAGTTATTG GTTTTGGTGG TTATCTATTT AAGTCTGGCT TAATTCGAGA ACAGATCAAA
AGAGAATCTC AAGGCGCTAA GGTTTTGGGC ATCTCCTCCG GGCGGTTGTC AAAGATTAAA
GTTTGTTTTC CATATGAAAA ACGCGAACAA CAAAAAATCG CCCATTGCCT CTCCTCCCTC
GATGCCCTGA TTGCCGCCCA GGCGGAAAAA ATCGACGCCC TCAAAACCCA CAAAAAAGGA
CTGATGCAGC AGCTCTTTCC TTCGCTGGAG GAAGTCCATG CATGA
 
Protein sequence
MSKAMKNDKQ MRVPKLRFPE FRDAGEWEKV ALSTQVELLS GLHLSPDGYT DTGDIPYFTG 
PSDYTNDLAL VSKWTTRSAN VGRAGDTLIT VKGSGVGELL NLELDEVAMG RQLMAVRART
AHGEFIFHFL ITQRLRLIAL ASGNLIPGLS RGDILSLKVP VPSHEEQQKI ADCLSSLDAL
IAAQTEKLDA LKTHKKGLMQ QLFPRAGETV PRLRFPKFRD GGRWTSKKMS DVYRFLSTNT
YSRDKLNYEK GEVKNIHYGD IHTKFSTLFD VTQEYVPYIN RTESLERIKD DSYCLEGDIV
FADASEDVED VGKSIEIVNT GNEKILSGLH TLLARQKNND LVIGFGGYLF KSGLIREQIK
RESQGAKVLG ISSGRLSKIK VCFPYEKREQ QKIAHCLSSL DALIAAQAEK IDALKTHKKG
LMQQLFPSLE EVHA