Gene Noc_A0028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_A0028 
Symbol 
ID3704331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007483 
Strand
Start bp24008 
End bp26758 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content54% 
IMG OID637736523 
Producthypothetical protein 
Protein accessionYP_342071 
Protein GI77163545 
COG category[V] Defense mechanisms 
COG ID[COG1002] Type II restriction enzyme, methylase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTCG ATCAAACCTA CTGCATGGAT ACCCTCATTG CTGCGACCCG GCCTGATGCT 
GATCTCGGAG AGTTCATTTT TGCCTTTCTC GACGCCTATA ACTTTCCCAA GGCGACGGTT
ACGCAGATTC GCAAGGGTGG TCAACGTAAC GTCGCCAGCC GTAAAAATGA AGGGCACGTC
GCGATGAAGA ACTGGCTGTA CTTCATGCCG GTGCGAGCGG GCGAGAGCAT TCACGAGGCG
CTCCAGGTAC TGGCAGATGA AGAAGAGCCG CCGCGCCATA AGTGTCGCTT TCTGGTGGTG
ACCGACTATC AAGAACTCAC TGCACTGGAT ACCAGGACGG ATGAGCGTTT GGAGGTGATT
CTCGGTGAGC TGGCCACCCA ATACCTGTTC TTTGCCCCCA TGGCGGGGCT TGAGCGTACC
AAGCCATTTA GCGAAGAATC GGCCGACCTC AAGGCCGCCG CCAAGATGGG GCGGCTGTTT
GATCGCCTGA AGGAAATCAA CGAGTTTCAG ACTCCCGAGC AGCTTCACGC GCTTAACGTG
TTTCTGACCC GGTTGCTGTT CTGCTATTTC GCTGAGGATA CCGGGATTTT CCCCAAGAAC
GCCTTCACCA AAGTGATCAC CGAAGCTAGC GGCAAGAACG GCGAGGGGCT ATCCGACCTA
CTAAAGCAGC TGTTCCGAGT GATGGATCAA GCCGAGGGCG AGCGACCGGC GGACTTGCCA
GCTCATATTG CCCAGTTCCC TTATGTCAAC GGCGGCCTAT TCCGTGACAA CATGCCAGCC
CCCAAGATGC GCGGTAAAGC CAGACGGATG ATGATCGAGT GCGGCAAACT CCAGTGGAAG
GCCGTCAACC CCGATATCTT CGGCTCGATG TTTCAGGCGG TAGTAGACGA AAAGAGCCGT
GACTCGTTGG GCCAGCACTA CACCTCTGTA CCGAACATCA TGAAGGTGAT CCGTCCCCTG
TTCCTCGACA AGCTGTATGC CGACCTGCAC AAGTCAAAGG GCAAACGCAG GCAACTGGAA
GCACTACTGG TACGGCTGGC CCGTATTTGG GTGTTCGATC CGGCGATGGG CTCCGGCAAT
TTTCTGATCA TCGCCTATAA GGAACTGCGC CGACTGGAGA TGGCCACCTT CCGGTCACTT
CAAGCTATGA GTGGTAGTGG CCAGCAGGAA ATTTTCATGA GTGGCATCCA GCTCAGCCAG
TTCTATGGTA TCGAGATCGA CGATTTCGCG CACGAAATTG CCCAGCTATC CCTATGGCTG
GTTGAGCATC AGATGAACAC GCTGTTTGTA AAGGAGTTTG GTCATGCTGA GCCAGTGCTA
CCGCTAAAAG ATACTGCCAA CTTGGTGCAA GGAAATAGCC TTCGGATGGA TTGGCAGAAG
GTATGTCCCA ATGATGGCAG CGCTGAAATA TATGTATGCG GGAACCCGCC ATTTATTGGC
CATGGTAGTC GAGAGAATAG CCAGCTCGAC GATATGCGAT TGGTGTTAGG GCAGCTGATT
CGCACATACA AGTCCCTCGA CTATGTGGCC TGCTGGTTCT TTTTGGCTGC GGAATACTGT
CGCCACGGGA CAGCAAACGC CGCCTTTGTG TCGACGAACT CTCTATGCCA AGGCAAACAA
GCGGGGCTAC TGTGGCCGTT GTTGGTAGAT ATGGGGATGA AAATCAGCTT TAGCTACCAA
ACTTTCCCTT GGCGGAACAG TGCAAAGGGT AACGCAGGTG TGCATGTTGT GGTCATTGGG
CTGGCTGCTC ATAACGGACC ACGGGTACTC TTCAACCGTA TTGATGGTGC CTGGCATCGT
AAAGAAGTTA CGAACATCAG CCCCTATCTA CTAGAAGGAG GCGACACTGT TGTACGGGAG
CGACGAGATC CGCTAATTCA GGACGCATTA CCAATGTACT TTGGCAATAT GCCTAACGAC
GGTGGCCACC TCCTGTTGAC CGCAGGGGAT AAAGAGAAGT TAATCGCACA AGAGCCCGCG
GCAGAGGCTT GGATCAAGCG CTTAATGGGG GCTAAAGAGT TCCTGCAAGG CCATGAGCGC
TGGTGTCTGT GGTTGGTTAA TGCCACGAAA GAAGAAATCG ATGCTATGCC TGTGGTTCGC
GAGCGAGTGG AGCGTGTGCG GGAAACACGT CTTGCTAGCA AAGATGCTGG CGCACGAAAG
CTCGCCGAAC GGCCGCACCA ATTCCGAGAC CTTAATAACC CGGAAAGCTT CATTTTAGTC
CCAAGCGTTA CATCCGAGCG CCGGAAATAT GCGCCCGTCG GGATTTTTGA AGAAGACGTA
ATTGCTACTA ACCTAACATT AATCATTCCA GATGCTGGGT TATACGATTT CGCCATTCTT
TCCACGCAAA TGCACATGGA CTGGCTACGC CTGGTGGGAG GCCGTTTAGA AAGCCGTTAC
CGCTATTCTG CAACTATCGT CTACAACACC TTCCCTTGGC CCAATGCTAC CGAAGCACAG
CGTAACGCTA TCGAAAAACT AGGCCGAGCC GTTATTCTGG CGCGTGCAGC GCATCCCGAT
AAAACCATGG CCCAGCTTTA TGACCCGGAC AAGATGCCGG ACAAACTGCT GGAGGCCCAC
CAAGCACTGG ACCGCGCCGT GGAGCGCCTG TATCGGGAGC GCCCCTTCCG CGATACCGCT
GAGCGTCAGG AATATCTGCT GGCCCGCTAT GAGTCGCTGA TTGAGGCGGA GAAAACCGCC
AAGGCTGGTA GCAGGAAACA GCCTCGAAAA GCCACGAGTA TGGAGAGTTA A
 
Protein sequence
MAFDQTYCMD TLIAATRPDA DLGEFIFAFL DAYNFPKATV TQIRKGGQRN VASRKNEGHV 
AMKNWLYFMP VRAGESIHEA LQVLADEEEP PRHKCRFLVV TDYQELTALD TRTDERLEVI
LGELATQYLF FAPMAGLERT KPFSEESADL KAAAKMGRLF DRLKEINEFQ TPEQLHALNV
FLTRLLFCYF AEDTGIFPKN AFTKVITEAS GKNGEGLSDL LKQLFRVMDQ AEGERPADLP
AHIAQFPYVN GGLFRDNMPA PKMRGKARRM MIECGKLQWK AVNPDIFGSM FQAVVDEKSR
DSLGQHYTSV PNIMKVIRPL FLDKLYADLH KSKGKRRQLE ALLVRLARIW VFDPAMGSGN
FLIIAYKELR RLEMATFRSL QAMSGSGQQE IFMSGIQLSQ FYGIEIDDFA HEIAQLSLWL
VEHQMNTLFV KEFGHAEPVL PLKDTANLVQ GNSLRMDWQK VCPNDGSAEI YVCGNPPFIG
HGSRENSQLD DMRLVLGQLI RTYKSLDYVA CWFFLAAEYC RHGTANAAFV STNSLCQGKQ
AGLLWPLLVD MGMKISFSYQ TFPWRNSAKG NAGVHVVVIG LAAHNGPRVL FNRIDGAWHR
KEVTNISPYL LEGGDTVVRE RRDPLIQDAL PMYFGNMPND GGHLLLTAGD KEKLIAQEPA
AEAWIKRLMG AKEFLQGHER WCLWLVNATK EEIDAMPVVR ERVERVRETR LASKDAGARK
LAERPHQFRD LNNPESFILV PSVTSERRKY APVGIFEEDV IATNLTLIIP DAGLYDFAIL
STQMHMDWLR LVGGRLESRY RYSATIVYNT FPWPNATEAQ RNAIEKLGRA VILARAAHPD
KTMAQLYDPD KMPDKLLEAH QALDRAVERL YRERPFRDTA ERQEYLLARY ESLIEAEKTA
KAGSRKQPRK ATSMES