Gene Noc_1259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1259 
Symbol 
ID3706371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1386199 
End bp1389189 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content43% 
IMG OID637737761 
Producthypothetical protein 
Protein accessionYP_343290 
Protein GI77164765 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAATT CCGCGCTTAG AAGGGTAAGA ACATCACCGG CTTGGATAAG AAAACGATGG 
GTGCGCAACA GTGCTCTAGG GGTAGTAATT ATTCTCCTTA TCTATACCTT GGTTGGTTTT
TTTCTTGTTC CCTATCTGCT AGAGAAACAA CTAATCAATT ACTTAAAAGA AAATCTGGGT
GTAGAAGCAA AAGTTAAGGA AATCACGCTG AATCCCTATG CGTTGACTCT AGCGGTTAAT
AATTTTTCCT TTCACAAGTC GGGCCATCCT AAGCTGTTTG GTTTTAAGCA ATTCTATGCC
AATTTCGAGC TTTCAAGTAT ATTTCGTAAA GCGTGGGCCT TCCAAAAAAT TAGTCTAACT
AAGCCTTACC TGAGGCTGCA AATCAATAAA AATGGACAGG TTAATCTTGC GGAATTACTG
CCCGCGGATG AAACGCCGGC GCTAAAAGAG CGTAAAAAAG AAGTACCGCT GACAAGCGAT
CAGATCCTTA TAACTGGAGG AGATATCCAT TTCATTGATT TGACTCAGCC TACTCCCTTT
GAAAAAAAAC TAGAAGCCAT TAATGTGGAT TTAAAGAAAT TTAGCACCTT ACCTGAGAAT
GATGGCAGTT ATTCCTTTAA GGCCACCACC CAGGCGGGGG AAATCCTGCG CTGGAAGGGA
GAGGTTACCT TAAGTCCTTT ACATTCCAAG GGGACTTTTG AGCTTGTGGG AGGTAAAGCC
CGTACACTCT GGAAGTATTT ACGAGACCAA GTGGCTTTTG AAATTACCAG TGGCCGGATG
GATGGGCGTG GGAATTACAC CTTGGAGAGC CAGGAAAGGG GATTACAGAT TATTTTGAAG
GGCGTAACGT TTGCATTGAC CCAGTTAGGT CTTAAGCCCA AAGAGGGTAA TAGGGAAATC
CTTACAGTGC CAAAGTTGGG GTTTTCCGGG GGGCAGTTAC GGTGGCCGGA AAAAATCATT
GGGGTTAAGT TTATCTACAT GGGCGGAACT CAGGTACGGG CCTGGCTGAA TAAGCAAGGC
GCATTAAATT GGCAGAAATT ATTCAAAAGT AAAAAAGCCG ATAGAAAAAT GACGGAATCA
ACCGCTAATT CTTCATCCTT AAGCTGGCGG GCGGCGATAA AAAAGATCAA GGTTGAAAAC
GTTAGCGCCA GCCTGCAAGA TCGAACGATC GAACCACCAG CCACGTTAAA TATTAGCGAT
TTAGGTTTAC AGCTTATTGA TGTCACTTCA GTACCCGGTT TGGCGTCTAC TTTTAATTTG
CAATTCCATC TTAACAAGCA AGGTCAGCTT TTAGCTCGAG GTCATGTCAC GGCGTTGCCC
CCGTCCGCTG ATTTAGAAAT CCACCTAGAA TCCCTTTCCC TATTACCTTT CCAGCCCTAT
TTGAATCGTT TTCTTAAACT GAAGCTAATT TCAGGTAACC TGGAGGCTAA AGGAAATGTC
GCCTACGCTC AAAGTAATGA TGCTCCTAAC TTCCAGTTTA AGGGCGATCT GGTTCTGCAA
AAATTTGCCG CGGAGGATAC TTTGCTAGAT GAGCGTTTTC TAGGGTGGGA GAATTTAAAA
TTTGAGCGGG TATTGGTTGG GTTATTTCCT TCTCGGATTC ATATCGATAA CATAGCACTG
GATGCTCCTT ACGGAAAAGT GACGATTAAC GAGAATAAGA AGATCAATAT CAAAGAAGTA
TTAAGTCCCT TAGCAGGGAA GAAAGAAAGC CAGTCTGCAG ACCCTTCCTC TACTTCCGCA
TCTAAGCCGC TTCCTATTGC TATTAATTCA ATTCGAATTA AAAAAGGTTC GGCCAATTTT
GCCGATCTAA GCGTGTCTCC GAAATTTTCC ATGGGTATCC ATTCGCTTCA GAGTGAAATT
CAAAATCTAT CCTCGATGGA CCAAGGTAGA TCTTCCATTT CTTTGGAGGG AACCGTGGAG
TCCTATGGAG AAATGAGTAT GACAGGAAAG AGCAATCTTT TTGCTCTAGA ACGCGCCACC
GAGTTCAGCG CCTTTGCCAG GAATATTGCT CTTCCTGAAT TCACGCCTTA TGCAACGGAA
TTTTTGGGAT ACCCGATAGA GAAGGGGAAA TTATCCCTTG ATCTCACCTA TCGGATTAAA
GAAGACCAAA TCCAGGGTAA GAATGGGATT CTATTGAAAA ATCTGGATCT AGGAAAAAAA
GTTGAAAGTC CAAAAGCCAT CGATGCGCCC ATAAAGCTGG CAATTGGATT GCTTAAGGAT
TCTCAAGGAA AAATTGCTAT TCAGGTGCCT ATTGAAGGGA ATTTGAATGC ACCTAAGTTT
AGCTATGGGC ATCTTATCGG TGAGGCTTTA ACGGGTGTTA TTGGTAAGGT CATTTCCTCG
CCGTTTAGAC TGTTAGGAAG CCTAGTCGGC GCCAAAGAAG ATGTAGATTT AGGATTTATT
GAATTTAGAC CTATGGGCAG CAAGTTGTTG CCTCCGGCTC AGGAGAAGCT TTTACAGTTG
GCTAAGGCTT TAAAGAAGCG CCCGGAATTG CAGTTACAAA TACAAGGCAG GTATGATCCT
ATCACGGACT CCAATTTTTG GAAAAAAGAA AAATTTGAAG TGATCCTGTC AGATCAACTT
AAACAGCAAA GTGGTGCTTC GGACAAAGGC AAGAATGCCC TTGTTCGGCA ACAAGCATTA
GAGCAGCTTT ATTTAAAGCA GTTTTCTATT AAGTCTCTTA ATCAGCAGCG CGCTCAATAT
GGACTTCAAC CGGTAAAGAC AGGCGCAGGA AATGTTGAGC CGAATAATGC TTCTTCTCTA
GAGAAGAAAT TGTCTTCTTA TCGGAAAGCA CTTGAGAAAA AACTTATTGA AGCGCAGCCA
GTCAGTAAAA ATAAACTCCA GCAGTTAGGG CAGGAGCGGG CAAATGCCAT TAAGGCATAT
CTGGTCAGCA AGGGAGGCAT TCAGGAAAAA CGTCTAGGGA TACTTCAAGT CGAATCGACT
CAATCACCAG CGAAAGATTT CGTTCGTTGC CAGCTTCATA TCAGTAGCTA G
 
Protein sequence
MVNSALRRVR TSPAWIRKRW VRNSALGVVI ILLIYTLVGF FLVPYLLEKQ LINYLKENLG 
VEAKVKEITL NPYALTLAVN NFSFHKSGHP KLFGFKQFYA NFELSSIFRK AWAFQKISLT
KPYLRLQINK NGQVNLAELL PADETPALKE RKKEVPLTSD QILITGGDIH FIDLTQPTPF
EKKLEAINVD LKKFSTLPEN DGSYSFKATT QAGEILRWKG EVTLSPLHSK GTFELVGGKA
RTLWKYLRDQ VAFEITSGRM DGRGNYTLES QERGLQIILK GVTFALTQLG LKPKEGNREI
LTVPKLGFSG GQLRWPEKII GVKFIYMGGT QVRAWLNKQG ALNWQKLFKS KKADRKMTES
TANSSSLSWR AAIKKIKVEN VSASLQDRTI EPPATLNISD LGLQLIDVTS VPGLASTFNL
QFHLNKQGQL LARGHVTALP PSADLEIHLE SLSLLPFQPY LNRFLKLKLI SGNLEAKGNV
AYAQSNDAPN FQFKGDLVLQ KFAAEDTLLD ERFLGWENLK FERVLVGLFP SRIHIDNIAL
DAPYGKVTIN ENKKINIKEV LSPLAGKKES QSADPSSTSA SKPLPIAINS IRIKKGSANF
ADLSVSPKFS MGIHSLQSEI QNLSSMDQGR SSISLEGTVE SYGEMSMTGK SNLFALERAT
EFSAFARNIA LPEFTPYATE FLGYPIEKGK LSLDLTYRIK EDQIQGKNGI LLKNLDLGKK
VESPKAIDAP IKLAIGLLKD SQGKIAIQVP IEGNLNAPKF SYGHLIGEAL TGVIGKVISS
PFRLLGSLVG AKEDVDLGFI EFRPMGSKLL PPAQEKLLQL AKALKKRPEL QLQIQGRYDP
ITDSNFWKKE KFEVILSDQL KQQSGASDKG KNALVRQQAL EQLYLKQFSI KSLNQQRAQY
GLQPVKTGAG NVEPNNASSL EKKLSSYRKA LEKKLIEAQP VSKNKLQQLG QERANAIKAY
LVSKGGIQEK RLGILQVEST QSPAKDFVRC QLHISS