Gene Noc_1994 details

Gene Information

Locus tag: Noc_1994
Symbol:
ID: 3704878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2295049
End bp: 2297067
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 52%
IMG OID: 637738470
Product: molybdopterin/thiamine biosynthesis family protein
Protein accession: YP_343986
Protein GI: 77165461
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1
TIGRFAM ID:
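
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2295049
end_bp = 2297067
gene_length_bp = 2019
protein_length_aa = 672

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one terminal stop codon.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with the protein length
```

Under these assumptions all three checks hold for this record: 2297067 - 2295049 + 1 = 2019 bp, and 2019 / 3 - 1 = 672 aa.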


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACACAC AAGATGAGCC GCTAGAAAAA AATTCAGTAG AGAGTTCTAT TCAGCGTCAA 
GGCGAAGCTT TTATCTACCA GGAGGCTTTT AGCCGAAATA CCGGCTGGGT GACTGAGTGG
GAACAACAGA TATTACGTGG CAAGAAAGTG GCCATTGCGG GGATGGGAGG CGTGGGGGGC
GTCCATTTAT TAACTCTGGC GCGTTTAGGC ATTGGCGTGT TCCATATTGC GGATTTCGAC
GACTTTGAGC TTCCTAACTT TAATCGCCAG GTTGGAGCGA TGGTGAGTAC CCTAGGGCGG
CCTAAAGTCG AGGTGCTGGC TGAAATAGCC CGGGATATCA ACCCGGAAGT GGATCTGAAT
ATTTTTAATA AGGGAATTAA CCGCAACAAT GTTGATGCCT TTCTCGAAGG GGTAGACCTT
TTTGTCGATG GCTTCGATTT TTTTGTGTTG GATAGGCGGG CCCATGTTTT TGCCCGCTGC
GCGGAGCTTG GCATACCTGC AATTACGGCG GCGCCTATTG GAATAGGAAC GGCTTATTTA
GTATTTATGC CGGGGCATAT GACTTTTGAG GAGTACTTTT GCCTGGAAGG CCTGCCCATA
GAGCAGCAGT ATGTGAATTT TCTTGCGGGG CTAACGCCCA AAGGATTCCA CCGTGCTTAT
CTTGTAGACT CTTCGCGGCT TGATTTGGCT GCTCGCCGCG GTCCCTCAAC GGCCATGGGC
TGTCATCTGT GCGCGGGAGC CACAGGCGCG GAAGCGTTGA AGATTCTGCT GGGACGAGGC
CCTGTACGGT CTGCTCCACG CTACCATCAG TACGATGCCT ATCGGGGGAA ATGGCATCTT
GGGTGGCTGC CTGGCGGCAA TAACAACCCG TTCCAGCTTT TCAAGCGTAA GCGGGGATAC
CGGATGCTTG AACAGCTTTC TCAGAAAATA CCTACTGTAG CTTCGTCAAA GGCTGGTTCT
GAGGTTGAGC GTATTTTAGA TATGGCCCGG TGGGCGCCTA GCGGAGACAA TACCCAGCCC
TGGCGCTTTG AAATTAAGGA CTCTCATTAT GTCGTTGTCC ATGGCTTCGA TACCCGGGAT
CATTGTGTTT ATGATTTAGA AGGCCATGCT AGCCAGATCT CCGTGGGGAC ACTTCTGGAA
AGTATTACCA TTGCGGCGTC CCAATATGGC TGGCGTACCG ACATTCAACG AGATCTTAAT
ACTGCAGAGA CTCACCCTAA GTTCAATGTG CATTTTGTTC CTGAAGCTAC CCTTCGTCCC
GATCCCCTGT GGCCCTATAT TCCTGTGCGC GCTACCCAGC GCCGGGCTAT GTCTCTGCGC
GCGCTCACGG TCCGGGAAAA AAGCCTTCTC GAAGAATCTG TTAAGCCCTT GTTTTCCATT
CACTGGCTTG AAGGGCTAGA AAATAGGTTA AAGGTAGCCC GGTTACTGTT TATGAATGGA
AAGCTGCGGT TGACCATGCC GGAAGCTTAT GAGGTGCATC GAAGCGTCAT TGAATGGAAT
TCAAAATTTA GCAAAGATCG CATCCCGGAT CAGGCTGTGG GCTTGGATCC GCTGGGAGTG
GGACTCATGC GCTGGGCACT GAAAGACTGG GGGCGGGTAA AATTTTTAAA CACCTATCTT
GGGGGGACGC TACTACCCCG GATACAACTC GATTTTATTC CTGGTATTGC CTGTGCCGCT
CATTTTTTAA TTATCGCTCC AAAGCCGCCA CAATCCATGG ATGATTACAT TGCCACGGGA
AGAGCCTGGC AGCGTTTCTG GCTGACGGCC AGCAGGCTTA ACTTACGGCT TCAGCCAGAG
ATGACTCCCC TTATATTTAG TGCCTATCTT CGAGAGGGAA TACAATTTTC AAAAAGCGAA
TACAGCCAGC GTCTTGCCGC AGCGCTTTCG TCTCGACTAG AACAACTGCT GTCACCGGAT
ATCTGCCAGC GGGCCCAGGT GATGGGCCGG ATAGGAGCCG GCGCCGTCCC GAAAGCACGC
TCCACCCGGT TGCCTCTTGA ACGATTAATG GTGAGGTGA
 
Protein sequence
MHTQDEPLEK NSVESSIQRQ GEAFIYQEAF SRNTGWVTEW EQQILRGKKV AIAGMGGVGG 
VHLLTLARLG IGVFHIADFD DFELPNFNRQ VGAMVSTLGR PKVEVLAEIA RDINPEVDLN
IFNKGINRNN VDAFLEGVDL FVDGFDFFVL DRRAHVFARC AELGIPAITA APIGIGTAYL
VFMPGHMTFE EYFCLEGLPI EQQYVNFLAG LTPKGFHRAY LVDSSRLDLA ARRGPSTAMG
CHLCAGATGA EALKILLGRG PVRSAPRYHQ YDAYRGKWHL GWLPGGNNNP FQLFKRKRGY
RMLEQLSQKI PTVASSKAGS EVERILDMAR WAPSGDNTQP WRFEIKDSHY VVVHGFDTRD
HCVYDLEGHA SQISVGTLLE SITIAASQYG WRTDIQRDLN TAETHPKFNV HFVPEATLRP
DPLWPYIPVR ATQRRAMSLR ALTVREKSLL EESVKPLFSI HWLEGLENRL KVARLLFMNG
KLRLTMPEAY EVHRSVIEWN SKFSKDRIPD QAVGLDPLGV GLMRWALKDW GRVKFLNTYL
GGTLLPRIQL DFIPGIACAA HFLIIAPKPP QSMDDYIATG RAWQRFWLTA SRLNLRLQPE
MTPLIFSAYL REGIQFSKSE YSQRLAAALS SRLEQLLSPD ICQRAQVMGR IGAGAVPKAR
STRLPLERLM VR
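
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own pipeline. The `translate` helper and `CODON_TABLE` are hypothetical names introduced here. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model; that is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-residue assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases and the final line of the gene sequence, copied from this record.
first_bases = "ATGCACACAC AAGATGAGCC GCTAGAAAAA"
last_line = "TCCACCCGGT TGCCTCTTGA ACGATTAATG GTGAGGTGA"

print(translate(first_bases))  # MHTQDEPLEK
print(translate(last_line))    # STRLPLERLMVR
```

Both results match the corresponding ends of the protein sequence listed above. Each full line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own.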