Gene Noc_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2023 
Symbol 
ID3705174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2335103 
End bp2336209 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content56% 
IMG OID637738499 
ProductGTP cyclohydrolase II 
Protein accessionYP_344014 
Protein GI77165489 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCTTGA GCAAGACGGA AGCGATCATT CAGGATCTCC GTGAAGGAAA AATGGTTATC 
CTCATGGACG ATGAGGATCG GGAGAATGAG GGAGACTTGA TTATGGCAGC CTCCCAGGTG
AGGGCGCCAG ATATCAATTT TATGGCCCGC TATGGACGGG GATTGATTTG TCTGACCTTG
ACTGAGGCCC GTTGCCGCCA ATTACGGCTG CCCTTAATGG TGTCAGACAG TAACGCTAAA
TATAGTACTA ATTTTACGGT ATCCATCGAA GCGGCAACGG GGGTGACTAC GGGAATTTCC
GCAGCGGATC GGGCTCGCAC CGTGCAAGCG GCGGTTGCTC CAGAGGCGCG CCCCGAGGAT
TTGGAGCAGC CGGGCCATAT TTTCCCCCTC ATGGCTCGTC CAGGGGGAGT GCTTACCCGA
GCAGGGCATA CGGAGGCAGG TTGTGACTTG GCGCGGCTGG CTGGTTTTGA GCCGGCGGCA
GTCATTGTGG AAATTCTCAA TGAAGATGGA AGTATGGCCC GCCGCCCGGA TCTGGAAGTT
TTTGCCGAGC GCCACGGTTT GAAACTGGGC ACCATCGCCG ATCTGATTCG TTACCGTTTA
GAACATGAAC GCTCTGTCGC ACGGGTAGCC GAGTGCGCTC TGCCTACGGA ACAGGGGCTG
TTCCGCCTCT TAGCTTATCA GGATCTGGTG GACCAGCAGC TCCATCTGGC TTTAGTTAGG
GGCGAGCTTT GTCCTGAAGA GCCGGCTTTG GTGCGTGTTC ACATGGCTGA TACCCTTTGC
GATATTCTCC AGGTACGGCG CGGTGATTGT GGCTGGCCCT TGCACGATGC CATGACTCGC
ATCGCCAAGG CGGGTACCGG CGTGGTCGTG ATTCTACGCC GGCCGGAATC TTCTAGTGAT
TTGGTGCAGC GAATTCAGGA CTATAATCTG GAGGATCAGG GGGAGCGTTT GCCTCGGCAA
GAGCCTCCAA ATGATTTGCG GACTTATGGA GTTGGAGCCC AGATCTTGAC TGATTTGGGT
GTGCAAAAGA TGCGGGTGAT GAGCGCTCCC CGGCGAATGC ATGGGCTGGC GGGTTTTGGC
CTGGAAGTGG TGGATTATGT TACTTGA
 
Protein sequence
MPLSKTEAII QDLREGKMVI LMDDEDRENE GDLIMAASQV RAPDINFMAR YGRGLICLTL 
TEARCRQLRL PLMVSDSNAK YSTNFTVSIE AATGVTTGIS AADRARTVQA AVAPEARPED
LEQPGHIFPL MARPGGVLTR AGHTEAGCDL ARLAGFEPAA VIVEILNEDG SMARRPDLEV
FAERHGLKLG TIADLIRYRL EHERSVARVA ECALPTEQGL FRLLAYQDLV DQQLHLALVR
GELCPEEPAL VRVHMADTLC DILQVRRGDC GWPLHDAMTR IAKAGTGVVV ILRRPESSSD
LVQRIQDYNL EDQGERLPRQ EPPNDLRTYG VGAQILTDLG VQKMRVMSAP RRMHGLAGFG
LEVVDYVT