Gene Noc_0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0852 
Symbol 
ID3707157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp933927 
End bp935207 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID637737354 
Productphosphopyruvate hydratase 
Protein accessionYP_342895 
Protein GI77164370 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0148] Enolase 
TIGRFAM ID[TIGR01060] phosphopyruvate hydratase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TCATTGATAT CACGGGGCGA GAAATTTTAG ACTCCCGCGG TAATCCCACG 
GTAGAGGCGG ATGTTGTTCT CGAAGGGGGA GTGCGGGGAC GGGCTGCGGT ACCTTCTGGC
GCTTCAACCG GCACTCGGGA GGCCGTGGAG TTGCGAGACG AAGATTCTAG CCGTTATGGG
GGCCAAGGGG TGCTTAAGGC CGTGGGCCAC ATTAACGGCG ACATTCGCCA GCGATTAGTG
GGGCAGGAAG CGGAAGCCCA GGAGAGCATC GATCAGGCCA TGCTCGATCT GGACGGCACG
TCCAGCAAAA GGCGTTTGGG AGCCAATGCC ATTTTAGCCG TTTCCCTCGC CGTAGCACGG
GCAGCAGCTC TGGCAGCGGA AAAACCCCTC TACCGCTATT TAAGCGCAGA TGAACGTTTC
CAGATGCCGG TACCCATGAT GAATATCATC AATGGCGGGG CCCATGCGGA TAACAATGTA
GATCTGCAAG AGTTTATGAT TGTGCCGGTA GGGGCGGGTA GTATCGCAGA GGCGGTGCGC
TATGGCGCCG AGGTATTTCA TGCATTAAAA AAAGTGTTAC GGGGACGGGG TTTGGGTACG
GGCGTGGGTG ATGAAGGTGG TTTCGCGCCG GATCTTTCAT CTAACGTGGC TGCTATTGAA
GCGATCTTGG AAGCGATTAC CCAGGCGGGT TTTGAGCCTG GCCGCGATAT CAGCCTGGCC
CTGGATACGG CGAGTTCCGA ATTTTATCAA GATGGCCGTT ATGTACTGGC TTCAGAGGGT
AAGACGCTCG ATAAAGAGGA GTTTACGGGC GTTCTTGCTT CTTGGGTGGA GCAGTATCCC
ATTCTCTCCG TAGAAGATGG CATGGCGGAG GACGATTGGG AGGGCTGGGC TTTGCTAACC
CAACGCCTAG GCCAGCGAGT GCAGCTCGTA GGGGATGATT TGTTTGTCAC CAATACCGCT
ATTCTCAAGG AAGGCATTAA CCGAGGTATA GCCAATTCCA TTTTAATCAA GGTTAATCAG
ATTGGGACTT TAACCGAAAC CCTGGCCGCT ATCCGTATGG CCCATGAGGC GGGTTATACG
GCAGTGATTT CCCACCGCTC TGGAGAAACC GAAGATACGA CGATTGCGGA TCTGGCCGTG
GCTACCCAGA GCGGTCAGAT TAAAACCGGT TCCCTATCCC GGACAGACCG AGTAGCTAAA
TACAATCAGT TGCTGCGAAT AGAAGCGGAG CTCGGGGACA AGGCCCATTA CCCAGGGCGT
CAGGCTTTTA CTCACCCCTA G
 
Protein sequence
MSKIIDITGR EILDSRGNPT VEADVVLEGG VRGRAAVPSG ASTGTREAVE LRDEDSSRYG 
GQGVLKAVGH INGDIRQRLV GQEAEAQESI DQAMLDLDGT SSKRRLGANA ILAVSLAVAR
AAALAAEKPL YRYLSADERF QMPVPMMNII NGGAHADNNV DLQEFMIVPV GAGSIAEAVR
YGAEVFHALK KVLRGRGLGT GVGDEGGFAP DLSSNVAAIE AILEAITQAG FEPGRDISLA
LDTASSEFYQ DGRYVLASEG KTLDKEEFTG VLASWVEQYP ILSVEDGMAE DDWEGWALLT
QRLGQRVQLV GDDLFVTNTA ILKEGINRGI ANSILIKVNQ IGTLTETLAA IRMAHEAGYT
AVISHRSGET EDTTIADLAV ATQSGQIKTG SLSRTDRVAK YNQLLRIEAE LGDKAHYPGR
QAFTHP