Gene Noc_0253 details

Gene Information

Locus tag: Noc_0253
Symbol: aroB
ID: 3706327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 279084
End bp: 280163
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 50%
IMG OID: 637736769
Product: 3-dehydroquinate synthase
Protein accession: YP_342313
Protein GI: 77163788
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
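
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 279084
END_BP = 280163


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result is 1080 bp and 359 aa, matching the Gene Length and Protein Length fields.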


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACAG TACAAGTCGA TCTTCAAGAG CGTAGCTATC CTATCTATAT TGGCAGCGGC 
CTACTAAAAC AAAATTCCCT TTTAGCAAAA CATATCGTGG GATCTGAAGT CATGGTAGTC
ACTAATGAGA CGGTGGCTCC ATTATATTTA GATACTCTCT TGAAGGGCCT CAAAGATTAT
CGATGTGCCG AAATCATTTT ACCTGACGGC GAGCAGCATA AGACTTTGGC AGTGCTGCAG
CAAATCTTTG ATGATCTGCT TAAAGTGCCC TTCTCCCGGC ATTGCACGGT GATTGCATTA
GGTGGGGGAG TAATCGGTGA TATGGCAGGC TTTGCCGCTG CCTGTTACCA ACGGGGAGTC
GCTTATATTC AGGTTCCCAC CACCTTGTTA GCGCAGGTAG ATTCCTCTGT TGGGGGAAAA
ACAGCGGTGA ATCATCCTCT AGGCAAGAAT ATGATCGGGG CCTTTTACCA GCCTCGCTGC
GTTTTAGCGG ATACAGATAC TCTCGATACT CTTGATGAGC GGCAGCTGCG GGCGGGATTA
GCTGAAGTCA TAAAATACGG CCTTATCAGA GATATCGACT TCTTTACCTG GCTAGAGGAG
CATGCCAGCG AAGTATTGGC GCGAGAGCCT TCAGCATTGA TCCATGCCAT TGAACGATCT
TGTCGTAATA AAGCCGAGAT AGTCGCCGCC GATGAACGGG AATCAGGGGT GCGGGCGATT
CTCAATCTAG GCCATACTTT TGGCCATGCC ATTGAAACAG GGCTGGGCTA TGGTGCCTGG
CTACATGGCG AGGCCGTTGC GGCGGGGATG GCAATGGCGG CGGATTTATC GCAGCGGTTG
GGATGGCTGT CGGCCACGGA AGTAGGCCGG GTGCTCAATT TACTGGAGCG GGCGGGCCTT
CCCCGGCATT CTCCCGAAGC CATCCATAAG GCTCGTTTCC TGGAGTTGAT GGCAGTAGAT
AAAAAAGTCA TTGATGGATG TTTACGCCTA GTCTTACTAA GGCAGCTGGG ACAAGCTGTC
GTCACCGATG GTTTTGATTC TGATTTGCTC GAAGCCACAA TAGACAAGGC CACCGTTTGA
 
Protein sequence
MITVQVDLQE RSYPIYIGSG LLKQNSLLAK HIVGSEVMVV TNETVAPLYL DTLLKGLKDY 
RCAEIILPDG EQHKTLAVLQ QIFDDLLKVP FSRHCTVIAL GGGVIGDMAG FAAACYQRGV
AYIQVPTTLL AQVDSSVGGK TAVNHPLGKN MIGAFYQPRC VLADTDTLDT LDERQLRAGL
AEVIKYGLIR DIDFFTWLEE HASEVLAREP SALIHAIERS CRNKAEIVAA DERESGVRAI
LNLGHTFGHA IETGLGYGAW LHGEAVAAGM AMAADLSQRL GWLSATEVGR VLNLLERAGL
PRHSPEAIHK ARFLELMAVD KKVIDGCLRL VLLRQLGQAV VTDGFDSDLL EATIDKATV
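
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: `clean`, `translate`, and `gc_fraction` are hypothetical helper names. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons), and it assumes the sequence is given in the coding orientation, as the leading ATG suggests.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; table 11 assigns the same amino acids.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First block of the gene sequence, exactly as displayed above.
print(translate("ATGATTACAG TACAAGTCGA TCTTCAAGAG"))  # MITVQVDLQE
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` can be compared with the GC content field.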