Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0253 |
Symbol | aroB |
ID | 3706327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 279084 |
End bp | 280163 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637736769 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_342313 |
Protein GI | 77163788 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACAG TACAAGTCGA TCTTCAAGAG CGTAGCTATC CTATCTATAT TGGCAGCGGC CTACTAAAAC AAAATTCCCT TTTAGCAAAA CATATCGTGG GATCTGAAGT CATGGTAGTC ACTAATGAGA CGGTGGCTCC ATTATATTTA GATACTCTCT TGAAGGGCCT CAAAGATTAT CGATGTGCCG AAATCATTTT ACCTGACGGC GAGCAGCATA AGACTTTGGC AGTGCTGCAG CAAATCTTTG ATGATCTGCT TAAAGTGCCC TTCTCCCGGC ATTGCACGGT GATTGCATTA GGTGGGGGAG TAATCGGTGA TATGGCAGGC TTTGCCGCTG CCTGTTACCA ACGGGGAGTC GCTTATATTC AGGTTCCCAC CACCTTGTTA GCGCAGGTAG ATTCCTCTGT TGGGGGAAAA ACAGCGGTGA ATCATCCTCT AGGCAAGAAT ATGATCGGGG CCTTTTACCA GCCTCGCTGC GTTTTAGCGG ATACAGATAC TCTCGATACT CTTGATGAGC GGCAGCTGCG GGCGGGATTA GCTGAAGTCA TAAAATACGG CCTTATCAGA GATATCGACT TCTTTACCTG GCTAGAGGAG CATGCCAGCG AAGTATTGGC GCGAGAGCCT TCAGCATTGA TCCATGCCAT TGAACGATCT TGTCGTAATA AAGCCGAGAT AGTCGCCGCC GATGAACGGG AATCAGGGGT GCGGGCGATT CTCAATCTAG GCCATACTTT TGGCCATGCC ATTGAAACAG GGCTGGGCTA TGGTGCCTGG CTACATGGCG AGGCCGTTGC GGCGGGGATG GCAATGGCGG CGGATTTATC GCAGCGGTTG GGATGGCTGT CGGCCACGGA AGTAGGCCGG GTGCTCAATT TACTGGAGCG GGCGGGCCTT CCCCGGCATT CTCCCGAAGC CATCCATAAG GCTCGTTTCC TGGAGTTGAT GGCAGTAGAT AAAAAAGTCA TTGATGGATG TTTACGCCTA GTCTTACTAA GGCAGCTGGG ACAAGCTGTC GTCACCGATG GTTTTGATTC TGATTTGCTC GAAGCCACAA TAGACAAGGC CACCGTTTGA
|
Protein sequence | MITVQVDLQE RSYPIYIGSG LLKQNSLLAK HIVGSEVMVV TNETVAPLYL DTLLKGLKDY RCAEIILPDG EQHKTLAVLQ QIFDDLLKVP FSRHCTVIAL GGGVIGDMAG FAAACYQRGV AYIQVPTTLL AQVDSSVGGK TAVNHPLGKN MIGAFYQPRC VLADTDTLDT LDERQLRAGL AEVIKYGLIR DIDFFTWLEE HASEVLAREP SALIHAIERS CRNKAEIVAA DERESGVRAI LNLGHTFGHA IETGLGYGAW LHGEAVAAGM AMAADLSQRL GWLSATEVGR VLNLLERAGL PRHSPEAIHK ARFLELMAVD KKVIDGCLRL VLLRQLGQAV VTDGFDSDLL EATIDKATV
|
| |