Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_2414 |
Symbol | aroB |
ID | 4598250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2574006 |
End bp | 2575103 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639777017 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_923606 |
Protein GI | 119716641 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGACA CCGTCGTCCG GGTGACCGGG GCCGCGCCGT ACGACGTCGT CATCGGCCAC GACCTGTCCG AGCGGCTGCC GCAGCTGCTC GGGGTCGGGG TGCAACGGGT CGCGGTCGTG TTCTCCGACG CGTTGGCCGA GCTGGTGAAC CCCGTCCTCG ACTCGCTGGC GGCGGCGTAC GACGTGATGG TGCTCCCGAT CCCCGACGGC GAACGGGCGA AGAAGGCTGC GGTCGCGGTC TCGTGCTGGG AGGCGCTCGG TGAGGCCGGG TTCACCCGCT CCGACGCGGT CGTGACCTTC GGGGGCGGTG CGACCACGGA CGTCGGTGGC TTCGTCGCCG CGACCTGGCT GCGCGGCGTG AAGGTCGTGC ACGTGCCGAC CACGCTGCTC GGCATGGTCG ACGCCGCGGT CGGCGGCAAG ACCGGCGTGA ACACCCGCAG CGGCAAGAAC CTGGTGGGTG CCTTCCACGA GCCGGCCGGC GTGCTCTGCG ACCTGTCGAC GCTGCGCTCC CTGCCCCGCG CCGAGCTGCT CGCCGGGCTC GGCGAGGTCA TCAAGTGCGG GTTCATCGCC GACCCCGCGA TCCTCAACCT CGTCGAGGGC AACGAGGCCT CCCGCCTCGA TGCCGACTCC CTCGTCCTGC GCGAGCTGGT CGAGCGTGCC GTGCGGGTGA AGGCCGGGGT CGTCTCGGCC GACCTCAAGG AGACCGGCGG CCGCCCGGAC GACCCGGGCC GGGAGATCCT CAACTACGGG CACACGATGG CGCACGCCAT CGAGCGCACC GAGAACTACC GGATCCGGCA CGGCGAGGCC GTCTCGATCG GGTGCGTGTA CGTCGCGGAG CTGGCCAACC GGGCCGGCAC CCTCGCCTCC GACATCGTCG AGCGGCACCG GCACGCGTTC GCCCGCGTCG GGCTGCCCAC GTCGTACTCC AAGGCCTCCT TCGACGACCT GCACCGGGCG ATGCGGGTCG ACAAGAAGGC GCGCGGCTCC CAGCTGCGCT TCATCGTGCT CTCCGACCTC GCGGTCCCGA CCGTGCTGGC CGGACCGTCC GTGGTGGACC TGCGCGACGC CTACGCCGCG ATCGGTGGCC CCGCGTGA
|
Protein sequence | MRDTVVRVTG AAPYDVVIGH DLSERLPQLL GVGVQRVAVV FSDALAELVN PVLDSLAAAY DVMVLPIPDG ERAKKAAVAV SCWEALGEAG FTRSDAVVTF GGGATTDVGG FVAATWLRGV KVVHVPTTLL GMVDAAVGGK TGVNTRSGKN LVGAFHEPAG VLCDLSTLRS LPRAELLAGL GEVIKCGFIA DPAILNLVEG NEASRLDADS LVLRELVERA VRVKAGVVSA DLKETGGRPD DPGREILNYG HTMAHAIERT ENYRIRHGEA VSIGCVYVAE LANRAGTLAS DIVERHRHAF ARVGLPTSYS KASFDDLHRA MRVDKKARGS QLRFIVLSDL AVPTVLAGPS VVDLRDAYAA IGGPA
|
| |