Gene Information |
Locus tag | Noca_3092 |
Symbol | |
ID | 4597877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3294173 |
End bp | 3295285 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639777698 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_924281 |
Protein GI | 119717316 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
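The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the 370 aa protein length excludes the terminating stop codon. The record states neither convention, but both reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3294173
END_BP = 3295285


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1113 370
```

Under these assumptions the 1113 bp gene spans 371 codons: 370 amino acids plus one stop codon.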
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.495784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAGTC GCGGCCGTCC GGCCGCCGCC GGTCACCTAC GGTTCTCCCG CCCCCGAGAG GAGTCCCGCA TGGTCGTCGT GATGTCCCCC GATGCCACCG CCGAGGACGT CGCCCACGTC GTCGCGCGGG TCGAGAGCGT CGGCGGCAAG GCGTTCGTCT CGACCGGCGT GGTCCGCACG ATCATCGGAC TGGTCGGCGA CATCGACTCC TTCCACCACC TCAACCTCCG CACCCTGCGC GGCGTCGCCG ACGTGCACCG GATCTCCGAC CCCTACAAGC TCGTCAGCCG CCAGCACCAC CCGGACCGGT CCACCGTCTG GGTCGGCGCC CCCGGCCGCC AGGTCCCGAT CGGCCCGGAG ACGTTCACGC TGATCGCGGG ACCGTGCGCG GTCGAGACCG CGGAGCAGAC GCTCGAGGCG GCGCAGATGG CCCGCTCCGC GGGGGCGACG ATCCTGCGCG GCGGCGCGTT CAAGCCACGG ACCTCGCCGT ACGCCTTCCA GGGGCTGGGC GTCGCCGGGC TCAGGATCCT CGCCGACGTC GGCGCCGCGA CCGGGCTGCC GGTCGTGACC GAGGTGGTCG ACGCCCGCGA CGTCGCCGTG GTCGCGGAGC ACGCCGACAT GCTCCAGGTC GGGACGCGGA ACATGGCGAA CTTCGGGCTG CTCCAGGCCG TCGGCGAGTC CGGGAAGCCG GTGCTGCTCA AGCGCGGGAT GACGGCCACG ATCGAGGAGT GGCTGATGGC GGCGGAGTAC ATCGCCCAGC GCGGCAACCT GGACGTGGTC CTCTGCGAGC GCGGCATCCG GACCTTCGAG CCGTCCACCC GCAACACCCT CGACATCTCC GCCGTGCCCG TCGTGCAGGC CACCAGCCAC CTCCCGGTCG TCGTCGACCC CTCGCACGCT GCGGGCCGCA AGGACCTGGT CGTCCCGCTG TCGCGGGCCG CGATCGCCGT CGGCGCCGAC GGCGTGATCG TCGACGTCCA CCCGGACCCG GAGACCGCCC TGTGCGACGG ACCCCAGGCC CTGCTCGGCT CCGAGCTGCG CGACCTGGCC CAGGCGGTAC GCCGGCTCCC CGAGATGGTC GGCCGCCGAC CCGCGGCCGA CCACGCAGGC TGA
|
Protein sequence | MLSRGRPAAA GHLRFSRPRE ESRMVVVMSP DATAEDVAHV VARVESVGGK AFVSTGVVRT IIGLVGDIDS FHHLNLRTLR GVADVHRISD PYKLVSRQHH PDRSTVWVGA PGRQVPIGPE TFTLIAGPCA VETAEQTLEA AQMARSAGAT ILRGGAFKPR TSPYAFQGLG VAGLRILADV GAATGLPVVT EVVDARDVAV VAEHADMLQV GTRNMANFGL LQAVGESGKP VLLKRGMTAT IEEWLMAAEY IAQRGNLDVV LCERGIRTFE PSTRNTLDIS AVPVVQATSH LPVVVDPSHA AGRKDLVVPL SRAAIAVGAD GVIVDVHPDP ETALCDGPQA LLGSELRDLA QAVRRLPEMV GRRPAADHAG
|
| |
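The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of how the GC content and basic CDS framing could be recomputed from that text. The helper names are hypothetical. The start-codon check covers only ATG, GTG and TTG, although translation table 11 permits further initiators, and the stop codons are assumed to be TAA, TAG and TGA.

```python
def clean_sequence(raw):
    """Join whitespace-separated groups into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq):
    """Rough framing check: whole codons, a common start codon, a stop codon."""
    starts = ("ATG", "GTG", "TTG")  # common initiators only (assumption)
    stops = ("TAA", "TAG", "TGA")
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in starts
        and seq[-3:] in stops
    )


# Demonstration on the first two groups of the gene sequence only.
fragment = clean_sequence("ATGCTGAGTC GCGGCCGTCC")
print(len(fragment), gc_fraction(fragment))
```

Passing the full gene sequence string through `clean_sequence` and `gc_fraction` allows a comparison with the GC content value listed in the record, and `looks_like_complete_cds` confirms that the sequence begins with a start codon and ends with a stop codon.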