Gene Noca_3092 details


Gene Information

Locus tag: Noca_3092
Symbol:
ID: 4597877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3294173
End bp: 3295285
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 74%
IMG OID: 639777698
Product: 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase
Protein accession: YP_924281
Protein GI: 119717316
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
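
The start, end, gene length, and protein length fields above are consistent with one another. The following minimal sketch shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 3294173
end_bp = 3295285
reported_gene_length = 1113    # bp
reported_protein_length = 370  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# which contributes no residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1113 370
```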


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.495784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAGTC GCGGCCGTCC GGCCGCCGCC GGTCACCTAC GGTTCTCCCG CCCCCGAGAG 
GAGTCCCGCA TGGTCGTCGT GATGTCCCCC GATGCCACCG CCGAGGACGT CGCCCACGTC
GTCGCGCGGG TCGAGAGCGT CGGCGGCAAG GCGTTCGTCT CGACCGGCGT GGTCCGCACG
ATCATCGGAC TGGTCGGCGA CATCGACTCC TTCCACCACC TCAACCTCCG CACCCTGCGC
GGCGTCGCCG ACGTGCACCG GATCTCCGAC CCCTACAAGC TCGTCAGCCG CCAGCACCAC
CCGGACCGGT CCACCGTCTG GGTCGGCGCC CCCGGCCGCC AGGTCCCGAT CGGCCCGGAG
ACGTTCACGC TGATCGCGGG ACCGTGCGCG GTCGAGACCG CGGAGCAGAC GCTCGAGGCG
GCGCAGATGG CCCGCTCCGC GGGGGCGACG ATCCTGCGCG GCGGCGCGTT CAAGCCACGG
ACCTCGCCGT ACGCCTTCCA GGGGCTGGGC GTCGCCGGGC TCAGGATCCT CGCCGACGTC
GGCGCCGCGA CCGGGCTGCC GGTCGTGACC GAGGTGGTCG ACGCCCGCGA CGTCGCCGTG
GTCGCGGAGC ACGCCGACAT GCTCCAGGTC GGGACGCGGA ACATGGCGAA CTTCGGGCTG
CTCCAGGCCG TCGGCGAGTC CGGGAAGCCG GTGCTGCTCA AGCGCGGGAT GACGGCCACG
ATCGAGGAGT GGCTGATGGC GGCGGAGTAC ATCGCCCAGC GCGGCAACCT GGACGTGGTC
CTCTGCGAGC GCGGCATCCG GACCTTCGAG CCGTCCACCC GCAACACCCT CGACATCTCC
GCCGTGCCCG TCGTGCAGGC CACCAGCCAC CTCCCGGTCG TCGTCGACCC CTCGCACGCT
GCGGGCCGCA AGGACCTGGT CGTCCCGCTG TCGCGGGCCG CGATCGCCGT CGGCGCCGAC
GGCGTGATCG TCGACGTCCA CCCGGACCCG GAGACCGCCC TGTGCGACGG ACCCCAGGCC
CTGCTCGGCT CCGAGCTGCG CGACCTGGCC CAGGCGGTAC GCCGGCTCCC CGAGATGGTC
GGCCGCCGAC CCGCGGCCGA CCACGCAGGC TGA
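
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. As a rough sketch, the GC content reported in the record could be recomputed as below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first display line is used here; pasting all lines gives the value for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the display spaces and line breaks and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence above (six blocks of ten bases).
first_line = "ATGCTGAGTC GCGGCCGTCC GGCCGCCGCC GGTCACCTAC GGTTCTCCCG CCCCCGAGAG"
seq = clean_sequence(first_line)

# Print the cleaned length and the GC content as a rounded percentage.
print(len(seq), round(100 * gc_fraction(seq)))
```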
 
Protein sequence
MLSRGRPAAA GHLRFSRPRE ESRMVVVMSP DATAEDVAHV VARVESVGGK AFVSTGVVRT 
IIGLVGDIDS FHHLNLRTLR GVADVHRISD PYKLVSRQHH PDRSTVWVGA PGRQVPIGPE
TFTLIAGPCA VETAEQTLEA AQMARSAGAT ILRGGAFKPR TSPYAFQGLG VAGLRILADV
GAATGLPVVT EVVDARDVAV VAEHADMLQV GTRNMANFGL LQAVGESGKP VLLKRGMTAT
IEEWLMAAEY IAQRGNLDVV LCERGIRTFE PSTRNTLDIS AVPVVQATSH LPVVVDPSHA
AGRKDLVVPL SRAAIAVGAD GVIVDVHPDP ETALCDGPQA LLGSELRDLA QAVRRLPEMV
GRRPAADHAG
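
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, not the database's own procedure. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in which codons may act as start codons; since this gene begins with ATG, that difference is ignored here. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence above.
print(translate("ATGCTGAGTC GCGGCCGTCC GGCCGCCGCC"))         # MLSRGRPAAA
print(translate("GGCCGCCGAC CCGCGGCCGA CCACGCAGGC TGA"))     # GRRPAADHAG
```

Both outputs match the corresponding ends of the protein sequence listed above.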