Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0509 |
Symbol | |
ID | 4597408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 539330 |
End bp | 540292 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639775123 |
Product | O-succinylbenzoate synthase |
Protein accession | YP_921738 |
Protein GI | 119714773 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000393904 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCATGA TCGTCTGGTC GGTGCCGATG CGGACCCGGT TCCGCGGCAT CACGGTCCGC GAGGGGGTGC TGCTGCGGGG CCCCGCCGAT GCCGAGCGGC CGGGTTGGGG GGAGTGGAGC CCGTTCCTCG AGTACGACGC GGCCGTCGCC GAGCCCTGGC TGCGCTGCGC GGAGGAGGCT GCGGCGGGCG ACTGGCCGGC GCCGCTGCGC GACCGGGTGC CGGTGAACGT CACCGTGCCG GCCGTGGGAC CACAGCAGGC GCACGCGATC GTGCTGGCCG GCGGCTGCCG GACGGCCAAG GTCAAGGTCG CCGAGCCCGG CCAGACCGCG GCCGACGACC AGGCGCGGCT CGAGGCGGTC CGAGACGCGC TGGGCGCAGA CGGGCGGGTG CGCATCGACG TCAACGGCCT GTGGGACCTC GACACCGCGG TCGCCTCGAT CCCCGTGCTC GACCGGGCGG CCGGTGGCCT CGAGTACGTC GAGCAGCCCT GCGCGAGCGT CGAGGACCTG GCCGCCGTAC GCCGCCGGGT CGACGTGCCC ATCGCCGCCG ACGAGTCGAT CCGCCGGGCC GCGGACCCCT ACCGGGTGCG CGACCTGGAG GCCGCCGACG TCGCGGTGCT CAAGGTGCAG CCGCTCGGCG GGGTGCGGGC CTGCCTGCGG ATCGCCGAGG ACATCGGGCT GCCGGTCGTG GTGTCCTCGG CGCTGGAGAC CAGCGTCGGG ATCGCCGCCG GGGTCGCCTT GGCCGCCGCC CTGCCGGAGC TGCCGTACGC CTGCGGGCTC GCCACCGTGC AGCTGCTCAC GGCCGATGTC GTCACCGAGT CACTCCTGCC GGTCGGCGGC GAGCTGCCGG TGCGCGCAGT GCAGGTCGAC GAGTCGCTCG CGCCCGCCGA CCCGGACAGG GTCGCCCACT GGGAGGCGCG GCTGGCCGAG GTGCGCGCGC TGCGACAGGA TCGTCCCTCG TGA
|
Protein sequence | MGMIVWSVPM RTRFRGITVR EGVLLRGPAD AERPGWGEWS PFLEYDAAVA EPWLRCAEEA AAGDWPAPLR DRVPVNVTVP AVGPQQAHAI VLAGGCRTAK VKVAEPGQTA ADDQARLEAV RDALGADGRV RIDVNGLWDL DTAVASIPVL DRAAGGLEYV EQPCASVEDL AAVRRRVDVP IAADESIRRA ADPYRVRDLE AADVAVLKVQ PLGGVRACLR IAEDIGLPVV VSSALETSVG IAAGVALAAA LPELPYACGL ATVQLLTADV VTESLLPVGG ELPVRAVQVD ESLAPADPDR VAHWEARLAE VRALRQDRPS
|
| |