Gene Information |
Locus tag | Noca_0204 |
Symbol | |
ID | 4598957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 219141 |
End bp | 220118 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639774817 |
Product | 2,3-dimethylmalate lyase |
Protein accession | YP_921436 |
Protein GI | 119714471 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2513] PEP phosphonomutase and related enzymes |
TIGRFAM ID | |
|
|
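The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration; they are not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length = gene_length(219141, 220118)
print(length)                  # 978
print(protein_length(length))  # 325
```

Both results match the Gene Length (978 bp) and Protein Length (325 aa) fields of this record.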
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACAGCA GCAGCCACCA GGAGTTCCCG CACCTGGCCG ACCTGGTCGG CGTCGGCCCC GCGCCGACCG CGCACGACCG GCGTACGGCG CTGCGCACCG CACTGCGAGA GGCCACGGCC GGCGCCGGAC CGCTGCTGCT CCCGGGGGTC ACTGACGCCC TCGGTGCGCG GCTCGTCGAG GCCGCGGGCT TCGGCGCGGC CTATGCGACC GGGGCCGGGC TCGCCAATGC GCAGTACGGG CTGCCCGACC TCGGCCTGGT CTCCCTCGGC GAGGTCGCCG ACCACGTCGG CCGCATCACC GAGGCCACCA GGCTGCCGGT CGTCGTCGAC GCGGACACCG GGTACGGCGG TCCGCTGGCG GCGATGCGCA CGATGCGGCT GCTCGAACGC GCCGGGGCTG CCGGCATCCA GCTCGAGGAC CAGGAGATGC CCAAGCGGTG CGGGCACTTC GACTCCCACA CGCTGATCCC GCTCGGGCAC ATGCAGGCCA AGATCGCCGC GGTCCTCGAC GCCCGCGAGG ACGACGCGAC GGTGCTCGTC GCACGCACCG ACGCCCGCAG CGCCGAGGGG ATCGACCGGG CGGTCGAGCG GGCGCGCGCC TACGTCGAAG CGGGCGCGGA CGTGATCTTC GTCGAGGCGC CGCGCACGGT CGCGGAGCTG ACGCTGGTCG GCCGCGAGCT GGCCGGCACC CCCCTGGTCG TCAACGTCGT CGAGGGCGGC AAGACCCCAC ACCTCAGCGC GCAGGAGTAC GCCGACCTCG GCTTCACCGT CGTGCTGCAC GCCAACTACC TGATGCGGTC GATGATGTCG GCCGGCCGCG CCGCGCTCGC CCACCTGGCG GCCGCCGGCG AGACCGTGAC CCGGGCCGAG CAGATGGCGA CCTGGAGCGA GCGTCAGTCG CTGTTCCACC TCCCGGCGTT CACCGCCGCC GAGGCCTACT TCGACCAGCC CCTCGACGTG CTTCGGAGCG AGCGGTGA
|
Protein sequence | MNSSSHQEFP HLADLVGVGP APTAHDRRTA LRTALREATA GAGPLLLPGV TDALGARLVE AAGFGAAYAT GAGLANAQYG LPDLGLVSLG EVADHVGRIT EATRLPVVVD ADTGYGGPLA AMRTMRLLER AGAAGIQLED QEMPKRCGHF DSHTLIPLGH MQAKIAAVLD AREDDATVLV ARTDARSAEG IDRAVERARA YVEAGADVIF VEAPRTVAEL TLVGRELAGT PLVVNVVEGG KTPHLSAQEY ADLGFTVVLH ANYLMRSMMS AGRAALAHLA AAGETVTRAE QMATWSERQS LFHLPAFTAA EAYFDQPLDV LRSER
|
| |
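The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch of translation under translation table 11 (the bacterial, archaeal and plant plastid code). It assumes the displayed gene sequence is already the coding strand read 5' to 3', which is consistent with the record: the gene is on the minus strand, yet the sequence begins with GTG and ends with TGA. The helper names `clean`, `gc_content` and `translate` are hypothetical and introduced only for this example. Table 11 uses the same codon-to-amino-acid assignments as the standard code but accepts additional start codons, so an initial GTG is read as methionine.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases (raises ZeroDivisionError on an empty sequence)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_codons = "GTGAACAGCA GCAGCCACCA GGAGTTCCCG"
print(translate(first_codons))  # MNSSSHQEFP
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.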