Gene Noca_3419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3419 
Symbol 
ID4598217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3621708 
End bp3623411 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content70% 
IMG OID639778025 
Productdihydroxy-acid dehydratase 
Protein accessionYP_924606 
Protein GI119717641 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGC CCACCTCCCG CCCCGACATC AAGCCCCGCT CCCGCGACGT CACCGACGGC 
CTGGAGAAGG CCGCCGCGCG CGGCATGCTG CGGGCGGTAG GCATGGGCGA CGAGGACTTC
GCGAAGCCGC AGATCGGCGT GGGCTCGAGC TGGAACGAGA TCACCCCGTG CAACCTGTCC
CTGGACCGGC TCGCCAAGGC CGTGAAGAAC GGCGTGCACG CGGCCGGCGG CTACCCGCTG
GAGTTCGGCA CCATCTCGGT CTCCGACGGC ATCTCCATGG GCCACGAGGG CATGCACTTC
TCGCTGGTCT CCCGCGAGGT GATCGCCGAC TCGGTCGAGA CCGTGATGAT GGCCGAGCGC
CTGGACGGCT CGGTGCTGCT CGCCGGCTGC GACAAGTCGC TGCCGGGCAT GATGATGGCC
GCCGCCCGCC TCGACCTGGC CAGCGTGTTC CTCTACGCCG GCTCGACGAT GCCCGGGCAG
GTGGACGGCA ACGACGTCAC GATCATCGAC GCGTTCGAGG CCGTCGGGGC GTGCCTGGCC
GGCAAGATCA GCCGCGACGA GGTGGACCGG ATCGAGCGGG CCATCTGTCC GGGCGAGGGC
GCCTGCGGCG GCATGTACAC CGCGAACACG ATGGCCTCCA TCGCCGAGGC GATCGGCATG
TCGCTCCCCG GCTCGGCGGC CCCGCCGGCG GTCGATCGCC GACGCGACGG CTTCGCGCAC
CGCTCGGGGG AGGCCGTGGT CAACCTGTTG CGCCAGGGCA TCACGGCCCG CCAGATCATG
ACTCGCGCGG CGTTCGAGAA CGCGATCACG GTCGCGATGG CGCTCGGCGG CTCGACCAAC
GCCGTGCTCC ACCTGCTCGC GATGGCTCGT GAGGCCGACG TCGACCTCAC CATCGACGAC
TTCAACCGGA TCGGTGACAA GGTGCCGCAC CTCGGCGACC TCAAACCGTT CGGCAAGTAC
GTCATGAACG ACGTCGACAA GATCGGCGGC ATCCCCGTCG TGATGAAGGC ACTGCTCGAC
GCCGGTCTGA TGCACGGCGA CGCGCTCACC GTCACCGGCA AGACCCTGGC CGAGAACCTC
GCCGAGCTGG CCCCGCCGGA GCTCGATGAC GAGGTGATCC GCAAGCTGGA CCGGCCGATC
CACAAGACCG GTGGGCTCAC GATCCTCAAG GGCTCGCTCG CTCCCGAGGG CGCCGTGGTC
AAGACGGCCG GCTTCGACGA CTCCGTGTTC ACCGGCACCG CCCGCGTCTT CGACGGCGAG
CGGGCCGCGA TGGACGCCCT CGAGGCCGGG CAGATCCAGC CCCGCGACGT CGTGGTGATC
CGCTACGAGG GCCCGAAGGG TGGCCCCGGC ATGCGCGAGA TGCTCGCGAT CACCGGCGCG
ATCAAGGGCG CCGGCCTCGG CAAGGACGTC TTGCTGATCA CCGACGGCAG GTTCTCCGGT
GGTACGACGG GCCTGTGCGT GGGCCACATC GCCCCCGAGG CCGTCGACGG CGGCCCGATC
GCGTTCGTCC GCGACGGCGA CCAGATCACC CTCGACGTCG CCAACCGGCT GCTCGAGGTG
GAGGTCGTCG GTCCCGATGC CGAGGCCGAG TGGGAGCGCC GCAAGGTCGG CTGGGAGCCG
AATCCGCCCA AGTACACCCG CGGGGTGCTC GGCAAGTACG CCAAGATCGT CCAGTCCGCC
GCGCACGGCG CCATCACCGG CTGA
 
Protein sequence
MTEPTSRPDI KPRSRDVTDG LEKAAARGML RAVGMGDEDF AKPQIGVGSS WNEITPCNLS 
LDRLAKAVKN GVHAAGGYPL EFGTISVSDG ISMGHEGMHF SLVSREVIAD SVETVMMAER
LDGSVLLAGC DKSLPGMMMA AARLDLASVF LYAGSTMPGQ VDGNDVTIID AFEAVGACLA
GKISRDEVDR IERAICPGEG ACGGMYTANT MASIAEAIGM SLPGSAAPPA VDRRRDGFAH
RSGEAVVNLL RQGITARQIM TRAAFENAIT VAMALGGSTN AVLHLLAMAR EADVDLTIDD
FNRIGDKVPH LGDLKPFGKY VMNDVDKIGG IPVVMKALLD AGLMHGDALT VTGKTLAENL
AELAPPELDD EVIRKLDRPI HKTGGLTILK GSLAPEGAVV KTAGFDDSVF TGTARVFDGE
RAAMDALEAG QIQPRDVVVI RYEGPKGGPG MREMLAITGA IKGAGLGKDV LLITDGRFSG
GTTGLCVGHI APEAVDGGPI AFVRDGDQIT LDVANRLLEV EVVGPDAEAE WERRKVGWEP
NPPKYTRGVL GKYAKIVQSA AHGAITG