Gene Plav_3010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3010 
Symbol 
ID5455283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3212320 
End bp3214050 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content65% 
IMG OID640878598 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001414274 
Protein GI154253450 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.496774 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCA AGACATTCGA CAAATCGAAA CTGCCGAGCC GCCACGTCAC GGTCGGGCCG 
GAGCGCGCGC CGCACCGGTC CTACTACTAT GCGATGGGCC TCACCGAGGA AGAGATCAAT
CAGCCCTTCG TCGGCGTGGC GACCTGCTGG AACGAGGCGG CTCCCTGCAA CATCGCGTTG
ATGCGTCAGG CGCAGTCGGT GAAGAAGGGT GTCAAGGCCG CCGCCGGCAC GCCCCGCGAA
TTCTGCACCA TCACGGTGAC GGACGGCATC GCCATGGGCC ACCAGGGCAT GAAATCCTCG
CTCGCCAGCC GCGACGTGAT CGCGGATTCG GTCGAGCTCA CCATGCGCGG TCATTGCTAC
GACGCGCTGG TCGGCCTTGC CGGCTGCGAC AAGTCGCTGC CGGGCATGAT GATGTCGATG
GTTCGTCTCA ACGTCCCCTC CGTCTTCATG TATGGCGGCT CGATCCTGCC GGGCCATTTC
AAGGGCAAGG ACGTCACCGT GGTCGACGTC TTCGAGGCGG TCGGCCAGCA CTCGGCGGGC
AACATGGAAG ACGAAGAGCT GCATGCGCTC GAATGCGTCG CCTGTCCCTC CGCCGGCGCC
TGCGGCGGCC AGTTCACGGC CAACACCATG GCCTGCGTTT CGGAAGCCAT GGGCCTCGCG
CTGCCGGGTT CGGCGGGCGC GCCTGCGCCT TATGAGAGCC GCGACGAATA TGCGGAAGCC
TCGGGCCGCG CCGTCATGCA CCTCCTCGCC AACAACATCC GGCCGCGCGA CATCGTCACG
CGCAAGGCGC TCGAGAATGC CGCCGTCATC GTCGCGGCGA CGGGCGGTTC GACCAATGGC
GGGCTTCATC TCCCGGCCAT CGCGCATGAA GCGGGCATCG ATTTCGACCT GATGGAAGTC
GCCGAAATCT TCAAGAAGAC GCCCTACATC ACCGACCTGA AGCCCGGCGG CAATTACGTC
GCGAAGGATC TTCACGAGGC GGGCGGCGTT TCGATGGTCC TCAAGGTGCT GCTCGATGGC
GGCTATCTCC ACGGCGATTG CCTCACCGTC ACCGGCCAGT CGCTCGCAGA CAACCTCAAG
GACGTGAAGT TCAACCCGGA CCAGAAGGTC GTCTATCCGC TGTCCAATCC GCTCTCGCCC
ACCGGCGGCG TTGTCGGGTT GCAGGGCTCG CTCGCGCCGG ATGGCGCCAT CGTCAAGGTC
GCGGGCATGG AGAAGGATCA TCTCCGTTTC TCCGGTCCGG CGCGTTGCTT CGACAGCGAA
GAGGAATGCT TCGAGGCGGT CGACAAACGT CAGTACAAGG AAGGTGAGGT TCTCGTCATC
CGCTACGAGG GACCGAAGGG TGGCCCCGGC ATGCGCGAAA TGCTCTCCAC CACCGCCGCG
CTTTACGGCC AGGGCATGGG CGACAAGGTC GCGCTCATCA CCGATGGCCG CTTCTCCGGC
GGCACGCGCG GTTTCTGCAT CGGCCATGTC GGCCCGGAAG CCGCCGTCGG CGGCCCCATC
GCGCTGATCG AGGATGGCGA CATCATCACC ATCGACGCGG AGAACGGCAC CATCGATCTC
GAAGTCGACG AAGCCGTGCT CGAAAAGCGC CGCGCGAACT GGAAGCCCCG GGAGACCATG
TACGCCTCCG GCGCGCTGTG GAAATACGCG CAGCTCGTCG GCACCGCCCG CAAGGGCGCC
GTCACCCATC CGGGCGGCAA GGCGGAGAAA CATGTCTATG CGGATATCTG A
 
Protein sequence
MDAKTFDKSK LPSRHVTVGP ERAPHRSYYY AMGLTEEEIN QPFVGVATCW NEAAPCNIAL 
MRQAQSVKKG VKAAAGTPRE FCTITVTDGI AMGHQGMKSS LASRDVIADS VELTMRGHCY
DALVGLAGCD KSLPGMMMSM VRLNVPSVFM YGGSILPGHF KGKDVTVVDV FEAVGQHSAG
NMEDEELHAL ECVACPSAGA CGGQFTANTM ACVSEAMGLA LPGSAGAPAP YESRDEYAEA
SGRAVMHLLA NNIRPRDIVT RKALENAAVI VAATGGSTNG GLHLPAIAHE AGIDFDLMEV
AEIFKKTPYI TDLKPGGNYV AKDLHEAGGV SMVLKVLLDG GYLHGDCLTV TGQSLADNLK
DVKFNPDQKV VYPLSNPLSP TGGVVGLQGS LAPDGAIVKV AGMEKDHLRF SGPARCFDSE
EECFEAVDKR QYKEGEVLVI RYEGPKGGPG MREMLSTTAA LYGQGMGDKV ALITDGRFSG
GTRGFCIGHV GPEAAVGGPI ALIEDGDIIT IDAENGTIDL EVDEAVLEKR RANWKPRETM
YASGALWKYA QLVGTARKGA VTHPGGKAEK HVYADI