Gene Bpro_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_2052 
Symbol 
ID4015278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp2132690 
End bp2134384 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content63% 
IMG OID637941724 
Productdihydroxy-acid dehydratase 
Protein accessionYP_548880 
Protein GI91787928 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0538673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCA AGACCATTCA GCTCAACCCC CGCAGCAAAA ACATCACCGA AGGCAAATCG 
CGCGCGCCCA ACCGCTCGAT GTACTACGCC ATGGGCTATG AAGAGGCCGA CTTCAAGAAG
CCCATGATTG GTGTGGCCAA CGGGCACAGC ACCATCACCC CCTGCAACAG CGGCCTGCAA
AAGCTGGCGG ACGCGGCCAT TGCCGGCATT GAAGAAGCCG GCGGCAACGC CCAGGTGTTC
GGCACGCCCA CCATCTCGGA CGGCATGGCC ATGGGCACCG AGGGCATGAA GTACTCGCTG
GTCAGCCGCG AAGTGATCTC CGACTGCATC GAAACCTGCG TGCAGGGCCA GTGGATGGAC
GGCGTGCTGG TGATCGGCGG CTGCGACAAG AACATGCCCG GCGGCCTGAT GGGCATGCTG
CGCGCCAACG TGCCGGCCAT CTACGTCTAT GGCGGTACCA TTTTGCCGGG CAGCTACAAG
GGCAAAGACC TCAACATCGT CAGCGTGTTT GAAGCCGTCG GCGAAAACGC AGCAGGCCGC
ATGAGCGATG AAGACCTGCT GCAAATCGAG CGCCGCGCCA TTCCCGGCAC CGGTAGCTGC
GGCGGCATGT ACACGGCCAA CACCATGTCC AGCGCCTTCG AGGCGCTCGG CATTTCGCTG
CCCTACTCCA GCACCATGGC CAATCCGCAC GACGAGAAAA TGAACTCGGC CAGGGAGTCC
GCCAAGGTCC TGGTCGAAGC CATCAAGAAA GACATCAAGC CGCGCGATCT CGTCACGAAG
AAAGCCATTG AAAACGCCGT GGCAGTGATC ATGGCCACGG GCGGCTCCAC CAATGCCGTG
CTGCACTTCC TGGCGATTGC GCATGCCGCC GGCGTGGACT GGACAATCGA CGACTTCGAA
CGCGTGCGCC AAAGAACGCC GGTGCTGTGC GACCTGAAGC CGTCCGGCAA GTACCTGGCC
GTGGACCTGC ACCGCGCCGG CGGCATTCCG CAGGTCATGA AGATGCTGCT GGCGGCCGGC
CTGCTGCATG GCGACTGCCT GACGATCACC GGCCAAACCA TTGCCGAGGT GCTGAAGGAT
GTGCCCGAAG CGCCGCGCGC CGACCAGGAC GTGATTCGCC CCATCAGCAA CCCCATGTAC
GCCCAGGGCC ACCTGGCCAT CCTGAAGGGC AACCTCTCGC CTGAAGGCTG CGTGGCCAAA
ATCACCGGCC TGAAAAACCC GGTCATGACG GGCCCGGCCC GCGTATTTGA CGACGAGCAG
TCGGCGCTGG CCGCCATCCT GGCCGGCAAG ATCAAGGCGG GCGACGTGAT GGTGCTGCGT
TACCTCGGCC CCAAGGGCGG TCCCGGCATG CCTGAAATGC TGGCGCCTAC CGGTGCGCTG
ATTGGCGCCG GCCTGGGCGA AAGCGTGGGC CTGATCACCG ACGGCCGCTT CTCCGGCGGC
ACCTGGGGCA TGGTAGTCGG CCATGTGGCC CCCGAAGCGG CCGCCGGCGG CAATATCGCA
TTCATCAACG AAGGCGACTC CATCACCATT GACTCAAAAC AGTTGCTGCT GCAACTGAAC
ATCAGCGACG CGGAGCTGGA AAAACGCAAG GTCGGCTGGA AAGCACCGGC ACCGCGCTAC
AACCGCGGCG TGCAGGCCAA GTTTGCCTTC AACGCGTCGA GTGCCAGCAA GGGTGCGGTG
CTGGACGACT ATTGA
 
Protein sequence
METKTIQLNP RSKNITEGKS RAPNRSMYYA MGYEEADFKK PMIGVANGHS TITPCNSGLQ 
KLADAAIAGI EEAGGNAQVF GTPTISDGMA MGTEGMKYSL VSREVISDCI ETCVQGQWMD
GVLVIGGCDK NMPGGLMGML RANVPAIYVY GGTILPGSYK GKDLNIVSVF EAVGENAAGR
MSDEDLLQIE RRAIPGTGSC GGMYTANTMS SAFEALGISL PYSSTMANPH DEKMNSARES
AKVLVEAIKK DIKPRDLVTK KAIENAVAVI MATGGSTNAV LHFLAIAHAA GVDWTIDDFE
RVRQRTPVLC DLKPSGKYLA VDLHRAGGIP QVMKMLLAAG LLHGDCLTIT GQTIAEVLKD
VPEAPRADQD VIRPISNPMY AQGHLAILKG NLSPEGCVAK ITGLKNPVMT GPARVFDDEQ
SALAAILAGK IKAGDVMVLR YLGPKGGPGM PEMLAPTGAL IGAGLGESVG LITDGRFSGG
TWGMVVGHVA PEAAAGGNIA FINEGDSITI DSKQLLLQLN ISDAELEKRK VGWKAPAPRY
NRGVQAKFAF NASSASKGAV LDDY