Gene Bpro_2057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_2057 
Symbol 
ID4015283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp2138067 
End bp2139137 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID637941729 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_548885 
Protein GI91787933 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.011824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.166362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCC CCATCTCGGA CATCCACATC GCCCAGGCCG ACCCGCTGCC GGAACCCCGG 
CTGCTGCGTG ACGAACTGCC GGCGGGCGAC GCTGAAGCCG AATTCATCGC CGCGTCGCGC
GCCGCCACCC GCAACATCTT GCGCGGCCTG GACGACCGGC TGCTGGTGGT CGTGGGCCCC
TGCTCTATCC ACGAGCCCGA GTCGGCGCTC GAGTACGCCG CGCGGCTGCG CCAGGAAGCG
GTGCGCCTTG GCGAATCGCT GCTGCTGGTG ATGCGCGTTT ACTTCGAAAA GCCGCGCACG
CGCATGGGCT GGAAGGGCCT GATTTATGAC CCGGGGCTGG ACGGCCAAGG CGACATCGGC
GAGGGCCTGC GCCACGCGCG GCGCATTTTG CTCGATTGCG CGCGGCTGGG TGTGCCGGCA
GCCTCTGAAA TCCTGGACTT GGTGACGCCG CAGTATTACG CCGAGCTGTT GACCTGGGGC
GCGATTGGCG CCCGCACGGT AGAAAGCCCG CTGCACCGGC AGATGGCTTC GGCCCTGTCG
GCGCCCGTGG GCTTCAAGAA CGCCACCAAC GGCAGCGTGG GCGCGGCCAT CGACGCCATC
CATGTGGCTG CGCAGCCGCA CCGCTTTCCG ACCATCTCGC TCGAGGGCCG GGCCATGGTC
ATCACGACCA CCGGCAACCC CGATGGTCAC CTGGTATTGC GCGGCGCCAG TGACGGGCCA
AACTACGACG CCGCCAGCGT CGGGCGCGCC ACCGAGGCCC TGGAGAAATC CGGCCTGCCG
CCCCGCCTGG TGATCGACTG CAGCCACGGC AACAGCAACA AGGACTATTC GAGGCAACCT
GCGGTGGCGG CCGACATTGC GCAGCAGGTC GCCAGCGGCT CGACCGGCAT CTGCGGCCTG
ATGATTGAAA GCCACCTGGT CGAGGGCCGG CAGGACATCG TCGACGGCCG CAAGGGCCTG
CACTATGGGC AAAGCGTGAC TGACGCCTGC ATCGGCTGGG AGGCGACCGT GGCCGTGCTG
GAGCAGCTGG CGGCGGCCGT GCGCCAGCGC CGGGCGGGCG CCATCAGGTA A
 
Protein sequence
MTIPISDIHI AQADPLPEPR LLRDELPAGD AEAEFIAASR AATRNILRGL DDRLLVVVGP 
CSIHEPESAL EYAARLRQEA VRLGESLLLV MRVYFEKPRT RMGWKGLIYD PGLDGQGDIG
EGLRHARRIL LDCARLGVPA ASEILDLVTP QYYAELLTWG AIGARTVESP LHRQMASALS
APVGFKNATN GSVGAAIDAI HVAAQPHRFP TISLEGRAMV ITTTGNPDGH LVLRGASDGP
NYDAASVGRA TEALEKSGLP PRLVIDCSHG NSNKDYSRQP AVAADIAQQV ASGSTGICGL
MIESHLVEGR QDIVDGRKGL HYGQSVTDAC IGWEATVAVL EQLAAAVRQR RAGAIR