Gene Bpro_4392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4392 
Symbol 
ID4012924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4636214 
End bp4637779 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content61% 
IMG OID637944041 
Productextracellular solute-binding protein 
Protein accessionYP_551178 
Protein GI91790226 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.779012 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATA GTTTTCCGTC CGTCACCCGT CCGCTCACCG CCGCGGGCCT TGCCCTGATG 
CTGCTCGGCG GTGCCGGCCA TGCTGCAGCC GAGACCAGCT TGACCTATGT CCAAGGCACA
GAGATCGACA CACTCGATCC AGCCGTGAGC CGTAGCACCC CCTCGCAAAT CGTCATCACG
CATATTTTTA ACAAATTGGT GAGCTGGGAC GGTCCTGGCT TCAAGAAGAT CGTTCCTGAC
CTTGCGGAGT CATGGAACGT CTCTGCTGAC GGGAAGGTCT GGACTTTCAA TCTTCGCAAA
GGAGTGAAAT TCCATGATGG GACGCCACTG GACGCGGCGG CTGTCAAGTT CAACCTGGAC
CGTTTGCGCA GTCCCGAACT GGGTTCGCCG AACCGGTCCT ACTACGCTGC TATTGAGACG
GTAGAGGCCC CGAGCACGTA CGTCGTGACC ATTGCCGCCA AGGAACCGTC GCCCACACTG
CTTGAGATTC TGACCGAGGA ATGGGCGTCC ATCAGCAGCC CCACAGCCAT CGAAAAGCGT
GGTCGCGCCT ATGGCCGCAA CCCCGTGGGT ACCGGACCCT ATGTATTCAA GAGCTGGATT
CCCAATGAGC GCGCCGAGAT TGTGCGCAAC CCTGACTACT TCGGCACTCC CGGGAAAACA
GACCGACTGG TGTTTCGCCC CGTGCCGGAG AATGCTGCGC GCGTGATCGA GCTGAAGACC
GGAAACGCGG ACGTTGCCGC GAACATTGCG CCGGAACTCA CCGGTGACTT CAAGGGCAGC
GACAAAATGG TGCTGCAACA AGCCCCTAGT GCCTTTCAGG TGTTCTTCGA GCTGAACGTC
ACCAAGCCCC CCTTTGACGA CGTGCGCGTA CGCAAGGCGG TGAATGTGGC CATAGACCGC
CAAGCCATCG TCGACAAGAT CCTGCTGGGC TATGGCCGCG TTCCCGTCAG CCCCTTTCCA
GAGGGAACGC AGGCTCGCCG CATATTTCCG GCTTACAAAT ACAACCCGGA CGAGGCGCGC
CGCCTGCTGA AGGCCGCATT CCCTGGTGGG TACCCCGGTG TCGTGACCAT GTGGACTCCC
TCGGGCCGTT ACACAAAGGA TCGTGCCGCC GCCGAAGCCG TACAGGGCTA CCTCAACGCC
GTCGGTCTGA AGACGGAGTT CAAGGTCTGG GAGTGGGCCT CCTACCAGAA GGAGCTGTAC
CGCGCCGAGC CCGGCAAGGG TACGGGCAAG GGAAGCAACG GCGCCAACAT GTGGCTGCTG
GGCACCGGCA TCCCGAATGC GGATATCCGC CTGCGCCGCA AGCTGAGCAC GGGCGACCCG
TCCAACCTCA CCGGCTACTC CAACCCGATG GTCGACGGCC TGCTGGCCAA AGCTGCCGGC
GAGATGAACT ACGACAAACG CATGGCGACG TACGGCGACG TCCAGCGTGT CGTCTGGGAG
CTCGAACCCA ACACGATACC GCTATTTGAC CAGGTTCAAC TCATCGGCCA ACGCAAGGGC
GTCAAGGGTC TGACCGTCTA TAGCGACGAG ATCGTCGAAT TTAACAACGC AACCGTCACC
CGCTAA
 
Protein sequence
MRHSFPSVTR PLTAAGLALM LLGGAGHAAA ETSLTYVQGT EIDTLDPAVS RSTPSQIVIT 
HIFNKLVSWD GPGFKKIVPD LAESWNVSAD GKVWTFNLRK GVKFHDGTPL DAAAVKFNLD
RLRSPELGSP NRSYYAAIET VEAPSTYVVT IAAKEPSPTL LEILTEEWAS ISSPTAIEKR
GRAYGRNPVG TGPYVFKSWI PNERAEIVRN PDYFGTPGKT DRLVFRPVPE NAARVIELKT
GNADVAANIA PELTGDFKGS DKMVLQQAPS AFQVFFELNV TKPPFDDVRV RKAVNVAIDR
QAIVDKILLG YGRVPVSPFP EGTQARRIFP AYKYNPDEAR RLLKAAFPGG YPGVVTMWTP
SGRYTKDRAA AEAVQGYLNA VGLKTEFKVW EWASYQKELY RAEPGKGTGK GSNGANMWLL
GTGIPNADIR LRRKLSTGDP SNLTGYSNPM VDGLLAKAAG EMNYDKRMAT YGDVQRVVWE
LEPNTIPLFD QVQLIGQRKG VKGLTVYSDE IVEFNNATVT R