Gene Bpro_4344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4344 
Symbol 
ID4012993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4581047 
End bp4582627 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content57% 
IMG OID637943994 
Productextracellular solute-binding protein 
Protein accessionYP_551131 
Protein GI91790179 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.133353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTC CTGTTCTCAC CATTAAGACG ACCGCGTTTG CCGCCCTCTT GGCGTTCAGT 
GGCATCTGTC CGCCTGCGTT CGCCCAAGCT AAGGACCGCC CTGTGCTTAG CATCAGTTCG
GACGCAACAG ATATCTCAAC GCTCGACCCG CACCGCGCGA CAACTTCGGA CGACAAGGGC
GTCGCGTCGC AAGTGTTCAA CGGCCTGGTG CGCTTTCCAC CGGGCAGCGC CGATCCAAAA
GAGCTTCAGC CCGATCTCGC GGAAAAGTGG CAATCATCCC CCGACAAAAG GACTTGGACG
TTCTATCTGC GCAAGGGTGT GCAGTTCCAT GGTGGATATG GCGAGGTGAA GGCCGCTGAC
GTTGTGTATT CGTTCATGCG TGCAGCCGAC AAAGGTCGAT CCAGTTTTGC AGCGAACTTC
GAAGTCATCG AAAAAGTTGA GGCGCGGGAC GACTACACCG TGCGTTTCTT GCTGAAGTAT
CCGGACGCCG CGTTCCTCGG TCGCGTATCG AATTACCACG CTGGCAACAT CGTTAGTCAG
GCGGCGGCCG AAAAACTCGG GGACAGATTC GGTCAGTCAC CGATCGGCAC CGGTCCATTT
GCATTCAGCG AACACGTTAC GCAGCAATAT GTGAAGTTGG TCGCTAACGA CGCGTACTTC
CGTGGCGCGC CAAAACTGGC CGCCATTACC TATCGCATGA TTCCGTCGGA CAGCGCGCGT
GAACTGGCGT TTACGTCCAA AGAAGTCGAC ATGATGAGTG GTAAGCGGGA GCAACGCTGG
GTGGACGGCG CGAGCAAACG CGGCATGAAG GTCGATGTCT TTGACCCTGC CGAGTTCCGT
ACGCTTCACA TCAATCGCAC CATTAAGCCC TTGGATAATC TGAAGGTGCG CCAGGCAATC
GCTGCGGCGG TCAACGTGGA CGAAATCCTG AGGTATGTCG GCAAAAGTGT GGGCGACAAA
GGCTGCTCGA TCATCCCCAA CGGCTATCTA GGCGAGGATT GCAGCGCGGG GGGCTACACC
TATGACGTTG CGCGCGCCAA GAAATTGTTG ACCGAAGCCG GTTTTCCTAA CGGAATTTCT
ATCAAGTCGG TGGTGTCGAA CGTCTCTGCG CAGCAACCCA TCATGCAAAT TGTCCAGTCA
CAGCTGGCCA AAGCCGGCAT CAAGCTGGAA ATGGAGGTCG TGGACCATGC GACCTATCAA
GCCAAAAGCC GGAAAGACCA AAGCGCATTG GTGTTCTATG GCGCGGCTCG TTTTCCTATC
GCAGATGTCT GGCTCACGGA GTTCTACGAC TCGGCCGCCT CCATTGGGGC GCCTAAGGCA
ATTTCCAACT TCTCGCATTG CTCCGTGGGT GACGACGCCA TCCGGCAGGC GCGCATTGAG
CCTGATGCGC AGAAGCAGCT GGCATTGTGG AAACAAGCGC AGCAAAAGAT CCATGCGGAT
GTCTGCTCCG TGCCGCTCTT TGGCCTGAAG CAAGTGTGGG TGCATAGCGA CCGCGTGAAC
TACGGCTATA CGCTGAAAGG CGCCCTGAAC CTACAGCCGC CCATCACCGA ACTGGCGACC
GTTACGCGGA GCGCACCTTG A
 
Protein sequence
MKLPVLTIKT TAFAALLAFS GICPPAFAQA KDRPVLSISS DATDISTLDP HRATTSDDKG 
VASQVFNGLV RFPPGSADPK ELQPDLAEKW QSSPDKRTWT FYLRKGVQFH GGYGEVKAAD
VVYSFMRAAD KGRSSFAANF EVIEKVEARD DYTVRFLLKY PDAAFLGRVS NYHAGNIVSQ
AAAEKLGDRF GQSPIGTGPF AFSEHVTQQY VKLVANDAYF RGAPKLAAIT YRMIPSDSAR
ELAFTSKEVD MMSGKREQRW VDGASKRGMK VDVFDPAEFR TLHINRTIKP LDNLKVRQAI
AAAVNVDEIL RYVGKSVGDK GCSIIPNGYL GEDCSAGGYT YDVARAKKLL TEAGFPNGIS
IKSVVSNVSA QQPIMQIVQS QLAKAGIKLE MEVVDHATYQ AKSRKDQSAL VFYGAARFPI
ADVWLTEFYD SAASIGAPKA ISNFSHCSVG DDAIRQARIE PDAQKQLALW KQAQQKIHAD
VCSVPLFGLK QVWVHSDRVN YGYTLKGALN LQPPITELAT VTRSAP