Gene Bpro_0139 details

Gene Information

Locus tag: Bpro_0139
Symbol:
ID: 4012107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 145010
End bp: 146572
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 62%
IMG OID: 637939823
Product: extracellular solute-binding protein
Protein accession: YP_547002
Protein GI: 91786050
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
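
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state this, but it is the only reading that yields the listed 1563 bp) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 145010
end_bp = 146572

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed
# to be a stop codon and therefore contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```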


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.382397
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAAC GTATGACACC GCTTCGACCC AAGGCCCTGG CTGCCAGCCT GCTGGCGCTG 
CTGGCCCTGG CCTCCACTGG CTCGGCCCTT GCCGCCAAGG ACGTGGTACT GGCCATCGGC
GGCCAGCCCG AAACGCTGGA CCCCTACAAC ACCAACACCA CGCTGACCAC GGCAGTCACC
AAGTCCTTCT ACCAGGGCCT GTTCGAGTTC GACAAGGACC TCAAAATCCA GAACGTGCTG
GCCGAAAGCT ACACCGCCTC CAAAGACGGC CTTGTCTACA CCCTCAAGCT GCGCCAGGGC
GTCAAGTTCC ATGACGGCAC CGACTTCAAT GCCGAAGCCG TCAAGGTCAC GCTGGACCGG
GTGCTCAACC CCGACAACAA ACTGGCCCGC TTCAACCAGT TCAACCGCGT GGACAAGGTC
GAGGCGATTG CACCGTACAG CGTGCGCATC ACGCTCAAGG AGCCTTTTGG CCCCTTCATC
AACTCGCTGG CGCACGCCTC GGCCGCGATG ATCTCCCCGA CGGCACTCAA AAAGTACGGC
AACAAGGACA TCGCCTTCAA CCCGGTGGGC ACCGGCCCCT TCACTTTTGT GGAGTGGAAG
CAGACCGATT TCGTCAAGGT CAGGAAATTC GACGGCTACT GGAAAAAGGG CTACCCCAAG
GTCGACAGCG TGACCTGGAA ACCCGTGCTG GAGAACAACA CACGCGCCGC CATGCTGCAG
ACCGGCGAAA CCGACTTTGC CTTCCCGCTG CCCTACGAGC AGGCCACCGA ACTGGAGAAA
AACGCCAAGC TCAAGCTGGT GAGCGGCCCG TCCATCATCA CGCGCTACCT CAGCTTCAAC
ATGCAGCAAA AGCCTTTCGA CAACCTCAAG GTGCGCGAAG CCATCAACTA CGCCATCAAC
AAGGAAGCGC TCGCCAAAGT GGCCTTCGGT GGCTATGCCG TCCCGGCCGA GGGCATCGTG
CCGCAAGGGG TGAAGTACGC CCACAAGATG GCGCCCTGGC CTTATGACCC GAAGAAGGCG
CGCGAGCTGC TCAAGGAAGC CGGTTACGCC AACGGCTTCG AGTCGGTGCT CTGGAGCGCC
TACACCACCA CCACCGCGCA GAAAACCATC CAGTTCGTGC AGCAGCAGCT GGCCCAGGTG
GGCATCAAGA TCTCGGTGCA GTCACTCGAA CCCGGCCAGC GTACCGAGTG GGTGCAAACC
GCGCCGGACC CCAAAACCGC GAAAGTGCGC ATGTACTACG CCGGCTGGTC TTCTTCGACC
GGTGAAGCCG ACTGGGCGCT GCGCCCGCTG CTGGCCACCG AAGCCTGGCC GCCCAAGCTG
AACAACACCG CCTACTACAG CAACGAGAAG GTGGACGGCG CTATCGCCAA GGCCCTGAAG
TCGGTCGATG ACAAGGAAAA AGCCGACCTG TATCGCCAGG CACAGGAGCA GATCATGAAA
GACGCGCCCT GGGCGCCGCT GGTGACGGAG AAAAACCTGT ACGCCACAAG CAAGCGCCTG
TCGGGCGTCT ACGTGATGCC TGACGGCAAC ATCAATGCGG ACGAGATCGC GGTCGCCGAG
TAA
 
Protein sequence
MTERMTPLRP KALAASLLAL LALASTGSAL AAKDVVLAIG GQPETLDPYN TNTTLTTAVT 
KSFYQGLFEF DKDLKIQNVL AESYTASKDG LVYTLKLRQG VKFHDGTDFN AEAVKVTLDR
VLNPDNKLAR FNQFNRVDKV EAIAPYSVRI TLKEPFGPFI NSLAHASAAM ISPTALKKYG
NKDIAFNPVG TGPFTFVEWK QTDFVKVRKF DGYWKKGYPK VDSVTWKPVL ENNTRAAMLQ
TGETDFAFPL PYEQATELEK NAKLKLVSGP SIITRYLSFN MQQKPFDNLK VREAINYAIN
KEALAKVAFG GYAVPAEGIV PQGVKYAHKM APWPYDPKKA RELLKEAGYA NGFESVLWSA
YTTTTAQKTI QFVQQQLAQV GIKISVQSLE PGQRTEWVQT APDPKTAKVR MYYAGWSSST
GEADWALRPL LATEAWPPKL NNTAYYSNEK VDGAIAKALK SVDDKEKADL YRQAQEQIMK
DAPWAPLVTE KNLYATSKRL SGVYVMPDGN INADEIAVAE
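
As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates the first 30 bases of the gene and compares the result with the start of the protein. It assumes the standard codon assignments, which translation table 11 shares with the standard code for amino acids (the two differ in which start codons are permitted). The helper name `translate` and the table construction are illustrative, not part of the source database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACAGAAC GTATGACACC GCTTCGACCC"
print(translate(first_blocks))  # compare with the first 10 residues of the protein
```

Passing the full gene sequence to the same function should reproduce the 520-residue protein sequence, with the trailing TAA read as the stop codon.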