Gene Bpro_2113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_2113 
Symbol 
ID4015224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp2203070 
End bp2204653 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content61% 
IMG OID637941783 
Productextracellular solute-binding protein 
Protein accessionYP_548938 
Protein GI91787986 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.142361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0512396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCA AGCCAAACCT GCTCTCGGTC GCCGTGTTTT GTGCGGTAGC TGCTACAAGT 
TTTGCAGCGT CAGCGCAGAC CCTGCGCGTC GCCAACCAGG GTGACGCGCT GTCCATGGAC
CCGCATTCGC TCAACGAATC GATGCAGCTC AGTGTGACGG GCAATGTTTA CGAAGGCCTG
GTGATTCGCA ACAAGGACCT GAGCCTGGCC CCGGGCCTGG CGACCGCCTG GAAGCAGACC
TCGCCGACCG TGTGGCGTTT TGAGCTGCGC AAGGGCGTGC AGTTTCATGA CGGCACGCCT
TTCACGGCCG ATGACGTGGT GTTCAGCTTC CAGCGCACGC AGGTCGAAGG CTCCGACATG
AAGAGCTACA CCAACGACTT CAAGGAAGTG CGTAAGGTCA ACGACCACGT GGTCGAAATC
GAAACCAAGA CGCCTTTTCC CATCCTTCCT GATGTGATCT CGCTGGTCTA CATGATGAGC
AAGAAGTGGT GCGAGACCAA TCAGGCAGCC AAGCCGGTGG ACCGTCGCAA GGGTGTCGAG
AACGCCGCCT CGTTCCGCGC CAACGGCACG GGCCCGTTCC GTGTGCGCGA GCGCCAGCCC
AATGTGCGAA CCACCTTTGT CCGCAACGGC AACTACTGGG GCAAGATCGA GGGCAACGTT
CAGGAGGTGG TCTTCACCCC GATCGGCAAT GACGCGACCC GCGTGGCTGC CCTGCTGTCG
GGCGAGATCG ACGTCATGGA GCCGGTGCCG GTTCAGGATA TCGACCGCGT CAACAGTGCC
CCTGCCACCC GCGTGCTGGC CGGCCCCGAG TTGCGCACCA TCTTTTTGGG CATGGACCAG
AAGCGCGACG AGCTGCTGTA TTCCAGCGTC AAGGGCAAGA ATCCGTTCAA GGACAAGCGC
GTGCGCCAGG CCTTCTACCA GGCCATCGAT ATCGAGGGCA TCAAGAAGAC TGTGATGCGT
GGCGGCTCGA ACACGACCGC GCTGCTGGTG GGCCCCGGCA TCAATGGCTT CCAGGCCGAT
CAGAACAAGC GACTGCCGTA TGACCCGGAA GCAGCCAAGA AGCTGATGGC GGAGGCCGGT
TACCCGAACG GCTTCGAGGT CTCCATGAAC TGCCCGAACG ACCGCTATGT GAATGACAGC
CGGATTTGCC AGACCGTGGC CGCCAACCTG TCGCGCATCA ACGTTAAGAT CAACCTGCAG
GCTGAGACCA AGGGCAGCTA CTTCCCTAAG GTGCTGCGCC GCGACACCAG CTTCTACATG
CTGGGCTGGA CGCCCAGCAC CTACGACGCG CACAACGCGC TCAACGCGCT GCTCGCCTGC
GTCGACGACA AAGGTGCCGG CCAATTCAAC CTGGGGGCTT ATTGCAATCC CAAGGTCGAC
GAGCTCACCA AGAAGGTGCA GGCCGAGACC GACAAGGCCA AGCGCAACGC CATGATCAAG
GAAGCGTTCG AGATCCATTC GGCCGACATC GGCCACCTGC CGCTGCACCA GCAGGCGCTG
GCCTGGGGCA TGAGCAAGAA GGTCGAGTTG GTCCAGCTGG CCGACAACTT CATGTTCTAC
AAGTGGGTGA GCGTCAAAAA ATAA
 
Protein sequence
MKFKPNLLSV AVFCAVAATS FAASAQTLRV ANQGDALSMD PHSLNESMQL SVTGNVYEGL 
VIRNKDLSLA PGLATAWKQT SPTVWRFELR KGVQFHDGTP FTADDVVFSF QRTQVEGSDM
KSYTNDFKEV RKVNDHVVEI ETKTPFPILP DVISLVYMMS KKWCETNQAA KPVDRRKGVE
NAASFRANGT GPFRVRERQP NVRTTFVRNG NYWGKIEGNV QEVVFTPIGN DATRVAALLS
GEIDVMEPVP VQDIDRVNSA PATRVLAGPE LRTIFLGMDQ KRDELLYSSV KGKNPFKDKR
VRQAFYQAID IEGIKKTVMR GGSNTTALLV GPGINGFQAD QNKRLPYDPE AAKKLMAEAG
YPNGFEVSMN CPNDRYVNDS RICQTVAANL SRINVKINLQ AETKGSYFPK VLRRDTSFYM
LGWTPSTYDA HNALNALLAC VDDKGAGQFN LGAYCNPKVD ELTKKVQAET DKAKRNAMIK
EAFEIHSADI GHLPLHQQAL AWGMSKKVEL VQLADNFMFY KWVSVKK