Gene Bpro_4335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4335 
Symbol 
ID4012984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4572241 
End bp4573821 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content58% 
IMG OID637943985 
Productextracellular solute-binding protein 
Protein accessionYP_551122 
Protein GI91790170 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.83307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGG TCGACTATGT GATCGCGTGT CGAGCGCTCC TCATCGCCGC ATGTATTGGC 
CTTGTCGGGC CGGTATTTGC TCAGGAGTCC ATCCTCCGCA TCGGAAGTGA TGCGATTGAC
ATCACTACTC TGGATCCGCA CCGCACGGCG GTGTCGGAAG AAAAGGGCTT GATTAGCATG
ATCTTCGGTG GCTTGGTGAG ATTTCCACCC GGTTCCGCTG ACCCCGAGAA AATAGAAGGT
GATTTGGCCG AGAGCTGGCA GGCTTCTGCT GATGGACTGA CCTGGACATT CAAGCTGCGT
CACGGTGTGC AGTTCCACCG CAACTATGGC GAGGTGAATG CGGAGGATGT GGTTTATAGC
CTAATGCGCG CAGCCGATCC AAAGCGGTCC TCGTTTTCGT CTACCTTTGA GCCGGTGAAG
GAAGTGAGCG CGCTCGATTC GCATACGGTG CGCATACAGC TCAAGGCGCC AGTACCCAGT
CTGCTGGGAC TGGTTGCCAA CTACCACGGC GGAATGGTCG TAAGCCGCAA GGCCGACCAG
GATCTGAAAG ACGCGTTCAA ATTGAAGCCA GTTGGCTTTG GGCCCTTCGA GTTCGTTGAA
CATCAGAAAC AGCGCGAGGT GGTGCTGAAG GCGCACGACA AGTATTTCCG TGGGAAGCCG
AAGATAGAGC GCATCGTTTA CCGATTCATC CCCTCCGAGG CCGTCCGGGG GCTAGCGTTC
GCTACTGGCG AGCTTGACCT GGTCGCCGGC CAGCGCGATC AGCGCTGGGT GCAGCGGGCC
CGGCTCTGGG CGCCGCCTAA GGACCAATCG TCCGTCAAAG TCGATGTCTT CGGACCAGGT
GAATTTCGTA CCCTGATGCT CAATCGGCGC ATCAAGCCGC TGGACGACCC GCGTGTTCGG
GAAGCCGTAG CACGCGCCGT GGATGTGCAG GAACTAGTTC ACTTCGTCGG CGCGGATATC
GTCAAGCCTG GCAGATCTGT CATTCCGCCA GGCTACGCAG GTGAGGTAGA CGTTGGGCCA
AAGTTTCCTT ATAACGTCGG CAAGTCAAAA GCTTTGCTGA CGGAAGCCGG CTACGCCAAC
GGCATCACCC TAAGGGCGGT AGTGTCCAGT ACCGCCTCGC AACTGTCGGT TATGGATGTG
GTGCAGAAAC AGCTCAAGCG CGCGGGCATC AACTTAACCA TGGACGTGGT CGAGCACGCC
GCTTATCACG CGCAGATCCG CAAGGACGTG AGTGCAATCG TCTTCTACGG CGCTGCTCGA
TTCCCGGTGG CCGATTCCTA CCTCACAGAG TTCTATCACT CCCGCTCCGA GATCGGCGCA
CCTACTCAGG TTACCAATTT CTCCCACTGC AATGCAGCCG ACAAGGAAAT TGATGCGGCG
CGCGCAACCC CAAATGCGGC TGCGCGAAGC TCGCTTTGGC GCGTGGCACA GGTGAAGATC
AACGCAGATC TCTGCGCCAT CCCGCTATTT GATCTGCAGC AGGTCTGGGC CCGTCGCGGT
GCGCTCGACT ATGGAGTGCC TCTTGAGGGC GCAATGAACC TCTTTCCGCC GATCAACGAA
AAATCGACAC TGAAGAAATG A
 
Protein sequence
MKAVDYVIAC RALLIAACIG LVGPVFAQES ILRIGSDAID ITTLDPHRTA VSEEKGLISM 
IFGGLVRFPP GSADPEKIEG DLAESWQASA DGLTWTFKLR HGVQFHRNYG EVNAEDVVYS
LMRAADPKRS SFSSTFEPVK EVSALDSHTV RIQLKAPVPS LLGLVANYHG GMVVSRKADQ
DLKDAFKLKP VGFGPFEFVE HQKQREVVLK AHDKYFRGKP KIERIVYRFI PSEAVRGLAF
ATGELDLVAG QRDQRWVQRA RLWAPPKDQS SVKVDVFGPG EFRTLMLNRR IKPLDDPRVR
EAVARAVDVQ ELVHFVGADI VKPGRSVIPP GYAGEVDVGP KFPYNVGKSK ALLTEAGYAN
GITLRAVVSS TASQLSVMDV VQKQLKRAGI NLTMDVVEHA AYHAQIRKDV SAIVFYGAAR
FPVADSYLTE FYHSRSEIGA PTQVTNFSHC NAADKEIDAA RATPNAAARS SLWRVAQVKI
NADLCAIPLF DLQQVWARRG ALDYGVPLEG AMNLFPPINE KSTLKK