Gene Bpro_4407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4407 
Symbol 
ID4012900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4652035 
End bp4653660 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content59% 
IMG OID637944055 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_551192 
Protein GI91790240 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.095613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGTC GTGATTTCGT CAAAGGCGCA GCAGCTCTTA CTGCAACCGC CGCCGCCGGC 
ATCGGGCAAG CGCAACAAGC CGGGGCGCCA GTTGCGAAGC GCTCAATTCT GACAGCTACC
GCGCAGTTCG ACCCGGCCCG CCCGGAGATT GCCCGGGTCG TCGCACAGTC CTGCAAGTCG
CTCGGGTGGG AGGTTGAAGC CAATCCCATC GACTACAACC AGGGCATCAC GAAAGTGATC
AACGAGCATG ACTTCGAGAT GTTCCTGGTC TTTCTTCCGG GTACGGCCAT CCGCATCGAC
CCCGACTTCT TCATTCGCAG CGTTCACCAC TCCACAGAGC ATAAGCGCGG CGGCTTCAAC
TGGGCGGGCT ACAAGAACGA TCGGGTGGAT GCTCTTGCGG CCGTGCAGAG CCGCATGATG
AACTTCAACG ATCGCTGGAA GCCAGTCTTC GAAGCGCAGG ACGTCATCTT CCAGGACCAG
CCTGGAACCG TGCTGGCCTA TACGCAGATG ACGATGGCTC ATCGTTCGGA CAAGCTCAAG
GGCCTGGTGC CGCAGCTCGG CGAGGGCATC GGCGGCTTTT GGTCTGACGT CAACATGGAA
GTAGCGGGCG ATGGCTACTC GCGTACGGGC TCAAACGCGG ATGTGAAGCA TCTGAATCCG
CTCGCCGTTA CCGACACGAG TGAGTTCATG GAGCTTTCCA TGATCTATGA CCGACTCTTT
CGCCTCGGAC CGGATGGCAA ACCTGTACCG TGGGCAGCCT CCGGAATGAA GTTGATCAAC
GACAGAACGA TCGAGCTGAC GATCCGCTCG GATATGCGCT GGCATGACGG CAAGCCAGTG
ACGGCCGAGG ACGTCATGTT CACCTTCGAC TATCACAAGA AGTGGAAGGC GCCATTTTTC
CTTGCCGCGC TCAATAATGT CGTCGCGGTG GAGCTCCCGA GCCCCAATAC GGTGCGCATC
CGACTCGAAA ATCCATCGGC TCCCTTTGTC TCCAACGTGA TGGCGACGAT GTTTCTTCTC
CCCAAACACA TTTGGGAAGG CATCCCGGAG AAGGTCGGCA TTGACGATCC GCTCAAATTC
CCCAACGACA AGCCGGTCGG TAGCGGACCG TTCAAGTTCG ACTACTGGCG GCGTGGCTCG
GAACTCAAGG TCACCGCGTT CCGCGAGCAC TTTAAACCGC CCAAGTGCGC TGGCATCATC
CGCGTCGTCT ATGGCAGCCA TGACGCTTTG GCCGCTGCCA TCGAGAAGGG GGAATGCGAC
CGCTCACGCT ATATCGTGTC GCCAGCCCTC GTCAATCGCC TGAAAGCGGT GAAGAACGTA
GTAGCCAAGG GCTATCCCAG CCACGGCCTC TATCACCTGG CGTACAACAA CAAGATCAAG
CCTTTCGACG ATCCCGTCTT TCGCCAGGCG CTCAACCATG TGATGCCGCG CAAGATGATC
TCGGAACTGA TCCTGCTTGG ATACGCTGAT CCTGGTGCAT CAATCATCTC GCCCATCAAC
GCGTTCTGGC ACAACCCGGC TGTCAAGGTT CCGGCCGAAG ACGTGAAGAA GGCGCGCGAC
ATTCTTGCGA AGGCCGGTTA CGGATGGGAC GCGCAAGGCA AATTGCTCTC TCCCCGCGGC
AAATGA
 
Protein sequence
MDRRDFVKGA AALTATAAAG IGQAQQAGAP VAKRSILTAT AQFDPARPEI ARVVAQSCKS 
LGWEVEANPI DYNQGITKVI NEHDFEMFLV FLPGTAIRID PDFFIRSVHH STEHKRGGFN
WAGYKNDRVD ALAAVQSRMM NFNDRWKPVF EAQDVIFQDQ PGTVLAYTQM TMAHRSDKLK
GLVPQLGEGI GGFWSDVNME VAGDGYSRTG SNADVKHLNP LAVTDTSEFM ELSMIYDRLF
RLGPDGKPVP WAASGMKLIN DRTIELTIRS DMRWHDGKPV TAEDVMFTFD YHKKWKAPFF
LAALNNVVAV ELPSPNTVRI RLENPSAPFV SNVMATMFLL PKHIWEGIPE KVGIDDPLKF
PNDKPVGSGP FKFDYWRRGS ELKVTAFREH FKPPKCAGII RVVYGSHDAL AAAIEKGECD
RSRYIVSPAL VNRLKAVKNV VAKGYPSHGL YHLAYNNKIK PFDDPVFRQA LNHVMPRKMI
SELILLGYAD PGASIISPIN AFWHNPAVKV PAEDVKKARD ILAKAGYGWD AQGKLLSPRG
K