Gene Bpro_4406 details


Gene Information

Locus tag: Bpro_4406
Symbol:
ID: 4012938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 4649496
End bp: 4651121
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 58%
IMG OID: 637944054
Product: twin-arginine translocation pathway signal
Protein accession: YP_551191
Protein GI: 91790239
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
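
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4649496
END_BP = 4651121
GENE_LENGTH_BP = 1626
PROTEIN_LENGTH_AA = 541


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the stop codon is part of the CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1626
    print(expected_protein_length(GENE_LENGTH_BP))  # 541
```

Under these assumptions both derived values agree with the table: 4651121 - 4649496 + 1 = 1626 bp, and 1626 / 3 - 1 = 541 aa.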


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0490152
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGTC GTGATTTCGT CAAAGGCGCA GCAGCTCTTA CTGCAACCGC CGCCGCTGGC 
ATCGGGCAAG CGCAACAAGC CGGGGCGCCA GTTGCGAAGC GCTCAATTCT GACAGCTACG
GCGCAGTTCG ACCCGGCCCG CCCGGAGATT GCCCGGGTTG TCGCACAGTC CTGCAAGTCG
CTCGGGTGGG AGGTTGAAGC CAATCCCATC GACTACAACC AGGGCATCAC GAAAGTGATC
AACGAGCATG ACTTCGAGAT GTTCCTGGTC TTTCTTCCGG GTACGGCCAT CCGCATCGAC
CCCGACTTCT TCATTCGCAG CATTCACCAC TCCACAGAGC ATAAGCGCGG CGGCTTCAAC
TGGGCGGGCT ACAAGAACGA CCGTGTAGAC GCGCTGGCAA CCGTGCAGAG CCGCTTGATG
AAGATCGACG ACCGTAGGAA GCCGGTGCTC GAAGCGCAGG AAACCATCTT CCAGGACCAG
CCTGGAACCG TGCTGGCCTA TACGCAGATG ACGATGGCTC ATCGTTCGGA CAAGCTCAAG
GGCCTGGTGC CGCAGCTCGG CGAGGGAATC GGCGGCTTTT GGTCTGACGT CAACATGGAA
GTGGCGGGCG ACGGATTCTC GCGCACCGGT TCGAATGCGG ACGTCAAGCA TCTGAATCCA
GTTGCGGTCA ATGACTCCAT GGAATTCATG GAGCTTTCCA TGATCTACGA CCGACTTTTC
CGCCTTGGGC CCGACGGCAA ACCTGTACCT TGGGCAGCCT CCGGAATGAA GTTGGTGGAT
GACAAAACCA TCGATCTGAC GATCCGCTCG GGTATGCGCT GGCATGACGG TAAGCCAGTG
ACGGCCGAGG ATGTCAAGTT CACCTTCGAC TATCACAAGA AGTGGAAGGC GCCATTTTTC
CTTGCCGCGC TCAATAATGT CGTCGCGGTG GAGCTCCCGA GCCCCAATAC GGTGCGCATC
CGACTCGAAA ATCCATCGGC TCCCTTTGTC TCCAACGTGA TGGCGACGAT GTTTCTTCTC
CCCAAACACA TTTGGGAAGG CATCCCGGAG AAGGTCGGCA TTGACGATCC GCTCAAATTC
CCCAACGACA AGCCGGTCGG TAGCGGACCG TTCAAGTTCG ACTACTGGCG GCGCGGTTCG
GAACTCAAGG TCACCGCGTT CCGCGAGCAC TTCAACCCAC CCAAGTGCGC AGGCATCATC
CGCATTACTT ACGGCAGCCA TGACGCGTTG GCCGCCGCCA TCGAGAAGGG GGAATGCGAC
CGCTCACGCT ACATCTTGTC GCCAGCCCTT GTCGATCGTA TAAAGAACGT GAAGAACGTA
GTGGCCAAGG GCTATCCCAG CCACGGCCTC TATCACCTTG CGTACAACAA CAAGATCAAG
CCTTTCGACG ATCCCGCCTT CCGCCAGGCG CTCAACCATG TGATGCCGCG CAAGATGATC
TCGGAGCTGA TCCTGCTCGG GTACGCCGAT CCCGGCGCAT CAATTATTTC GCCCATCAGC
ACGTTTTGGC ACAACCCGGC TGTCAAGGTT CCGGCTGAAG ACGTGAAGAA GGCGCGCGAC
ATTCTTGCGA AGGCCGGTTA CGGATGGGAC GCGCAAGGCA AATTGCTCTC TCCCCGCGGC
AAATGA
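
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it has to be cleaned before any computation such as the GC content reported in the Gene Information section. A minimal sketch, assuming whitespace is the only non-sequence content; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a whitespace-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Toy input in the same block layout as the record; 6 of 8 bases are G or C.
    print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a value that can be compared with the listed GC content.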
 
Protein sequence
MDRRDFVKGA AALTATAAAG IGQAQQAGAP VAKRSILTAT AQFDPARPEI ARVVAQSCKS 
LGWEVEANPI DYNQGITKVI NEHDFEMFLV FLPGTAIRID PDFFIRSIHH STEHKRGGFN
WAGYKNDRVD ALATVQSRLM KIDDRRKPVL EAQETIFQDQ PGTVLAYTQM TMAHRSDKLK
GLVPQLGEGI GGFWSDVNME VAGDGFSRTG SNADVKHLNP VAVNDSMEFM ELSMIYDRLF
RLGPDGKPVP WAASGMKLVD DKTIDLTIRS GMRWHDGKPV TAEDVKFTFD YHKKWKAPFF
LAALNNVVAV ELPSPNTVRI RLENPSAPFV SNVMATMFLL PKHIWEGIPE KVGIDDPLKF
PNDKPVGSGP FKFDYWRRGS ELKVTAFREH FNPPKCAGII RITYGSHDAL AAAIEKGECD
RSRYILSPAL VDRIKNVKNV VAKGYPSHGL YHLAYNNKIK PFDDPAFRQA LNHVMPRKMI
SELILLGYAD PGASIISPIS TFWHNPAVKV PAEDVKKARD ILAKAGYGWD AQGKLLSPRG
K
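
The protein sequence above corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table, assuming the standard amino-acid assignments (translation table 11 uses the same assignments and differs from the standard code only in its permitted start codons). The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a whitespace-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = "ATGGATCGTC GTGATTTCGT CAAAGGCGCA GCAGCTCTTA CTGCAACCGC CGCCGCTGGC"
    print(translate(first_line))  # MDRRDFVKGAAALTATAAAG
```

The output matches the first two blocks of the protein sequence (MDRRDFVKGA AALTATAAAG), and the final gene line AAATGA translates to K followed by a stop, consistent with the last residue listed.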