Gene Bpro_4051 details


Gene Information

Locus tag: Bpro_4051
Symbol:
ID: 4013301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 4252751
End bp: 4253872
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 62%
IMG OID: 637943699
Product: twin-arginine translocation pathway signal
Protein accession: YP_550842
Protein GI: 91789890
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
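
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a complete, unspliced CDS that ends in a single stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4252751
END_BP = 4253872
GENE_LENGTH_BP = 1122
PROTEIN_LENGTH_AA = 373


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks pass for this record: 4253872 - 4252751 + 1 = 1122 bp, and 1122 / 3 - 1 = 373 aa.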


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.222615
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATA GCAAGACCCC CCGCCGCCGC AGCCTGCTCA AGGGCGCAGC AATGGCCGCC 
GGTGCCGGCG CCATGTCCGC CCCCATGCTG GCGACCGCGC AAACCACCAC CACGCTGCGT
TTCCAGAGCA CCTGGCCCTC GAAGGACATC TTCCACGAAT ACGCCAACGA CTTTGCCAAG
AAGGTCAACG ACATGGCTGG CGGCAAGCTG AAAATCGAAG TGCTGCCCGC TGGCGCGGTG
GTGCCTGCAT TCCAGCTGCT GGAAGCCGTC AACAAGGGCA CGCTGGATGG CGGTCACGGC
GTGGTGGCCT ACCACTACGG CAAAAACTCG GCGCTGGCGC TCTGGGGTTC CGGTCCCTCC
TACGGCATGG ACCCCAACAT GCTGCTGGCC TGGCACAACT ACGGCGGCGG CAAGGCCATC
CTGGAAGAAA TCTACAAGTC GCTCAACATG GACGTGGTGT CCTACCTGTA CGGCCCGATG
CCGACGCAAC CGCTGGGCTG GTTCAAGAAG CCGGTGACCA AGGTTGAAGA CATGAAGGGC
CTGAAGTTCC GCACCGTCGG CCTGGCCGTC GACATCTTCA CCGAGATGGG CACCGCCGTC
AACCCGCTGC CGGGCGGCGA AATCGTGCCG GCGCTGGACC GCGGGCTGAT TGACGCGGCC
GAGTTCAACA ACGCCTCGAG CGACCGTCTG CTGGGCTTTC CCGACGTGGT GAAAAACTGC
ATGCTGCAGA GCTTCCACCA GAGCGGCGAG CAGTTCGAGA TCCTGTTCAA CAAGGGCAAG
TACAACGCCT TGCCACAAGA GCTGCGCTCC ATCATCGACT ACGCCGTCCA GGCAGCCAGC
GCCGACATGA GCTGGAAGGC TGTGGAGCGC AATTCGCAGG ACTACATCGA ACTCAAGAAA
GCCGGCGTCA AGTTCTACAA GACACCCGAC GCGATCCTGC GCGCCCAGCT GGCCGCCTGG
GACAAAACCA TCGACAAGAA AGCCAAGGAA AACGCGCTCT TCAAGAAGGT GCTCGACTCC
CAGAAAGTCT TTGCGCAGCG CGCGGGCCAG TGGCAGAACG ACTACACCGT GGATTTCAAG
ATGGCCTATA ACCACTACTT CGGCCGGGGC AAGAAAGCCT GA
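
The GC content field can be recomputed from the gene sequence as printed here, in space-separated blocks of 10 bases. The following is a hedged sketch: `clean_sequence` and `gc_content` are hypothetical helper names, and rounding to a whole percent is an assumption about how the reported value was derived.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Example on the first two blocks of the gene sequence only;
# paste the full sequence to compare against the GC content field.
fragment = "ATGACCGATA GCAAGACCCC"
print(len(clean_sequence(fragment)))  # 20
print(gc_content(fragment))           # 55.0
```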
 
Protein sequence
MTDSKTPRRR SLLKGAAMAA GAGAMSAPML ATAQTTTTLR FQSTWPSKDI FHEYANDFAK 
KVNDMAGGKL KIEVLPAGAV VPAFQLLEAV NKGTLDGGHG VVAYHYGKNS ALALWGSGPS
YGMDPNMLLA WHNYGGGKAI LEEIYKSLNM DVVSYLYGPM PTQPLGWFKK PVTKVEDMKG
LKFRTVGLAV DIFTEMGTAV NPLPGGEIVP ALDRGLIDAA EFNNASSDRL LGFPDVVKNC
MLQSFHQSGE QFEILFNKGK YNALPQELRS IIDYAVQAAS ADMSWKAVER NSQDYIELKK
AGVKFYKTPD AILRAQLAAW DKTIDKKAKE NALFKKVLDS QKVFAQRAGQ WQNDYTVDFK
MAYNHYFGRG KKA
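
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (identical for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGACCGATA GCAAGACCCC CCGCCGCCGC AGCCTGCTCA AGGGCGCAGC AATGGCCGCC"
print(translate(first_line))  # MTDSKTPRRRSLLKGAAMAA
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence should yield the complete protein, with the terminal TGA read as the stop codon.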