Gene Bpro_3640 details


Gene Information

Locus tag: Bpro_3640
Symbol:
ID: 4013708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 3841148
End bp: 3842611
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 61%
IMG OID: 637943298
Product: peptidase S1C, Do
Protein accession: YP_550442
Protein GI: 91789490
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
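
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3841148
end_bp = 3842611
gene_length_bp = 1464
protein_length_aa = 487


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the record is internally consistent: the coordinate span matches the stated gene length, and 488 codons minus one stop codon give the stated 487 residues.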


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.316615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGAAT TGAACCGGAA ACCCCTGCGC TCCTATTTGC TCGCAGGTCT GTTGGGCGTG 
GTGACAGCCA CCGTGGTGCT GCCTGTCGCT CCCGCGTGGG CGCAGGTCCG CACGCTGCCT
GACTTCACCG ATCTGGTTGA TCAGGTCGGC CCCTCGGTGG TGAACATCCG CACGATAGAA
AAAGTGAAAG CCTCGGGTGC GGGAAGCGTG GATGAGCAAA TGCTGGAGTT TTTCAGGCGT
TTCGGGATTC CCGTACCACC CAACATGCCG CGCGCGCCGC GTCCAGACCG CGGCCAGCCC
CAGCCCGACG AAGAGCAGCC GCGTGGTGTG GGTTCCGGCT TCATCCTCAC GGGAGACGGT
TTTGTCATGA CCAATGCCCA TGTGGTGGAG GGCGCCGATG AAGTGATTGT CACGCTGACT
GACAAGCGTG AATTCAAGGC CAAAATCATC GGAGCGGACA AGCGCAGCGA TGTGGCGGTC
GTGAAGATAG AGGCAAGCGG GTTGCCGGCC GTGAAAATCG GTGACATCAA CCGCCTGAAA
GTGGGCGAAT GGGTGATGGC CATCGGCTCA CCGTTTGGCC TTGAAAACAC CGTGACGGCG
GGTATTGTGA GCGCCAAGCA GCGCGATACC GGCGACTACC TGTCCTTCAT CCAGACCGAT
GTGGCCATCA ACCCCGGCAA CTCGGGTGGG CCCCTGATCA ACATGCGCGG GGAGGTCGTG
GGTATCAACA GCCAGATCTA TTCACGTTCT GGCGGCTTTC AGGGCATTTC GTTTTCCATC
CCGATTGACG AGGCAACGCG TGTTTCAGAC CAGCTGCGCA GCAGTGGTCG GGTCACACGT
GGACGCATCG GGGTGCAGAT CGACCAGGTC AGCAAGGAAG TGGCCGAATC CATCGGTCTG
GGCAGTCCCC GCGGGGCGCT GGTCAGAGGC GTGGAGGCTG GTGCACCTGC AGAAAAGGCA
GGCGTAGAAG CCGGCGACAT CATCATCAAG TTCGACGGCA AGCAGATCGA AAAATCCAGT
GATCTTCCGC GCATGGTCGG CAATGTGAAG CCCGGCACCA AGGCGGTGGT GACCGTCTTC
AGGCGGGGTG CTACCAGGGA TTTGCCGGTG GTCATCGCGG AGGTGGAAGC AGAGAAGCCC
CTTCGCAAGG CCTCGTCACC GGAAGCCAAA CCGCCGGTCG CCGGCCCTGC ACAGGCTTTG
GGCCTGGTGG TGAGCGACCT GCCGGATGCA CAGAAAAAGG AACTCAAGAT CAAGGGCGGG
GTCCGGGTAG ACAGCGCCGA GGGTGGAGCC GCCCGTGCAG GACTTCGCGA GGGTGATGTG
ATCGTTGCGA TTGCCAACTC CGAAATCACA ACCGTCAAGG AGTTCGAGGC TGCGCTCGCA
AAAATTGACA AGAGCAAGCC CGTCAATGTG CTGTTTCGCC GGGGAGAACT GGCGCAGTTT
GTGTTGATCC GGCCGGCACG TTGA
 
Protein sequence
MLELNRKPLR SYLLAGLLGV VTATVVLPVA PAWAQVRTLP DFTDLVDQVG PSVVNIRTIE 
KVKASGAGSV DEQMLEFFRR FGIPVPPNMP RAPRPDRGQP QPDEEQPRGV GSGFILTGDG
FVMTNAHVVE GADEVIVTLT DKREFKAKII GADKRSDVAV VKIEASGLPA VKIGDINRLK
VGEWVMAIGS PFGLENTVTA GIVSAKQRDT GDYLSFIQTD VAINPGNSGG PLINMRGEVV
GINSQIYSRS GGFQGISFSI PIDEATRVSD QLRSSGRVTR GRIGVQIDQV SKEVAESIGL
GSPRGALVRG VEAGAPAEKA GVEAGDIIIK FDGKQIEKSS DLPRMVGNVK PGTKAVVTVF
RRGATRDLPV VIAEVEAEKP LRKASSPEAK PPVAGPAQAL GLVVSDLPDA QKKELKIKGG
VRVDSAEGGA ARAGLREGDV IVAIANSEIT TVKEFEAALA KIDKSKPVNV LFRRGELAQF
VLIRPAR