Gene Bpro_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4033 
Symbol 
ID4013283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4234645 
End bp4235754 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content66% 
IMG OID637943682 
Productputative zinc protease protein 
Protein accessionYP_550825 
Protein GI91789873 
COG category[R] General function prediction only 
COG ID[COG4324] Predicted aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.213397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAACCC GGGTGGCCCA AGTGGCCCTC ACCTGGCCCG CCATCCGGGC CACCGCCATG 
GCGGCCATCG CCGCCCTGGG CCTGTCGGGC TGCACCGGTA TCGGCTACTA CTGGCAGTCG
GTCAGCGGCC ACCTGCAGAT GATGAACGCG GCACGCCCGG TCAGCGACTG GCTGGACGAT
GCGCAAACCC CCGAGCAGCT CAAAACCCGG CTGGCCCTGA GCCAGCGCAT CCGCAGCTTT
GCCGCGAGCG AGCTAAAACT GCCCGACAAC GCCAGCTACC GCCGCTATGC CGACCTGCAG
CGCCGGGCCG TGGTGTGGAA CGTGGTGGCG GCCCCCGAGT TGTCGCTCAC CCTCAAGACC
TGGTGCTTTC CAGTGGCGGG CTGCGTGGGC TACCGCGGTT ATTTTGACGA AGCCGAGGCG
CGCGCTGAAG CAGCGCGGCT GCAAACCGCG GGGCTGGAGG CCGGCGTCTT CGGCGTGCCG
GCCTACTCCA CGCTGGGCTG GCTGAACTGG GCCGGCGGCG ATCCGCTGCT CAACACCTTC
ATCGCCTACC CCGAAGGCGA GCTGGCCCGG CTGATCATTC ATGAACTGGC GCACCAGGTG
GTCTACGCCC AGGATGACAC CATGTTCAAC GAATCATTTG CGACGGCGGT GGAACGGCTG
GGCAGCCAGC GCTGGCTCGC CACCCAGGCC AGCCCGGCGG CCCGGGCCGA GTACGCGGCC
TTTGACAGTC GGCGCCAGCA GTTCCGGGCG CTGGTGCGGG CCACGCGGCA CAGGCTGGAT
GCAATTTACG ATTTGAATTG GGCGCCAGCG CCCGCCAGAG CCGCGCAAGT TGCGATGAAA
AGCATCGCTA TTTCAGATTT CAAGCAACAG TATGAGCAAC TCAAAACCAG CTGGGGCGGC
TTCGCCGGCT ACGACCCCTG GGTCGCCCAG GCCAACAACG CCGCGTTTGG CGCGCAGGCC
GCCTATGACG AACTGGTGCC CGGCTTTGAG GCGCTGTTCA AGCGCGAAGG CGGCGACTGG
CGGCGGTTTT ATGATGCGGT GAAGCGACTG GCCAGCCTGT CCAAAGAAGA ACGGCACCAG
GCTCTTGCGA CCCATAACAC CGATAAATAA
 
Protein sequence
MKTRVAQVAL TWPAIRATAM AAIAALGLSG CTGIGYYWQS VSGHLQMMNA ARPVSDWLDD 
AQTPEQLKTR LALSQRIRSF AASELKLPDN ASYRRYADLQ RRAVVWNVVA APELSLTLKT
WCFPVAGCVG YRGYFDEAEA RAEAARLQTA GLEAGVFGVP AYSTLGWLNW AGGDPLLNTF
IAYPEGELAR LIIHELAHQV VYAQDDTMFN ESFATAVERL GSQRWLATQA SPAARAEYAA
FDSRRQQFRA LVRATRHRLD AIYDLNWAPA PARAAQVAMK SIAISDFKQQ YEQLKTSWGG
FAGYDPWVAQ ANNAAFGAQA AYDELVPGFE ALFKREGGDW RRFYDAVKRL ASLSKEERHQ
ALATHNTDK