Gene Dshi_2733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2733 
SymboldegP 
ID5713632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2894377 
End bp2895882 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content67% 
IMG OID641268658 
Productprotease do precursor 
Protein accessionYP_001534067 
Protein GI159045273 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0138399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.054673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAGAA CAGACCCTCA GCGACCCCAG GCCCGGATCC TGGCGAAGGT CGCAACGCCC 
GCGGGCGTGC CCCGTGCGGC GCAGGCCGCC GCCGGGGCGC TGCTTGCGCT GGCCCTGCTG
CTCGCGCAGA CGCTGATCGT GCAGGCACGT GAAATCCCCG GCAGTTTTGC CGACCTCGCC
GAGCGTGTGA GCCCGGCGGT CGTCAACATC ACAACCTCGA CCAATGTCGC CACCCCGGGC
GGGCCGCAAC CGATGGTGCC CGAGGGCTCG CCCCTCGAAG ACTTCTTCCG CGATTTCATG
GATCGGCAGG AGCAGGATGG TCGCCCCGCA CCCCGGCAGC GGCGCTCCAA CGCGCTCGGC
TCCGGCTTCG TGATCTCCGA GGACGGCTAT ATCGTCACGA ACAATCACGT GATCGAGCAG
GCCGACGAGA TCCTGATCGA GTTCTTCTCC GGTGAAGAGC TGGCGGCAGA GGTCGTCGGC
ACCGACCCCA ACACCGATAT CGCGCTCTTG AAGGTCGAAA GCGACACGCC GCTGCCGTTC
GTCACCTTCG GGGACAGCGA TGCGGCCCGC GTGGGCGATT GGGTGATGGC CGTGGGCAAC
CCGCTGGGCC AGGGTTTCTC GGTCTCGGCC GGGATCGTCT CGGCGCGCAA CCGGGCGCTT
TCGGGCACCT ATGACGATTA CATCCAGACC GACGCGGCGA TCAACCGGGG CAACTCGGGC
GGACCGCTGT TCAACATGGA CGGTGAAGTC ATCGGCGTGA ACACCGCGAT CCTGTCGCCC
AACGGTGGCT CTATCGGGAT CGGTTTTTCC ATGGCAGCCG GTGTCGTGAC CAACGTGGTC
GATCAGCTCA AGGAATTCGG CGAGACCCGC CGCGGCTGGC TGGGCGTGCG CATCCAGGAC
GTGACCGACG ACGTGGCCGA AGCCCTCGGG CTGGAGCAGG CCGCCGGGGC GCTCGTGACC
GATGTGCCGG ACGGCCCGTC GCTCGATGCC GGGATGGAGG CGGGGGACGT GATCCTCACC
TTCGACGGGC GCGACGTGGA GGACACCCGT GAGCTGGTCC AGATCGTCGG CAACACAGCG
GTCGGCAAGG CCGTGCGCGT CGTGGTGTTC CGCGACGGCG CAACCCAGAC CCTGCTGGTG
ACCCTTGGGC GTCGCGAAGA GGCGGAGCGG GCGATCCCGG CTTCCGCGTC TGCGGATGAA
GAGATCCTCG AGAAGGAGAT CATGGGCCTG ACCGTCAGCG AGTTGACCGA TGAGCTGCGC
GAGCAGCTCG GGATCGCGGC GAGCGATACC GGGCTTGTCG TGGCCGATAT CGACGAGACC
TCGGAGGCCT TCGACAAGGG TCTGCGCGCG GGCGATCTCA TCGTCGAGGC CGCACAGGTC
CGTGTGACGA CCATCGAAGA GTTCGAAGAG CGGGTCGAGG CCGCCAAGGA GGCGGGGCGC
AAGTCCATCC TCGTGCTGGT GCGCCGGGAT GGCGACCCCC GTTTCGTGGC CCTTTCGCTG
AGCTGA
 
Protein sequence
MNRTDPQRPQ ARILAKVATP AGVPRAAQAA AGALLALALL LAQTLIVQAR EIPGSFADLA 
ERVSPAVVNI TTSTNVATPG GPQPMVPEGS PLEDFFRDFM DRQEQDGRPA PRQRRSNALG
SGFVISEDGY IVTNNHVIEQ ADEILIEFFS GEELAAEVVG TDPNTDIALL KVESDTPLPF
VTFGDSDAAR VGDWVMAVGN PLGQGFSVSA GIVSARNRAL SGTYDDYIQT DAAINRGNSG
GPLFNMDGEV IGVNTAILSP NGGSIGIGFS MAAGVVTNVV DQLKEFGETR RGWLGVRIQD
VTDDVAEALG LEQAAGALVT DVPDGPSLDA GMEAGDVILT FDGRDVEDTR ELVQIVGNTA
VGKAVRVVVF RDGATQTLLV TLGRREEAER AIPASASADE EILEKEIMGL TVSELTDELR
EQLGIAASDT GLVVADIDET SEAFDKGLRA GDLIVEAAQV RVTTIEEFEE RVEAAKEAGR
KSILVLVRRD GDPRFVALSL S