Gene Dshi_1540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1540 
Symbol 
ID5713197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1601019 
End bp1602053 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content73% 
IMG OID641267455 
Productprotease 
Protein accessionYP_001532883 
Protein GI159044089 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.107774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.168358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCT GGCTCCCCCT CGCGCTCTGC CTGCTGGCCC TGGTCGCACC CGCCCGGGCA 
TCCGAACTGA CCGCCCCCGA AGAGCGCCTG ATTTCCCTGT TCGAGACGTC GCGCGCCGCC
GTGGTGTCGA TCACCACCGG CCAGCGCCGG GTCGATCCCT GGATGCGCCG GGCCGAAATC
GTGCCCAGCG GCTCCGGCTC GGGGTTCGTG TGGGACCGCG ACGGCCATGT GGTCACCAAC
GCCCATGTCA TCCGCGGCGC GGCCCGGGCG GATGTGCACA TGGCCGACGG GCGCGTGCTG
CCCGCCCGGC TGGTGGGCAC GGCCCCGCAA TACGACCTCG CGGTGCTGCG CGTCGATCTC
GGCACGCGCC GTCCCGACCC GCTCCCCCTG GGGCGCAGCG ACGCGCTCCG CGTGGGTCAA
AGCGTGCTGG CCATCGGCAA TCCGTTCGGG CTGGACTGGA CGCTGACCAC GGGCATCGTC
TCGGCGCTGG AGCGCGAGAT CCCGCTGGGC ACCGGCACGA TCGAGGGGCT TATCCAGACC
GACGCGGCGA TCAATCCGGG CAATTCCGGC GGCCCGCTTC TGGACAGCTC CGGGCGGCTG
ATCGGCGTGA ACACCGCGAT CTTCAGCCCC TCGGGCTCCA GTGCCGGAAT CGGCTTTGCC
GTGCCCGTGG ACCGGGTCGC CCGCGTGGTG CCGCAACTCA TCGCCCGGGG CATGTATCGC
CCGCCGGTCC TCGGCATCCG TTTCGATCCG CGCATCGACG CGCTGGCCCG GCAGAACGGC
GTCGAAGGCG CCGTGATCCT CGCGATAGAA CCGGGCGGCC CCGCCGCCGC CGCAGGTCTG
CGCCCGGCCC GGCGGGATGG GGCGGGCTTT CTCGTGCCCG GCGACGTGAT CCAGCGCCTG
GCGGGCCGCC CCATCGCCAG CGGCAGCGAC CTGCGCAGCG TGCTCGACGA TTTCGACCCG
GGCACCGAGG TGACCCTCGA GGTCTGGCGC GACGGCACCC GGCGCGAGGT CCGCGTCACC
CTGGCCGCGC CCTGA
 
Protein sequence
MRLWLPLALC LLALVAPARA SELTAPEERL ISLFETSRAA VVSITTGQRR VDPWMRRAEI 
VPSGSGSGFV WDRDGHVVTN AHVIRGAARA DVHMADGRVL PARLVGTAPQ YDLAVLRVDL
GTRRPDPLPL GRSDALRVGQ SVLAIGNPFG LDWTLTTGIV SALEREIPLG TGTIEGLIQT
DAAINPGNSG GPLLDSSGRL IGVNTAIFSP SGSSAGIGFA VPVDRVARVV PQLIARGMYR
PPVLGIRFDP RIDALARQNG VEGAVILAIE PGGPAAAAGL RPARRDGAGF LVPGDVIQRL
AGRPIASGSD LRSVLDDFDP GTEVTLEVWR DGTRREVRVT LAAP