Gene Dshi_3421 details


Gene Information

Locus tag: Dshi_3421
Symbol: apr
ID: 5712479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3601513
End bp: 3603003
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 67%
IMG OID: 641269350
Product: subtilisin DY
Protein accession: YP_001534755
Protein GI: 159045961
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
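
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Consistency check for the coordinate and length fields of a CDS record.
# Assumption: coordinates are 1-based and inclusive, and the last codon
# is a stop codon that does not contribute a residue to the protein.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(3601513, 3603003)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the values in this record, the computed lengths agree with the listed Gene Length (1491 bp) and Protein Length (496 aa).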


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCAT TCAAGGCGCT TCTGGCGCTG GTTCTTGTCG GGCTCTTATG GTCGTCGCCG 
TCACAGGTGG CGGCCCAAAC TGCGCCTGCA TTGCCTGAAT GCGTCAATCC CGGCGACCTG
ATCACGATAC CGCGGGTGCG CAACCCCCCG CCGATCCGGG CGACCTTGTT CCTCGATTTC
GGCGATTTCC GGGTGCGCGT GCTGATCCGG AACGTGACCG CGTCCAGTTT CAGTTTCCGC
ATGCCCCGCC TGCAGCGCAC TCCCCGAAAC GGAGAGTTCG AACTGGTCGT TCGGCTGCTG
CTGGGGGTGG AACGGACGCT GCGCACGGGC CGCATTTGTC AGGGCAGGCT CTTGGAAGAG
GCCTTGGACC GTATTCCGGA TGTTCCAATT CGCCCCGCCA CCGGCAACGA GGTGGCCGCT
CCCAGTGGGG GTCCGGAATA TGTCCTCGCG GGTACGGACC AGGAGATCGC CCGGGCGCGT
GTCGTGCTGC GTGGGGCGCG GGCGCAGATC TTGCGCTCCC AGCGGCTGGG CTCTCTTGGT
CAGAGCCTTC TCTTTGTGGA TCTCGCCGGG GCACTCACCG AAGCGCAGGC TCGCGCGCTG
CTTGCTCGCG AAGGCATACG TTCGGCCATC GGGACGCATA CGGTCTACAG TTTGTCTCAG
TCCAGTGGCG GGCGCGCCGG ACTACGCCTG TTCGCGACAG CCCTGGTGCG GCCCGATCCG
GGGCGCAACT GTACCCTGAC CCGCCCCGTG CGGGTCGGTC TGATTGACGG ACCCCTCGAT
CTGCGCACGC CGTCCCTGAC CAATGTACGG GTGACCAGCC TGTCGGTTCT CAGACCGCGG
GAGCGGCCCG GTTCTACCGC CCATGGCACG GGGATTGCCG CCCTGATCGC GGGCCAGGCC
ACCACGCAAG GTCCGGCGGG TCTCGCACCT GGGGCGGAGT TACTCTCGGT GGTGGCGTTT
GCACGGGCGG GGGGGCGCGA CCTCGCCCGG CTCGAAAATA TCGCACTCGG TCTCGATTGG
CTGGTCGAGC GGGGTGCAGA CGTGGTCAAC ATGTCGCTTG CTGGCCCCCC GAACGAGGCG
TTGGCTGCCC TGGTGGAGAT CGCCGATCAG CAGGGACTGA TCATGGTGGC CGCCGCCGGC
AACCGGGGCG AGCCGTCCCT GGGATATCCT GCCGCCGATC CGCGCGTTCT TGCGATTACG
GCGATCGATG CGGACAAACG GATCTACCGC CGGGCCAGTT TCGGGGCGGG TATGGATTTC
TCGGCGCCGG GCGTCGATAT CGCGGTGCCG GATCGGCGTG GCTGGTCCTA TCGCTCGGGC
ACGTCCTACG CCGCGGCAGT CGCCACCGGG CTTGTGGCGC AGAAGCTGGC GCAGCAGCGG
CTGACTACGG ATCAGTTGCG TGCAAGCTTT CGGCGCAGTG CCGAGGACCT CGGGCCTTCG
GGATATGACC CCCGATTTGG TTGGGGTCTC ATGCGCGGCG ACCCCTGCTA A
 
Protein sequence
MKPFKALLAL VLVGLLWSSP SQVAAQTAPA LPECVNPGDL ITIPRVRNPP PIRATLFLDF 
GDFRVRVLIR NVTASSFSFR MPRLQRTPRN GEFELVVRLL LGVERTLRTG RICQGRLLEE
ALDRIPDVPI RPATGNEVAA PSGGPEYVLA GTDQEIARAR VVLRGARAQI LRSQRLGSLG
QSLLFVDLAG ALTEAQARAL LAREGIRSAI GTHTVYSLSQ SSGGRAGLRL FATALVRPDP
GRNCTLTRPV RVGLIDGPLD LRTPSLTNVR VTSLSVLRPR ERPGSTAHGT GIAALIAGQA
TTQGPAGLAP GAELLSVVAF ARAGGRDLAR LENIALGLDW LVERGADVVN MSLAGPPNEA
LAALVEIADQ QGLIMVAAAG NRGEPSLGYP AADPRVLAIT AIDADKRIYR RASFGAGMDF
SAPGVDIAVP DRRGWSYRSG TSYAAAVATG LVAQKLAQQR LTTDQLRASF RRSAEDLGPS
GYDPRFGWGL MRGDPC