Gene Dshi_3470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3470 
Symbol 
ID5712528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3652890 
End bp3653921 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content66% 
IMG OID641269399 
Productpeptidase M48 Ste24p 
Protein accessionYP_001534804 
Protein GI159046010 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCATC CGGAACAGGT CATCCTCGGT ACCGCGGTCA GCGGTGGCAC CTCTCTCCAG 
GTCGCGGCGC GTCTTCTGGT GCGCGGGGAT ATGGCCAAGT TAATCGCGGT GGAGACCGGT
GGAACCATGG CCGAGGCCCG GCTGGACGCG GTGAGGTTCG ATCCGCCGCT GGGGTCGCTG
CCACGCAAGC TGCGCTTCCC GGACGGTGCC GAGTTCGAAA CGGGGGACCG CGAGGCGATT
GCCGCGTTGG AGCCGCGCGG CTTCTGGACC CGGCTGCACG GATGGGAACG GCTGCATCCG
CGCCTGATCC TGTTCGTGGT TGGCGGATTC GCGGGCGGCT GGCTGGTCTA TTCCGTGGCC
CTCACCGCAC TGGTTGCCAT GGCTGTCGCC CTGACACCGG AGCCCCTCGT GCGGGCGATG
GATCGCAGTA CCCTCTCCGC CCTCGACCGC GTCATTGCCT CCGAAACAGC GCTGAGTACA
GCAGATCAAG CCGAGGCTCG CGCGATTTTC GAGGACCTGC GCGCGGTTCT GCCGGACCGC
GACCTCGCGG AAGCCGTGAG CCTGGAGTTT CGGGCGCTCC GGGGTTTGGG ACCGAATGCG
CTGGCCCTGC CCGGGGGCAC CGTGGTGTTG TCGGATGCCT TGGTTAAGCA GTTCGATGCT
GATGTCGTCG CCTCGGTGCT CGGCCATGAG ATCGCCCATG TGATGGAGGA ACACTCCCTC
AAGCGGCTCT ATCGGTCGCT GGGCATCTAC GTGATGGTCG CCCTGATCGC CGGGGAAACC
GGGCCTTTGC TCGAGGATCT TCTGCTGGAG GGGAATGTGC TGCTGTCGCT GTCCTACTCC
CGCGGGCAGG AGGCGGAGGC GGATCAGATC GGCCTGCGGC TCGCCGACGC CGCAGGGTAT
GATCCGACCG GGTTGAAGGT ATTTTTTGAA ACGCTCGCGG CCGAGGTCGG AGACGGCGGT
GGCTGGCTGT CCACCCATCC GGGCAATGAC GACCGCATCG AGGCAATCGA TGCCTATCTG
GAGGCGCGCT AG
 
Protein sequence
MAHPEQVILG TAVSGGTSLQ VAARLLVRGD MAKLIAVETG GTMAEARLDA VRFDPPLGSL 
PRKLRFPDGA EFETGDREAI AALEPRGFWT RLHGWERLHP RLILFVVGGF AGGWLVYSVA
LTALVAMAVA LTPEPLVRAM DRSTLSALDR VIASETALST ADQAEARAIF EDLRAVLPDR
DLAEAVSLEF RALRGLGPNA LALPGGTVVL SDALVKQFDA DVVASVLGHE IAHVMEEHSL
KRLYRSLGIY VMVALIAGET GPLLEDLLLE GNVLLSLSYS RGQEAEADQI GLRLADAAGY
DPTGLKVFFE TLAAEVGDGG GWLSTHPGND DRIEAIDAYL EAR