Gene Sde_3731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3731 
Symbol 
ID3966766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4730041 
End bp4731471 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content50% 
IMG OID637922828 
Productprotease DO 
Protein accessionYP_529198 
Protein GI90023371 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.233243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAAC TCACACAACA TACTCGTACT TGTCGTATAC GCAAATGGGG GCGAGCGGCT 
TATTTGGCGG CTAGGTTGAG TGTGTTGCTG AGCGGATTAG TGGGTGCATT ACTGGGCGCA
AGTGTACAAC TGCACGCCGC ACTTCCAGCC AGTGTAAATG GTGAACCCTT GCCATCGCTG
GCCCCCATGC TTAAGCAAGT GAACCCTGCT GTAGTAAATA TTGCTACCTA CTCTACTTTT
CGACAGGCCT ATAACCCGCT GCTAAACGAC CCCTTCTTTC GCCATTTTTT TAACGTACCA
GATTCCTATC GCCAGCAACC GCAAACCCAA AAGCGGCAAC AAAGTGCGGG GTCTGGGGTA
ATAGTGGATG CAAAAAAGGG TGTGGTGCTA ACCAATTACC ACGTAGTGAA AGATGCCGAC
GAAGTGCAGG TTTCGCTCAT CGATGGCCGC GCCCTTATCG CTGAAGTAGT GGGCAGCGAC
CCCGAATTGG ATATAGCCGT GCTTAGGGTA AAAGCCGATG ACCTAACCGA CGTTAAAATG
GTGAATTCAA GCTTGCTAGA GGTGGGCGAC TTCGTAGTCG CCATTGGCAA CCCGTTCGGG
CTTGGGCAAA CCGTTACTAC GGGCATTGTA AGTGCGCTGG GGCGAACAGG CTTAGGTATA
GAAGGCTACG AAAATTTTAT TCAAACCGAC GCCTCTATTA ACCCTGGCAA CTCCGGCGGT
GCGCTGGTGA ATTTACGCGG TGAATTGGTG GGTATTAACA CCGCTATTAT CGCCCCTGCT
GGCGGCAATG TGGGTATTGG TTTTGCCATA CCTATTAACA TGGCTAAAGC GAGCATGGAG
CAAATACTTA AACACGGTAA GGTGCAGCGG GGCCATGTGG CGATAAGCGT GCAAGATATA
ACCCCAGACT TACGCGAAGC ATTTGCCCTT AAAAATGGCC AGCACGGGGT GGTAGTTACC
GGGGTTGGCG AAGGTTCCGA TGCGCAAAAG GCGGGCTTAC AAGCGGGCGA TATTATAGTA
ACCGTAGATG GTGAAAACAT TAATTCACGC GGCCAGTTAA GCAGCCATTT AGCGGTTAAG
GCGGTGGGCG CAAAGGTAAA AATAGGGGTT ATTCGCAAAG GTAAGCGCTT AGACATTAAC
GTGCCCATTA GCGACCCTCA CGCGGCGTTA ACCAGCGGTC AACTACATCC GCTTTTGGAG
GGGGCACGTT TTGAAAATAA CCCAGATGGA GAAGGTGTAA TCGTTGCTGC GCTTTCGCCA
AAATCTTATG CCGCGTACAG CGGCTTGCGT CCAGGTGATG TAGTGCTTGG CGCTAATGAT
TATCAAGTTG TTAACTTAGA GTCTTTTCAG CGCGCGTTAA AACGTAACAA AAAACAAGTA
TTACTGTTAG TTGCACGCGG CAACCGTGCT TTACATATTG TTATTCGGTA G
 
Protein sequence
MSQLTQHTRT CRIRKWGRAA YLAARLSVLL SGLVGALLGA SVQLHAALPA SVNGEPLPSL 
APMLKQVNPA VVNIATYSTF RQAYNPLLND PFFRHFFNVP DSYRQQPQTQ KRQQSAGSGV
IVDAKKGVVL TNYHVVKDAD EVQVSLIDGR ALIAEVVGSD PELDIAVLRV KADDLTDVKM
VNSSLLEVGD FVVAIGNPFG LGQTVTTGIV SALGRTGLGI EGYENFIQTD ASINPGNSGG
ALVNLRGELV GINTAIIAPA GGNVGIGFAI PINMAKASME QILKHGKVQR GHVAISVQDI
TPDLREAFAL KNGQHGVVVT GVGEGSDAQK AGLQAGDIIV TVDGENINSR GQLSSHLAVK
AVGAKVKIGV IRKGKRLDIN VPISDPHAAL TSGQLHPLLE GARFENNPDG EGVIVAALSP
KSYAAYSGLR PGDVVLGAND YQVVNLESFQ RALKRNKKQV LLLVARGNRA LHIVIR