Gene Sde_0687 details

Gene Information

Locus tag: Sde_0687
Symbol:
ID: 3964938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 875475
End bp: 877067
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 49%
IMG OID: 637919748
Product: peptidase M22, glycoprotease
Protein accession: YP_526161
Protein GI: 90020334
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID: [TIGR02814] PfaD family protein
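
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 875475
end_bp = 877067

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1593 0 530
```

Under these assumptions the result agrees with the listed gene length (1593 bp) and protein length (530 aa).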


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000276719
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAATGAGA CTAATACTGG GCTAGGTAAT GATTGTTGCC GGCCAATAGC ATCCGGGCCA 
GCGCTGCTCG AACTCATACG CAGTGTGCGC GCGCCATTAG CTGTTTATAA AGATAAGCAT
GGCAGCTTAT CTGTGTATCC CGCCACAAAA GACGTTACCG TTACCAACGA TTACTCCCCT
GTAGCAACGC TGCCCCCCTT ATACCCAGAA TGGTTGGGCG ATAGAGGCTT TTGCGCAGTT
CACGGTGTGC GCTTTGCCTA TATTGCTGGT GCCATGGCGC GAGGTATTGC CTCTATAGAC
ATGGTGGTAG CCATGGGCAA AGCCGGCATG ATGGGGTTCT TGGGCTCTGC AGGTCTAAGC
GTAGAGCAAA TTCAACAGTC AATCTGCGCC ATTCAAGCGA GTCTAAATAA CGGTGAAGCC
TGGGGCTGCA ACCTTATTCA TACACCCGAT GCACCAGATT TAGAGATGAG CATAGTGCAG
CTTTTTCTGC AGCTGGGTGT AGCTCGTATT TCAGCTTCTG CATTTATGAG CATGACTCCT
GCATTGGCGG CCTATGCTTA TAAAGGTATC TATCGCGATC GCGATGGCGT TATTTGTCGC
GCTAATTACG TTTTTGCCAA AATATCTCGG CCAGAAACCG CAGCCGCTTT TATGAAACCC
GCACCGCAGG CCATGCTCAA GCAATTACTT GAGCGGGGTT TACTAACTGA AGCCGAAGTG
CAATTGGCGC AAAACTTGCC AGTTTCCGAA GATATTACCG TCGAAGCGGA CTCTGGTGGA
CACACCGATG GTCAAATTCT TACTGCGCTG TTTCCTACAA TTCTCGATTT AAAACATCAG
TTAAGTGAGC AGTACCAATA CCAACGCGAT ATTCGCATTG GCGCAGCGGG AGGCTTAGGC
ACGCCCTCTG CTATTGCCGC GGCCTTTGCG TTGGGGGCTG CCTACGTACT TACTGGGTCG
GTAAATCAGG CATGTATAGA AGCAGGTACT TCTCAGGCTG TTAAGCAATT GCTAACGCAA
GTGCGCACAG GTGATGTGGC CATGGCGCCC TGCGCAGATA TGTTTGAAGC TGGGGTAAAG
GTGCAAGTCT TAAAGCGCAG AACCCTATAT GCACCAAGAT CTACCAAGCT TTACGAAATA
TATAAACGCT ACGACAGTCT AGAAGACATT CCTGCAGCAG AATTAGACCA ATTAGAGAAA
TCCATATTTC GTCAATCATT AACAAGCGTT TGGCAAACAA CCAAACGCTT TTTTGAAACC
AGAGATCCAG CGCAATTAGC GAAAGCAGAA GCGAACCCAA AAGTGAAAAT GGGATTAATT
TTCCGCTGGT ATTTAGGCAG CAGCTCGCGC TGGCCAATTG ATGGCGACGA AGATAGGTTG
GTGGATTACC AAATATGGTG TGGGCCAGCA CAAGCGGCCT TTAATGATTG GGTGCGGGGC
AGCTTTCTAG AGCCAGCAGA AAATAGAGCA GTAGTGCAGG TAGCTCGCAA TTTGTTAGAA
GGCGCGGCCA TAGTCACCCG CGCGCAACAG CTGCGTACCT TTGGCATAGC TGTACCGCAA
AATGCCTTTG CTGTGGTTCC GCAAAAGCTA TAG
 
Protein sequence
MNETNTGLGN DCCRPIASGP ALLELIRSVR APLAVYKDKH GSLSVYPATK DVTVTNDYSP 
VATLPPLYPE WLGDRGFCAV HGVRFAYIAG AMARGIASID MVVAMGKAGM MGFLGSAGLS
VEQIQQSICA IQASLNNGEA WGCNLIHTPD APDLEMSIVQ LFLQLGVARI SASAFMSMTP
ALAAYAYKGI YRDRDGVICR ANYVFAKISR PETAAAFMKP APQAMLKQLL ERGLLTEAEV
QLAQNLPVSE DITVEADSGG HTDGQILTAL FPTILDLKHQ LSEQYQYQRD IRIGAAGGLG
TPSAIAAAFA LGAAYVLTGS VNQACIEAGT SQAVKQLLTQ VRTGDVAMAP CADMFEAGVK
VQVLKRRTLY APRSTKLYEI YKRYDSLEDI PAAELDQLEK SIFRQSLTSV WQTTKRFFET
RDPAQLAKAE ANPKVKMGLI FRWYLGSSSR WPIDGDEDRL VDYQIWCGPA QAAFNDWVRG
SFLEPAENRA VVQVARNLLE GAAIVTRAQQ LRTFGIAVPQ NAFAVVPQKL
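
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is not the database's own tooling. The helper names (`clean`, `translate`) are hypothetical, and the codon assignments and alternative start codons are written out from NCBI translation table 11 as an assumption to be verified against the official table. The first codon of this gene is GTG, which is read as methionine when it acts as an initiator.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiator codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "GTGAATGAGA CTAATACTGG GCTAGGTAAT"
print(translate(first_bases))  # prints: MNETNTGLGN
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` would be expected to reproduce the whole protein under the same assumptions.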