Gene Sde_0953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0953 
Symbol 
ID3967668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1237117 
End bp1238508 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content50% 
IMG OID637920020 
Productglycoside hydrolase family protein 
Protein accessionYP_526427 
Protein GI90020600 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.161663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000205432 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTATAAAA TTTCACGCCG CACAACACTC AAAGGCTTAG GCCTAACTTG CCTAGCCGGC 
TGCACCACCA GCCTACCCAC ACTAGAGCAA GACCCATGGG CTTTTGCACA AAACATAGCG
GACAACACCA CCATCCCCAC ATTCCCAAAC AAAGAATTTA ATTTACTCGA ATTCGGCGGC
AAAGAAGGGA GCGACAACAC CCTCGCCTTC AAAAAAGCGA TTGCAGCATG CAGCAAAGCA
GGTGGCGGCA AGGTGGTAGT ACCCGCAGGA CGATTTGAGA CAGGCGCCAT CCACTTAGAG
TCGAACGTTA ACCTTCATAT TAGCGAAGGC GCTACCATCG CCTTTTTTAC CGACCCCAAA
TATTACCTGC CTGCGGTTTT CACTCGCTGG GAAGGCATGG AGTGCATGGG CTACTCACCC
CTTATATACG CCTACGGCAA AACCAACATA GCCATTACCG GTAAAGGCAC CCTCGACGGT
CAAGCCGACC CAACGCACTG GTGGGCATGG AAAGGCAACA AAGAATGGGG CGTAGAGGGC
TACCCAAGCC AAAAGGAAAG CCGCAACCAA CTATTTGCCC AAGCAGAAGC TGGCGACCCC
GTTAGAGAGC GCGTGTATGC AGACGGCCAC TACCTGCGCC CCTCGTTTGT GCAACCCTAC
AAGTGCGAAA ACGTGCTGAT AGAAGACATA ACTATTATCA ACGCTCCCTT CTGGTTGCTA
CACCCCACCC TTTCACAAAA CGTCACTGTA CGCGGTGTTC ACCTAGAAAG CCTAGGCCCC
AACTCGGATG GCTGCGATCC TGAAAGCTGT AAGAATGTAG TTATCGAAAA CTGCTTTTTT
AATACCGGTG ACGACTGTAT CGCTATTAAA TCTGGCCGCA ACAACGATGG CCGCAGGCTT
GCCACACCTA CCGAGAACGT GATTATTCGC AACTGTAAAA TGGAAGCGGG TCACGGTGGC
GTAGTTATAG GCTCAGAAAT TTCTGGCGGC GTGCGCAATG TGTTTGCCGA AAATAACGTA
ATGAGCAGCC CCGATTTAGA GAAAGGCATT CGCATTAAAA CCAACTCTGT GCGCGGCGGA
CTGCTAGAGA ACATCTATGT GCGCAACTGC ACCATAGGCG AAGTACAACA AGCCATTGTT
ATTAACTTCC AATACGAAGA AGGCGATGCG GGTAAATTTG ACCCCACCGT GCGCAATGTA
GAAATACGCA ATTTGGTCTG CCAGCACGCC TTACAAGTGT TTAACATCCG CGGTTTTGAG
CGCGCCCCCA TTCAAAACTT TAGGATAATC GACAGCACCT TTGTGCGTGG TGACAACCCA
GGCGTAATTG AACATACCAC AGGGTTAGTT ATCGACAACG TCCAAGTCAA CGGCAAAGCG
TTTAACATCT AG
 
Protein sequence
MYKISRRTTL KGLGLTCLAG CTTSLPTLEQ DPWAFAQNIA DNTTIPTFPN KEFNLLEFGG 
KEGSDNTLAF KKAIAACSKA GGGKVVVPAG RFETGAIHLE SNVNLHISEG ATIAFFTDPK
YYLPAVFTRW EGMECMGYSP LIYAYGKTNI AITGKGTLDG QADPTHWWAW KGNKEWGVEG
YPSQKESRNQ LFAQAEAGDP VRERVYADGH YLRPSFVQPY KCENVLIEDI TIINAPFWLL
HPTLSQNVTV RGVHLESLGP NSDGCDPESC KNVVIENCFF NTGDDCIAIK SGRNNDGRRL
ATPTENVIIR NCKMEAGHGG VVIGSEISGG VRNVFAENNV MSSPDLEKGI RIKTNSVRGG
LLENIYVRNC TIGEVQQAIV INFQYEEGDA GKFDPTVRNV EIRNLVCQHA LQVFNIRGFE
RAPIQNFRII DSTFVRGDNP GVIEHTTGLV IDNVQVNGKA FNI