Gene Sde_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2024 
Symbol 
ID3967287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2548465 
End bp2550096 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content47% 
IMG OID637921112 
Productlow molecular weight phosphotyrosine protein phosphatase 
Protein accessionYP_527496 
Protein GI90021669 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1388] FOG: LysM repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000444434 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATTA CGTTATTTTC TCGCACTTTT GCGCGAACCA CTCCGCTAGT CGCACTATTA 
GGTGTATTAA TTGGCTTTCA AGGATGCGCT TATACTAAAA ACGAAGCCGA ATCTGAGCTT
GTTATCGTAT CAACTCCAGA AGAGCAGCTA GAAGCTGATA TTGCTGAGCA CTCGCTCGAT
GCTGAACCAT TAGACGTCCC CGAAGCCGAC GAGCCAATTG TAGAGTCAGA ATTTACCCTT
TGGGATCGAA TCCGAGCTGG CTACCAACTA GAAATCCCCC AAGACAAAAG AATCGACCAA
GAACGTAATT GGTATGGCAA ACACCAAAGC TACCTAGATA GAGTCACCGA TCGCGGTGAG
CGTTACCTCT ACTATATTGT TGGCGAATTA GAACGCCGCA ACATGCCTAC GGAATTTGCT
TTGCTTCCTA TAGTAGAAAG CGCATTTGAC CCCTTTGCCT ATTCCCATGT AACGGCTTCT
GGCATGTGGC AGTTTATGCC CCGCACAGGG CGAAGCTTGG GCTTAAAGCA AAACTGGTGG
TATGACGGTC GCAGAGACGT GGTTTTATCT ACCAATGCAG CGCTCACCTA CCTAGAAAAA
CTACATAAGC ATTTTGATGG CGACTGGCTG CTGGCAATGG CAGCGTATAA TAGCGGTATT
GGTAATGTTT CTCGGGCAAT TAAACGCAAT GAAAAAGCCG GTAAACCCAC CGATTTTTGG
AACCTAAAAC TCCCCAGAGA AACTCAAGCA TACGTACCTC GGCTACTGGC TATTAGCCAG
CTAATTGGCG CACCAGAAGA TTACGGCTTA ACACTGCGCC CAATTCCCAA CCAACCGTAT
TTCGAAGCCG TTGAAGTGGG GTCGCAAATA GATTTGGCTC AAGCGGCAGA GCTTGCCGAA
ATAGATATGG ATGAGCTATA TCAACTTAAT CCTGCCTTTA ATCGCTGGGC GACAGACCCT
ACCGGCCCGC ACACTTTATT GGTGCCTTAC GCCTCAGCGA ACTCATTCAA AGAAAAGCTA
GCCGGCATTC CGCCTGGCAA ACGAATCACA TGGGATAGAT ACACTATCGC GTCCGGTGAT
TCCCTCTCTA CTATCGCTGC AAAATACAAA GTAAGCGTAG ATTCACTAAA AAGCATAAAT
GGTTTGCGAA ATAACAACAT TCGCGCGGGA AAGACTCTAT TGGTGCCTAT TGCTGCTAAA
AAAGACGAGC ACTACACACA AAGCATTGTG CAACGCATCC AAAACAGACA ATCCTCAGGC
GGTAAAAGCG GCAACTCAAC CAAAGTTGAG CATATAGTAC AAAGCGGCGA CAGCTTCTGG
TCTATTGGTA AAAAATACGG TGTTACCCCA AGCAAAGTTG CACATTGGAA CAACCTTGCC
CCAGCGGACC TTATCAAACC GGGCCAAACT CTCGTTATTT GGACCAAGGC CGAAGCTAGC
ACTGCGTCCA ACAACGCTGT TGTTAGAAAG CTCTCGTACA AGGTAAGGCG CGGAGATTCC
CTGCATCGTA TAGCTGACAA ATTCAAAATA AACGTGAGCG ACATATTGCA GTGGAATCAA
GTCGACACCA AGAGCTATTT ACAGCCAGGT GATACGCTAA CACTTTTTGT GGATGTGACT
AAAACTAACT AG
 
Protein sequence
MQITLFSRTF ARTTPLVALL GVLIGFQGCA YTKNEAESEL VIVSTPEEQL EADIAEHSLD 
AEPLDVPEAD EPIVESEFTL WDRIRAGYQL EIPQDKRIDQ ERNWYGKHQS YLDRVTDRGE
RYLYYIVGEL ERRNMPTEFA LLPIVESAFD PFAYSHVTAS GMWQFMPRTG RSLGLKQNWW
YDGRRDVVLS TNAALTYLEK LHKHFDGDWL LAMAAYNSGI GNVSRAIKRN EKAGKPTDFW
NLKLPRETQA YVPRLLAISQ LIGAPEDYGL TLRPIPNQPY FEAVEVGSQI DLAQAAELAE
IDMDELYQLN PAFNRWATDP TGPHTLLVPY ASANSFKEKL AGIPPGKRIT WDRYTIASGD
SLSTIAAKYK VSVDSLKSIN GLRNNNIRAG KTLLVPIAAK KDEHYTQSIV QRIQNRQSSG
GKSGNSTKVE HIVQSGDSFW SIGKKYGVTP SKVAHWNNLA PADLIKPGQT LVIWTKAEAS
TASNNAVVRK LSYKVRRGDS LHRIADKFKI NVSDILQWNQ VDTKSYLQPG DTLTLFVDVT
KTN