Gene Sde_3862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3862 
Symbol 
ID3967029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4866601 
End bp4868349 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content48% 
IMG OID637922959 
Producthypothetical protein 
Protein accessionYP_529329 
Protein GI90023502 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00590592 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.508788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTAC TAAATAACAA AACCCTTCGT GCTATTTCTG TATTGTGTAG TTTAACGTTT 
GTGCCCTGTG CTATTGCCGA GGTGGCGCCG CTTTGGTTGC AGTTTGAGGC CGCAAAGGCG
AATGGCGATG AGCCAACCTT GCCGGACTTT TCTTATGCGG GTTACGACTA TTCCGAGAGC
GAGCTGCCAG ACATTTCTAG TTGGGCCGTA TTTAATGTGA CGGACTACGG GGCAATAGCA
AACGACGAAA ACTACGATGA CCAAGCCATA CAGCTGGCAA TAGATGCCGC ACAAAACGCT
GGTGGCGGTG TGGTTATGTT CCCCGCTGGC AGGTTTTTAG TTAGCCCTAA CGAAACGGTG
GGCGAGAATA TTTTTATTCG CGCTAGCAAC ATAGTGCTAA AAGGCGCGGG CGCTGGTGAT
AACGGAACAG AAATATTTAA AGTAAATAAA AAAGTAAATA ACGGCGAGTA TATTTTTGAG
GTTTCGCCAA CAAGCACTGG CGAATCGGTG ATTACAACTG TAGTAGCAAA TGCACAGCGC
GAAAGTTTCG AAATAGTAGT TGCCGATGCA TCACAACTGA GTGTGGGCCA GCGTATTTTA
TTGCGAGCAG ATAGCGTGGA ATTAGCGCAA TCGTATTATT CCCCGCTTAC CATTCGCAGC
GAATGGACGC GATTACTAAA CGACGGCTTT AACCTGCGAG AAATTCACTC TATTGCCGCA
ATTAACGGCA ACACCGTGCG CCTTAGAGAA CCACTACACA TAAGCCTAAC TATAGGTTCT
ACGCCCATTC AGGTGCGTAG TTACAACATG ATAAATAACA TAGGTATAGA AGATATTCGC
TTTAAAGGTA ATTGGGATTC CTACCCCGAA GACTTCGATC ACCATAAAGA CGATATTCAC
GATTACGCAT GGAACGCACT AAGGTTAGAT AACGTAGAAA ACGGCTGGAT ACAAAACGTA
GAATTTAAAG ATTGGAACCA AGGTATTTAT ATAGACGGCT CTGCAGCACT TACCCTGCGT
AATATTACCT TTACCGGTAA GAAGGGGCAT ATGTCTATTC ATACGCGGCG CTCTTACGGT
GTATTAATAA AAGATAGTGC CGACCACGCA GGCCATCACC ACGGGCCGGG TGTTGGCTAC
TGGGGTTGCG GCACTGTGTA CCTGCGTTAC CAAATGTTAG CTGGCCAAAA TATAGATAGC
CATTCGGGCA GCCCCTACGC AACGCTGTTT GATAATGTAA CTAACGGTCA CCTTTCTAAC
AACGGCGGCC CACACGAAAG TTACCCGCAC CACGGTAAAC ATTTAGTCGC ATGGAATATG
ACTTTAGAAG GCGGGCCAGA TAGCTATAAT TTTTGGTCGG CGTCTCGCAA CGGCCACACC
TTCGCCATGC CGTATTTTAT TGGCTTGCAG GGTAAAAGCG TTACCTTTAC CGAAGGCACC
TATAGCGCTA ATGAATTGCT AGGGCAAATG GCCGAGCCTG CTTCTTTATT TGAAGCGCAA
CTTGCCTTGC GCCTTGGTAG CACTGCGCCC GAGCAACCAG AAGAAACAGA AGAACCAGAA
CAAGAGCCAG AGGGTGAAAC CGGTGGCGAG CAACAACCAA CAGAAGAAAC CCCTACACCA
CAACCCAATC AGCCAAACGC CAGCAACACA TCGGGTGGTG GAGCCATAGG TGGGCTATTG
CTCACATTGT TGATGGTGCT GGCAGCTACT ACCAAAATTA ATTACATAAC GCGCCGCACA
GGTAATTAA
 
Protein sequence
MRLLNNKTLR AISVLCSLTF VPCAIAEVAP LWLQFEAAKA NGDEPTLPDF SYAGYDYSES 
ELPDISSWAV FNVTDYGAIA NDENYDDQAI QLAIDAAQNA GGGVVMFPAG RFLVSPNETV
GENIFIRASN IVLKGAGAGD NGTEIFKVNK KVNNGEYIFE VSPTSTGESV ITTVVANAQR
ESFEIVVADA SQLSVGQRIL LRADSVELAQ SYYSPLTIRS EWTRLLNDGF NLREIHSIAA
INGNTVRLRE PLHISLTIGS TPIQVRSYNM INNIGIEDIR FKGNWDSYPE DFDHHKDDIH
DYAWNALRLD NVENGWIQNV EFKDWNQGIY IDGSAALTLR NITFTGKKGH MSIHTRRSYG
VLIKDSADHA GHHHGPGVGY WGCGTVYLRY QMLAGQNIDS HSGSPYATLF DNVTNGHLSN
NGGPHESYPH HGKHLVAWNM TLEGGPDSYN FWSASRNGHT FAMPYFIGLQ GKSVTFTEGT
YSANELLGQM AEPASLFEAQ LALRLGSTAP EQPEETEEPE QEPEGETGGE QQPTEETPTP
QPNQPNASNT SGGGAIGGLL LTLLMVLAAT TKINYITRRT GN