Gene Sde_2989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2989 
Symbol 
ID3967750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3798591 
End bp3800015 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content42% 
IMG OID637922086 
ProductRicin B lectin 
Protein accessionYP_528458 
Protein GI90022631 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3264] Small-conductance mechanosensitive channel 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000129425 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTTA CACACATCCT TGTCATTCTT ACAATTTGCA TACCCTTTTC AAGCACCCTT 
GCGCAAAATG ACTCCCCCAA AGAGCATAAG CCTCCTGCGC ATACAGAAAA AAGCATTCAC
GAAGCAAGTA TAGAAAATGT CTCTCGCATC ATTGAGGTGG AAGACGGACC TAGTGACGAA
AAACTCAAAA AACGATTGCA ATCAATTCTT CAAAGCTCAG AAAAATACCA AAATTTAGAG
GTTAGGGTCG CCAATGGCTT GGTTACTATT TACGCGATAG CAGATAACGA AAAAGACGTA
GATTGGGCTG GTGACATAGC GAAGAATATT GAAGGCGTAG TAGCGGTTAT TAATAACATT
TCAACCCCTA AACAAGATTA TTTCACGTTA GCCCCTGTGC GAGCTGAGCT ACTAGCAGTT
TGGATAAAAT GCCTAGAAGC AGTGCCTCTT GTGATAATTG GCGCCCTCAT ACTTTCTACA
TTTATTTTTA TTTCACGCTA TAGCTCTTAT ATCTTAGACA AGCCCATCAA CTACTTATCT
CAAAGCGAGT TAATTCGTAT TGTTTTAAAA CGTGTCATCT CCACGCTTAT TATCATAGTC
GGGTTTTACT TTTTTCTTAA AACGGCCGGC CTTACCCAAT TTGCACTAGC TATAATTAGC
GGCACGGGTG TAATTGGTTT GGTGCTAGGT TTTGCTTTTC GCGACATAGC CGAAAACTTT
ATATCAAGCT TGCTGCTTAG CGTGCAGCGT CCATTTAGGC TTGGCGATGT GGTAGAGGTA
AGTGGCCACA AAGGTATTGT AAGAAAAGTA ACCGCCAGAG GAACAACATT AGTAGATTTT
GATGGCAACC ACATACAAAT ACCTAATGCA ATTGTATATA AAAATATTAT TCAGAACTTC
ACTGCCAACC CTAATCAGCG CGGCAAATTT ATTATTGGCA TTGGCTACGA CGCGAGCGTA
CAAGGCGCGC AAACAATCGC TGTGGGGGTA GTGCAAAATC ATTTCGCCGT TTTGCAGGAC
CCAGAACCTC AAGTACTTAT AGATCAACTA GGCTCGTCAA CTATTAACCT ACAAATTTTC
TTTTGGGTAA ACGGCCATGA ATATAGTCTA CCCAAGGTTT CATCCATGCT CATGCGCCAA
GTTATGCGAG AATTCGAACG CAACGGAATA TCAATGCCCG ACGACGCACG AGAAATAATA
TTCCCTGAAG GTGTGCCTGT TTTTATGCAA GGTGAAAAAA CGTCCCTCAC AAATCAAGCC
TCGCCCCCTC TGTCATCCTC CCATCAGATA CCACGTAACG CGCCCATAGA AAAAGACATA
TCACCCAATA ACGAACAAGA AGACCTTAGT AGCGACAACC TAGATATTCA ACGCCAAGCA
GATATGGCAA GAGACCCAGA AGAAGGGGCG AGTATAATCA AATGA
 
Protein sequence
MKLTHILVIL TICIPFSSTL AQNDSPKEHK PPAHTEKSIH EASIENVSRI IEVEDGPSDE 
KLKKRLQSIL QSSEKYQNLE VRVANGLVTI YAIADNEKDV DWAGDIAKNI EGVVAVINNI
STPKQDYFTL APVRAELLAV WIKCLEAVPL VIIGALILST FIFISRYSSY ILDKPINYLS
QSELIRIVLK RVISTLIIIV GFYFFLKTAG LTQFALAIIS GTGVIGLVLG FAFRDIAENF
ISSLLLSVQR PFRLGDVVEV SGHKGIVRKV TARGTTLVDF DGNHIQIPNA IVYKNIIQNF
TANPNQRGKF IIGIGYDASV QGAQTIAVGV VQNHFAVLQD PEPQVLIDQL GSSTINLQIF
FWVNGHEYSL PKVSSMLMRQ VMREFERNGI SMPDDAREII FPEGVPVFMQ GEKTSLTNQA
SPPLSSSHQI PRNAPIEKDI SPNNEQEDLS SDNLDIQRQA DMARDPEEGA SIIK