Gene Sde_2894 details


Gene Information

Locus tag: Sde_2894
Symbol:
ID: 3968062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3668186
End bp: 3671275
Gene Length: 3090 bp
Protein Length: 1029 aa
Translation table: 11
GC content: 45%
IMG OID: 637921991
Product: cellulose binding, type IV
Protein accession: YP_528363
Protein GI: 90022536
COG category:
COG ID:
TIGRFAM ID:
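
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3668186, 3671275)
print(length)                     # 3090
print(protein_length_aa(length))  # 1029
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.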


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0297101
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGTTA CCTGCCGATA CATTTATCGA CTAACTGCCG CACTGCTGCT TTGCAGTAGC 
TGTGCACTTG CCGCCAACAC TCCCCAAACG CAAAACGCCA CACCCGAAGC GCCCGGAAGC
CAAATAAGCA CAGCAGAGCA AGTTGCGACA ATTGAAGAAT TTTACGCAAA GCGACTTGTT
GGCCTTGAGC CCATTCGCGC TATTTTCCCT GCTGCATGGC TAACCGAAAC ACCTGCTTGG
AATGAAGAAA AAAATAAACT GCAACAGCAT CTAGCCGCAT TATCACAAGC AGACACCACA
AATACCTCCC CACACGAGCT GGAACTGACG CAGCTAAAAT TGATGTGGGT GCTATTTACA
CCCGCAGAGC GCAGCGATTT GATCAATTCA GCCAAAACCC AGCTGCTTGC TCAAACCAAC
AAACAAAATG CGCAAAAAAT ACTTGAACAA ACCCAAGCGG CAGCCAGCGA GCACGAAAAG
CAACTGCAGC AGGTTAAACA CGAACTGGCC CAAAGTGAGG ACGACAGAGA AAAGCGCTTG
CTGGCTATAC AATTAGGTAT CGAAAAGCGC ATTGAGTTAA TAACGGCCAA AACCAAACTG
ATCGCCGATG CGCAAATAGC ATTTAGCCAA GATGTAAGCG ATTGGAAAGA AATTCAATCT
CTCATCAGCA ATGCGCTCGA TGACGAGCAC TTCATCCCCG GCACCCGAGA TTTCACCCTG
TATCGCAAAA CCCTGCGCGA CCGACTTGCG GCCTCTAACC GCGCCGTAGC CAACAAACAT
GTATTTGTTA CTACGTACTT TCGCGACGAC ATGAACCTGC CAGAAGAGGA GCAAAGCTTT
AATGTGCAGG TAAGAGAAAC AGACAGTGAT AAAAACGTCG AGCGAGTCAA TACTGTACTT
AGAAAGCGTC TTGAACTCAG TCAAAGTGAA GAGAATTTTA TCGCTAAAAA GCAAACGCTA
ATCGTAGACC TCGCGGTTTG GCACAAACAA TATCGCACCG ATCTATTACT TCTGCGTGGC
AAACTAATCA AGCAAGTTAT CGACAATAAC GCTTTTTCGC TTAACAACTT CGACCCAATT
GCAGTCGAGT TACGCACGCT TTACGCCTCC ACGCTTTTCT CTAAATGGAG CAAAGCTTAC
GTTAAAACCT ATGGCACAAG TAAAAAAACA TCGTTAACGT TTTCTAATAT TACCCATCTG
CTAAAAGTAC TGCTTACCAC GTGTTTTATT GTGTGGTTAT TTCTAAAACG GCAGCTGGTG
CTGGATAGCG CCAAACGCTG GTGCCTTACC CGCACTAGTA ACGCGCGTTG GCGTAAAGCC
ACCGCACTTT TGTTTGGTGC ACTACAAGAA TTGTATATAT TTATAATTTT GTTTTTCTTT
GGGGGTAGCG TAATAAAAAT ACTGGTAAGC GTAGGCATTA GCTCTGCCGA AATGTTCAAA
CCGGTACTAA ATAAAATTGT ATTGTTCTTT CTACTACTTG GGCTTATTCA ATATGTACAA
CCTTTTTTAA GCCAGCGCGA ACAGCGCAAG GGTAGAGATA CACATGAAAT AGCCGCCATT
GAAGAGGTTT TTTCTTTAGT GCCTCAAGTT TACCTTTACT ACTGGTTGGC TGCGGGCGTT
GTGGCAAGCT TAATATCTCA ACACTTAAAT GAAAGCTTAC TGGGCTTTCA CGCAGTCAAT
GTTATCACCT TCGCATTTGC GATAGCCTTG CTTGTTATTA TATGGGTGCG CCGTCACACT
TGGCGAACCA TAAACGAAAA AGCCTATAAT TCGGAGTTGT GGCAGAAAAT TAGCTTCAAT
GCACGTCGTA AACCCTGGGA ACCTCTAATT CTGCTTATTG GCGGCGGCAT GGGAGTTTAC
CGTGTTGTTT GGTCCGTTTT GTTAGACAGG CTAACAGAGC TAGAGCTCAC CCGCAGCTTT
CAGGCTATGG TAAGCCGTGC GATATTAGAA AGGCAGTACC GCAAAACCAC CACAAAACTC
TATGCAGAAC GCTTCCCCGA TAAATATTGG CGAAACTTCC ATTTTCAAAC CCCAGCAGAA
GCCCACTGGT ACGTTGAGCG CTCGGAACAC CAAGAGGTTA TTCAGGCAGC CTATGAGAAC
TGGCAGAGTA AAGGCAAAGC AACACGTTTA CTGATTTGTG GCGACCGAGG TATTGGTAAA
TCTGAGTTAA TCTCCAACTT CTTACGTAAT GAAGAAATAA AGTGCTTGCA CACACATATA
GACACAGGTG AAACATCCGT AGCTGCGGTA TGCCAGCGAT TATCGCTGTC GTTTTTACAA
TCGCGTATGG ATACACCCGA AGATATTATC AATGCGCTAA AACTGATGGA ACCACAGGTA
TTGTGCGTAG AAAATATTGA AAACACTATT TTGCGCAAAG TGGGTGGCTT TGCCGCCTAT
ACGGCCGTAA TTGATATTAT TTTACAAACA TCTGACAAAC ACTTTTGGCT GGTAACCTGC
ACAAGCTACG CATGGACAAT AGTACAGCAC GGCGTAATTG GTGCAGACTG CTTTACCGAA
AACATAATGG TAGAAGGCTT AAGTGAAGAA GCATTAAAAA CTGCCATGCT AGCAAGGCAT
AACGATGCAC ACCCTGCTAC CCCGGACTTT AGCCAGCTAA ATTTTAGCAA TCCCAAACAG
GGGCGGCTTC GCGATAAACT GCAACAGCAA GATGCAAGTG AAAAAGGCGA AGAGCTGTAT
TTTCGAATAC TGTGGGACTA CACCAAAGGT AACCCAAGGC AAGCACTGTA TTACTGGAAG
GCCTCTCTCG CATGGGACGG ATTAAAATGC ACGGTACGAT TATTTGAAAT ACCCGAACAC
AGGGTGCTCG AAACCCTACA AGATAGAGCA CTTATGCTAC TTGCCGGTTT AATAGAACAT
AACGGCTTAA CCTTAAGAGG CATGCAAGAA ATAATGAATT GCCCAGAAGC CACGGTGCGC
AGACACATTG AAGAGCTAAC ACCTTACGGT ATTGTATTTA GCATAGAAAA CGACGACTCC
TGTGGCTGGC ACGTGGAAAG CTTTTGGACC CGTGCAGTAG AAAATTATCT AGAAAAAAGA
CAATTGCTTT TTAAAGGAGC AGCACTATGA
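
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 bases per row, so it has to be flattened before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used as sample input.
first_row = "ATGACTGTTA CCTGCCGATA CATTTATCGA CTAACTGCCG CACTGCTGCT TTGCAGTAGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence block through `clean_sequence` and `gc_percent` gives values that can be compared with the Gene Length and GC content fields of the record.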
 
Protein sequence
MTVTCRYIYR LTAALLLCSS CALAANTPQT QNATPEAPGS QISTAEQVAT IEEFYAKRLV 
GLEPIRAIFP AAWLTETPAW NEEKNKLQQH LAALSQADTT NTSPHELELT QLKLMWVLFT
PAERSDLINS AKTQLLAQTN KQNAQKILEQ TQAAASEHEK QLQQVKHELA QSEDDREKRL
LAIQLGIEKR IELITAKTKL IADAQIAFSQ DVSDWKEIQS LISNALDDEH FIPGTRDFTL
YRKTLRDRLA ASNRAVANKH VFVTTYFRDD MNLPEEEQSF NVQVRETDSD KNVERVNTVL
RKRLELSQSE ENFIAKKQTL IVDLAVWHKQ YRTDLLLLRG KLIKQVIDNN AFSLNNFDPI
AVELRTLYAS TLFSKWSKAY VKTYGTSKKT SLTFSNITHL LKVLLTTCFI VWLFLKRQLV
LDSAKRWCLT RTSNARWRKA TALLFGALQE LYIFIILFFF GGSVIKILVS VGISSAEMFK
PVLNKIVLFF LLLGLIQYVQ PFLSQREQRK GRDTHEIAAI EEVFSLVPQV YLYYWLAAGV
VASLISQHLN ESLLGFHAVN VITFAFAIAL LVIIWVRRHT WRTINEKAYN SELWQKISFN
ARRKPWEPLI LLIGGGMGVY RVVWSVLLDR LTELELTRSF QAMVSRAILE RQYRKTTTKL
YAERFPDKYW RNFHFQTPAE AHWYVERSEH QEVIQAAYEN WQSKGKATRL LICGDRGIGK
SELISNFLRN EEIKCLHTHI DTGETSVAAV CQRLSLSFLQ SRMDTPEDII NALKLMEPQV
LCVENIENTI LRKVGGFAAY TAVIDIILQT SDKHFWLVTC TSYAWTIVQH GVIGADCFTE
NIMVEGLSEE ALKTAMLARH NDAHPATPDF SQLNFSNPKQ GRLRDKLQQQ DASEKGEELY
FRILWDYTKG NPRQALYYWK ASLAWDGLKC TVRLFEIPEH RVLETLQDRA LMLLAGLIEH
NGLTLRGMQE IMNCPEATVR RHIEELTPYG IVFSIENDDS CGWHVESFWT RAVENYLEKR
QLLFKGAAL