Gene Sde_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3003 
Symbol 
ID3967764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3828980 
End bp3832483 
Gene Length3504 bp 
Protein Length1167 aa 
Translation table11 
GC content50% 
IMG OID637922100 
Producthypothetical protein 
Protein accessionYP_528472 
Protein GI90022645 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.25795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000812606 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAATTA AACGTTGGCC GTTCGACCGA AAAGGCCCAC CTAAAAAACC TAACGCTAAA 
AAATTACTCG CAAGCTTAGC GGCTGCACTA AGCTTAACCG CCATGCAAAG CACTGCAGCG
GTAGAGCCAT TACAAACCAG CGGCAATCAA ATTCTTGTTG GCAACCAAGC CAAAGCCCTT
GGCGGCCACA GCTTGTTTTG GCATAACGTG CCGGCAGCAG GCAGCTTATA CAATGCAGAT
ACAGTAAGCA GGCTTAAGAA TGATTGGAAC TCCAAGGTTA TTCGGGCCGC AATTGGGGTT
GAAGTACCTT TCAATTCAGA AAACACCTAC ATAGGCAATA AGGGCAGCTC GCTGGCCGCA
ATAGACCGCG TAGTTAATGC CGCTGTTGCC AACGATATGT ATGTGATTAT CGATTTTCAT
ACTCACCATG CAGATCAAGT AGAAAACGTT GCCCACGACT TTTTCAACGA AGTTTCTAGC
CGTTACGGTC ATTTAAACAA TGTTATTTAT GAAGTATTTA ACGAGCCAGA ATGGTGTGGC
GAGCACGGTC GGTGGGCATC TACCATTAAG CCCTACGCCG AGCGCGTTAT CCAAACCATT
CGCAACAATG ACCCAGACAA CCTAGTAATA GTAGGCACTA CCTGTTTCTC GCAAGATGTA
GATGTAGCCG CAGCCGACCC CATTAACGAT GTAAACGTGG CCTATACGCT ACACTTTTAC
GCAGCCACCC CTGCCCACCA GCAACCCTTG CGCGACAAGG CCCAAACCGC GCTCGACCGC
GGCGCGCCAC TATTTGTAAC CGAATGGGGT ACAACCACAT TTACAGGTGA TGGTTTTGTA
GATGAGGCGC AAACGCGCAC ATGGATTAAC TGGTTAAACG AACGCGGTAT TAGCCACGTT
AACTGGTCGG CGTCTACCCA GCCAGAAAGC TCAGCTATAT GGAATGGCGA CATGACCTAC
AAGCATTCGG GCTTATTGGT TGGCGAACTG GTGCAACAAA CAAATGGCAC AACCACGCCA
CCAACCGGTG AAATAAGTGG CCCGTGCGAT TTACATTTTG TACCTGCCAA AGCCGAGGCT
GAAAGCTTCT GTACCGCCAA AGGCATTCAA TTTGAAACCA CCACCGACAC GGGCGGCGGC
CAAAACATGG GCTGGCTAGA TGCCGGCGAC TGGGTAACTT TTGATGTAGA TGTACCTGCT
AGCGGCCAAT ATTTAATAGA TTACCGCGTA GCATCAGAGC TAGGTGATGG TCGGTTCCGC
ACCGAAGCCG CCAACGGCAC TGCCCTTGGC ACAATATCTG TACCCAATAC CGGCGGCTGG
CAGAATTGGC AAACGCACAC ACACACAGTG CAACTCTCGC AAGGCACACA AACCGTTAAA
CTAGTTGCCG AAACTGGTGG CTGGAACTTA AATTGGTTTG AAGTGCGCGC AGGTGAGGTG
TGCGAAGGCG CTGACTGCCC ATGTGAAGGA GCCGAATGCC CTTGCCCAGA TTGCAACGGC
ACACCGGTTA AGTTTGAGGC AGAAACGTTT GTGGCTATGC AAGGCGTGCA GCTAGAAAAC
ACATCCGATG TGGGCGGCGG CCAAAACGTT GGCTACATTG ATAGCGGCGA CTGGATAACT
TACAACGGGG CCTTGCCCGC AAGTGCAGAC AACCGCTATG TAGTGTCTTA TAGAGTAGCG
CGTCAACCTA GCGGCAATGC CAAATTTAAA ATAGAACAGC CAGGTGGAGC AGCGGTATAT
GGCGAAATTT CGGTGCCCAG CACCGGCGGC TGGCAAACAT GGACAACCAT TAGCCACACC
ATAACAATTC CCGCTAACGC AAACGGCTTT GCACTAGCAG CAATAGATGG CGGTTGGAAT
ATAAACTGGA TAGAAATAAA ACCGGCGACC ACTCAACCAC CCGAGCCAAT CAACCCGTTA
AAACTTCAAG CTGAAGATTA CATCAACTTT AACGACACCA CCCCCGGTAA CGAAGGCGGT
GCACACAGAA GCGATGATGT AGATATTCAA GCAACTACCG ATACCGGTGG CGGTTTTAAT
GTTGGCTGGG TAGACGCTGG CGAATGGCTA GAGTATGAGT TCTTTTTAGA GTCTCCTGAT
TTTTATGCAG CTGATGTACG GGTTGCTTCA GACCAAACTG GCGGCGCACT GCAACTACAA
ATAGATGGCC AAAACGTTGG CCAAGCCATT ACCGTTGGCA ACACCGGTGG CTGGCAAGCG
TGGACAACCA AAAACACACT CATTGGCGAC CTAAGTGCAG GCACCCACAC GTTGCGTGTA
TACGCGCAAA GCGGCCCATT AAATTTAAAC TGGGTAGAGC TAAAGCGTAC AACGCCCGCA
CCAGCCACTT CGTGTTTTAA TATTGCCGAA GACCGCTTAA ACGTTCACCT AGATGCGCAC
TGTACTGCAG GCAGCAACCT GCAATACAAT TGGGATTTTG GTGACGGCAA CAGCGCAACC
GGCGTAGCCA CTAGCCACAG CTACTACACT AGCGGCACTT ACACCATTAC CTTAACCGTT
AGTGATACCC GCACCACAGA CACCTCTAGC CAACAGGTAA CGGTAGATTT TTCTGCCCCT
GCAGGCCCTG TGGATTTTTA CGGCGAACTA ATGGTGAATG GCAACCGCAT TCACGGCGAA
AAAACCGGCG AACCCGCACA AGTACGCGGC ATGAGCTTTT TTTGGAGCAA CACCGGTTGG
GGCCAAGAAA AATGGTGGAA CGCCAGCACC GTGGACCGCA TGGTTGATGA GTTCAAAGTA
GAACTTGTGC GCGGCGCAAT GGGCACTGAT GAAGGCGGCG GTTATTTACA CGACGCGTCT
AATAAGGCTC GCTTACAAGC AGTTGTTGAA CAAGCCATTG CACGCAATGT GTATGTAATT
ATCGACTGGC ACACCCACCA TGCCGAAGAT AACATTGCCG AAGCCATTAC ATTCTTTAGC
GAAATGGCGC AGCTTTATGG CCACCACGAC AACGTGATTT TCGAGATTTA CAACGAGCCA
TTAAACACCA CAAGCTGGGG CACTATTAAG CACTACGCTG AACAAGTTAT TCCTGCTATT
CGCGCTCATT CCGATAATTT AATTGTTGTG GGCACGCGCA CCTGGTCGCA AAACGTAGAC
GAAGCCGCGT TCGATAAAAT TAACGACAGC AACACCGCCT ACGCCCTGCA CTTTTATGTT
GGCTCGCACG GCAACCACGT TCGCAACCTA GCACAAACCG CACTAAACAA CGGCGCGGCT
ATTTTTGCTA GCGAATGGGG AATTTGGCCA AACAACAACT ACGATGGCAT GAACGCCGAC
GATTGGATGA ACTTTTTAGA CCAAAACAAA ATATCTTGGG CTAACTGGGC CATATCCGAC
AAAGTAGACC CCAACACAGG CCAACTAGAA CCACCCAGCA TGTTCAACCC AGACGGCAGC
CTAAGCAGTA ATGGTCAATA TGTAGTGAAC AAACTAAATG AATACGCAGC ACAAGCACCG
TGGAGGGAGG CAATCGCTAA TTGA
 
Protein sequence
MTIKRWPFDR KGPPKKPNAK KLLASLAAAL SLTAMQSTAA VEPLQTSGNQ ILVGNQAKAL 
GGHSLFWHNV PAAGSLYNAD TVSRLKNDWN SKVIRAAIGV EVPFNSENTY IGNKGSSLAA
IDRVVNAAVA NDMYVIIDFH THHADQVENV AHDFFNEVSS RYGHLNNVIY EVFNEPEWCG
EHGRWASTIK PYAERVIQTI RNNDPDNLVI VGTTCFSQDV DVAAADPIND VNVAYTLHFY
AATPAHQQPL RDKAQTALDR GAPLFVTEWG TTTFTGDGFV DEAQTRTWIN WLNERGISHV
NWSASTQPES SAIWNGDMTY KHSGLLVGEL VQQTNGTTTP PTGEISGPCD LHFVPAKAEA
ESFCTAKGIQ FETTTDTGGG QNMGWLDAGD WVTFDVDVPA SGQYLIDYRV ASELGDGRFR
TEAANGTALG TISVPNTGGW QNWQTHTHTV QLSQGTQTVK LVAETGGWNL NWFEVRAGEV
CEGADCPCEG AECPCPDCNG TPVKFEAETF VAMQGVQLEN TSDVGGGQNV GYIDSGDWIT
YNGALPASAD NRYVVSYRVA RQPSGNAKFK IEQPGGAAVY GEISVPSTGG WQTWTTISHT
ITIPANANGF ALAAIDGGWN INWIEIKPAT TQPPEPINPL KLQAEDYINF NDTTPGNEGG
AHRSDDVDIQ ATTDTGGGFN VGWVDAGEWL EYEFFLESPD FYAADVRVAS DQTGGALQLQ
IDGQNVGQAI TVGNTGGWQA WTTKNTLIGD LSAGTHTLRV YAQSGPLNLN WVELKRTTPA
PATSCFNIAE DRLNVHLDAH CTAGSNLQYN WDFGDGNSAT GVATSHSYYT SGTYTITLTV
SDTRTTDTSS QQVTVDFSAP AGPVDFYGEL MVNGNRIHGE KTGEPAQVRG MSFFWSNTGW
GQEKWWNAST VDRMVDEFKV ELVRGAMGTD EGGGYLHDAS NKARLQAVVE QAIARNVYVI
IDWHTHHAED NIAEAITFFS EMAQLYGHHD NVIFEIYNEP LNTTSWGTIK HYAEQVIPAI
RAHSDNLIVV GTRTWSQNVD EAAFDKINDS NTAYALHFYV GSHGNHVRNL AQTALNNGAA
IFASEWGIWP NNNYDGMNAD DWMNFLDQNK ISWANWAISD KVDPNTGQLE PPSMFNPDGS
LSSNGQYVVN KLNEYAAQAP WREAIAN