Gene Sde_2636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2636 
Symbol 
ID3968533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3335102 
End bp3336967 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content47% 
IMG OID637921734 
ProductDNA mismatch repair protein 
Protein accessionYP_528108 
Protein GI90022281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000119439 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.352688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAC ATCAATTCAG CAAAGCGCTG CGTGCGCTAG GCTTTGGTGG GGCTGTGTTT 
GCGGCATCGC TAATGGCTAG CCAAGCAAGT GCCCTTGAGT GTGAGCATTC AATCAGTAAT
GATTGGGGCG CCGGCTTTAC CGGTGCAATG AAAGTTACCA ATAATGACTC TAGCCCCATT
ACCGGTTGGC GGGTCGAATG GGCGTATAGC GGCAATGTAA ATATTGTTAA TTCGTGGAAC
GCCTCAGTAA CAAAAGGCAG TAATTATGTT GCCGTAGATG CCGGATGGAA TGGTAATTTA
CAGCCGAGCC AATCTACCGA ATTTGGCTTA CAGGGTGATG GCGCCGATAG AAATGTAACC
ATTATTAGTT GTGTTGCCGA AGGCGGATCA TCTTCTAGTT CATCAAGTTC TTCCAGCTCC
TCAAGTAGTT CTTCATCTAG CTCAAGTACT TCTTCATCGA GTAGCTCAAG TTCCTCGACG
AGCTCTTCTT CTAGTTCGAC TTCTAGCTCT TCTTCAAGCA CCTCTTCTAG CTCATCGTCC
AGTTCATCAA GCTCTTCTTC GGGCGGCAAC TGTGTTGCAA TGTGTAATTG GTACGGTGAA
AACCGCCCTG TTTGTGCCAA TCAAAATACT GGTTGGGGGT GGGAAAACAA CCAAAGCTGT
ATAGGTGCAA ACACCTGTAA CGATCAATGG GGCGACGGGG GCGTGGTGTC CAGCTGTGGT
ACGTCTAGCT CTTCATCCAG TTCTTCGTCC AGTTCGTCTA CCAGTTCATC CTCGTCTTCT
AGCTCGAGCA CCAGCTCTAC AAGCAGCTCA TCAAGCTCTA GTTCGTCGTC TGGTGGGTTA
AGCGCGGTAG AGTTTTCGCA GCAAATGGGC TTGGGGTGGA ATCTTGGAAA CTCCCTAGAA
GCGATTGGTG GCGAAACCGC GTGGGGCAAC CCAATGGTTA CGCAGCAATT AATTAACTCC
ATAAAAGCTG CTGGGTTCGA CACTATTCGC ATTCCGGTTG CGTGGAGCCA ATTCTCGGAC
GAAGCTAATT TTGTTATCAA TAGCAATTGG ATTGCACGCG TAGAAGAAGT AGTGAACTAC
GCATTGAGCG CCGATATGTA CGTGGTAATG AACCAACATT GGGACGGCGG TTGGATGCAG
CCCACATATG CACAGCAAGA ATATGTTAAC AATCGCTTGC AAATTATGTG GACGCAAATA
GCTAATCACT TTAAAGATTA CGATAGTCGC TTACTGTTTG CAGGCACCAA CGAAGTGATG
GTGGAAGGCG ATTACGGTAC GCCCACCTTC GAATACTACA CAGTACAAAA TAGCTTTAAC
CAAACGTTTG TGGATGCTGT ACGTGCAACC GGTGGCGCTA ATGCTAGCCG TTACTTAGTG
GTACAGGGGT TTAATACCAA CATAGATCAC ACGGTGAACT TCGCGGTAGT GCCAACCGAC
CCGGCAACAA ACAGGTTAAT GATGGAAGTA CACTATTACG ACCCCTATAA CTTTACGTTA
AATACCAACA GCAACATTAC TCAGTGGGGC GTAATTGCAA CTGACCCTAG CGTTACCGAA
ACATGGGCGA ATGAATCTTA TGTGGATGCG ACTTTCCAAA AAATGAAAAC TAACTTCGTT
GATCAAGGTA TAGCGGTAAT TTTAGGTGAG TACGGGGTTG TATCGCGCGC GAATGTGGCC
GGGCACGAAA CTTACCGAGA GTATTGGAAC CAATACATTA CTCAATCTGC GGTAGATCAT
GGAATGGTGC CTATTTATTG GGATAACGGT TATTCCGGTG ATGGTGGTAT GGCATTGTTT
GATCGCGCCA GTGGCAATCA ACTTTACCCC AATATTATTA ACGCAATTAT CAATGCCGGT
AACTAA
 
Protein sequence
MLKHQFSKAL RALGFGGAVF AASLMASQAS ALECEHSISN DWGAGFTGAM KVTNNDSSPI 
TGWRVEWAYS GNVNIVNSWN ASVTKGSNYV AVDAGWNGNL QPSQSTEFGL QGDGADRNVT
IISCVAEGGS SSSSSSSSSS SSSSSSSSST SSSSSSSSST SSSSSSTSSS SSSTSSSSSS
SSSSSSSGGN CVAMCNWYGE NRPVCANQNT GWGWENNQSC IGANTCNDQW GDGGVVSSCG
TSSSSSSSSS SSSTSSSSSS SSSTSSTSSS SSSSSSSGGL SAVEFSQQMG LGWNLGNSLE
AIGGETAWGN PMVTQQLINS IKAAGFDTIR IPVAWSQFSD EANFVINSNW IARVEEVVNY
ALSADMYVVM NQHWDGGWMQ PTYAQQEYVN NRLQIMWTQI ANHFKDYDSR LLFAGTNEVM
VEGDYGTPTF EYYTVQNSFN QTFVDAVRAT GGANASRYLV VQGFNTNIDH TVNFAVVPTD
PATNRLMMEV HYYDPYNFTL NTNSNITQWG VIATDPSVTE TWANESYVDA TFQKMKTNFV
DQGIAVILGE YGVVSRANVA GHETYREYWN QYITQSAVDH GMVPIYWDNG YSGDGGMALF
DRASGNQLYP NIINAIINAG N