Gene Information |
Locus tag | Sde_2636 |
Symbol | |
ID | 3968533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3335102 |
End bp | 3336967 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637921734 |
Product | DNA mismatch repair protein |
Protein accession | YP_528108 |
Protein GI | 90022281 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
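The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3335102
end_bp = 3336967
gene_length_bp = 1866
protein_length_aa = 621

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1


def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length, and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


if __name__ == "__main__":
    print("span (bp):", span_bp)
    print("expected protein length (aa):", expected_protein_length)
    print("consistent:", record_is_consistent())
```

Under these assumptions the reported gene length and protein length both follow from the start and end coordinates.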
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000119439 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.352688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAAC ATCAATTCAG CAAAGCGCTG CGTGCGCTAG GCTTTGGTGG GGCTGTGTTT GCGGCATCGC TAATGGCTAG CCAAGCAAGT GCCCTTGAGT GTGAGCATTC AATCAGTAAT GATTGGGGCG CCGGCTTTAC CGGTGCAATG AAAGTTACCA ATAATGACTC TAGCCCCATT ACCGGTTGGC GGGTCGAATG GGCGTATAGC GGCAATGTAA ATATTGTTAA TTCGTGGAAC GCCTCAGTAA CAAAAGGCAG TAATTATGTT GCCGTAGATG CCGGATGGAA TGGTAATTTA CAGCCGAGCC AATCTACCGA ATTTGGCTTA CAGGGTGATG GCGCCGATAG AAATGTAACC ATTATTAGTT GTGTTGCCGA AGGCGGATCA TCTTCTAGTT CATCAAGTTC TTCCAGCTCC TCAAGTAGTT CTTCATCTAG CTCAAGTACT TCTTCATCGA GTAGCTCAAG TTCCTCGACG AGCTCTTCTT CTAGTTCGAC TTCTAGCTCT TCTTCAAGCA CCTCTTCTAG CTCATCGTCC AGTTCATCAA GCTCTTCTTC GGGCGGCAAC TGTGTTGCAA TGTGTAATTG GTACGGTGAA AACCGCCCTG TTTGTGCCAA TCAAAATACT GGTTGGGGGT GGGAAAACAA CCAAAGCTGT ATAGGTGCAA ACACCTGTAA CGATCAATGG GGCGACGGGG GCGTGGTGTC CAGCTGTGGT ACGTCTAGCT CTTCATCCAG TTCTTCGTCC AGTTCGTCTA CCAGTTCATC CTCGTCTTCT AGCTCGAGCA CCAGCTCTAC AAGCAGCTCA TCAAGCTCTA GTTCGTCGTC TGGTGGGTTA AGCGCGGTAG AGTTTTCGCA GCAAATGGGC TTGGGGTGGA ATCTTGGAAA CTCCCTAGAA GCGATTGGTG GCGAAACCGC GTGGGGCAAC CCAATGGTTA CGCAGCAATT AATTAACTCC ATAAAAGCTG CTGGGTTCGA CACTATTCGC ATTCCGGTTG CGTGGAGCCA ATTCTCGGAC GAAGCTAATT TTGTTATCAA TAGCAATTGG ATTGCACGCG TAGAAGAAGT AGTGAACTAC GCATTGAGCG CCGATATGTA CGTGGTAATG AACCAACATT GGGACGGCGG TTGGATGCAG CCCACATATG CACAGCAAGA ATATGTTAAC AATCGCTTGC AAATTATGTG GACGCAAATA GCTAATCACT TTAAAGATTA CGATAGTCGC TTACTGTTTG CAGGCACCAA CGAAGTGATG GTGGAAGGCG ATTACGGTAC GCCCACCTTC GAATACTACA CAGTACAAAA TAGCTTTAAC CAAACGTTTG TGGATGCTGT ACGTGCAACC GGTGGCGCTA ATGCTAGCCG TTACTTAGTG GTACAGGGGT TTAATACCAA CATAGATCAC ACGGTGAACT TCGCGGTAGT GCCAACCGAC CCGGCAACAA ACAGGTTAAT GATGGAAGTA CACTATTACG ACCCCTATAA CTTTACGTTA AATACCAACA GCAACATTAC TCAGTGGGGC GTAATTGCAA CTGACCCTAG CGTTACCGAA ACATGGGCGA ATGAATCTTA TGTGGATGCG ACTTTCCAAA AAATGAAAAC TAACTTCGTT GATCAAGGTA TAGCGGTAAT TTTAGGTGAG TACGGGGTTG TATCGCGCGC GAATGTGGCC GGGCACGAAA CTTACCGAGA GTATTGGAAC CAATACATTA CTCAATCTGC GGTAGATCAT GGAATGGTGC CTATTTATTG GGATAACGGT TATTCCGGTG ATGGTGGTAT GGCATTGTTT 
GATCGCGCCA GTGGCAATCA ACTTTACCCC AATATTATTA ACGCAATTAT CAATGCCGGT AACTAA
|
Protein sequence | MLKHQFSKAL RALGFGGAVF AASLMASQAS ALECEHSISN DWGAGFTGAM KVTNNDSSPI TGWRVEWAYS GNVNIVNSWN ASVTKGSNYV AVDAGWNGNL QPSQSTEFGL QGDGADRNVT IISCVAEGGS SSSSSSSSSS SSSSSSSSST SSSSSSSSST SSSSSSTSSS SSSTSSSSSS SSSSSSSGGN CVAMCNWYGE NRPVCANQNT GWGWENNQSC IGANTCNDQW GDGGVVSSCG TSSSSSSSSS SSSTSSSSSS SSSTSSTSSS SSSSSSSGGL SAVEFSQQMG LGWNLGNSLE AIGGETAWGN PMVTQQLINS IKAAGFDTIR IPVAWSQFSD EANFVINSNW IARVEEVVNY ALSADMYVVM NQHWDGGWMQ PTYAQQEYVN NRLQIMWTQI ANHFKDYDSR LLFAGTNEVM VEGDYGTPTF EYYTVQNSFN QTFVDAVRAT GGANASRYLV VQGFNTNIDH TVNFAVVPTD PATNRLMMEV HYYDPYNFTL NTNSNITQWG VIATDPSVTE TWANESYVDA TFQKMKTNFV DQGIAVILGE YGVVSRANVA GHETYREYWN QYITQSAVDH GMVPIYWDNG YSGDGGMALF DRASGNQLYP NIINAIINAG N
|
| |