Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_1135 |
Symbol | |
ID | 3968322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 1469959 |
End bp | 1471590 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637920206 |
Product | MSHA biogenesis protein MshL |
Protein accession | YP_526609 |
Protein GI | 90020782 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02519] pilus (MSHA type) biogenesis protein MshL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00664047 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACA AGTGGATTCT ACTAAAAGGC TGCGTGTTAA TTGCGACAGC ACTATTAATT AGCTGTTCGT CGGTGCCGCA AACAGAAACC CCTGCAGAGC AAGCGCTGGA AACCGCAAGT GCAACTGGTA CAGAGCCCGC CAGCGCCATG CCCGATTCAG TTACTGCCGC CTTATTTAAA GGAGCAAATA GTCGGCCTCA AATAGAAGAG GAAGAGCGGT TTGATATTTC TGTTAATCAA GTGCCTGCAC GCGACTTCTT TATTGGCTTA GTGGCCGACT CTGGGGTGAA TGTTGTCACC CACCCAGAAG TTAACGGCGT GATATCGCTC GATTTAAAAA ACGTAACGCT GCGCAATGTG CTCGATGTTA CTCGCGATGT GTACGGTTAT GAATACAAAT ACAACGGCGG CATTTATAGC ATTTACCCTA GTAAAATGCG CACAGAATTA TTTGAAATAA ATTATATAGA TGTACAAAGA GAAGGCTCGA CAGATACTAG TGTGCTAATT GGTGAAATAA CGTCTAGCGG TGGCAATGGG AGCAATCAGG GCGGTAACAG TAGCAATCAA TCACAGGGTA AGCAGAGTAA TCAAGATAGC AGCTCAGGTT CTCGTGTATC GACAAAAAAC AAAACAGATT TTTGGCAAGA CTTGAAATTA ACCTTATCGG CAATTGTGGG TGGCGAAGCC AATGGCCGCA ATGTGGTGGT TAATCCGCAA GCCGGTTTGG TGGTGGTGCG AGCTTTGCCT AGCGAAATAA GTGCAGTACG AGAATTTCTA GATCGATCTG AGCTTAGTGT TAGGCGCCAG GTTATTCTCG AAACAAAAAT AGTGGAAGTA AAGCTAAACG ATGGCTTTCA GGCTGGTGTG AATTGGAATG AAATTCGCGG CCAAATGCTG CTTACTAAAA ATGTAGAAAC CTTTGATGTA CCTGTAGATA TTGTTAGCGC CAGCGAAAAC GTAGGGGAAA TATTCTCATC TATATTTCGC ATTGGCGATA TCTCACAGTT GCTATCTTTA TTAGAAACAC AGGGCAACGT GCAGGTGTTA TCTAGCCCGC GCGTGTCGAC TGTAAACAAT CAAAAGGCGG TAATTCGTGT AGGCTCTGAC GAGTTCTTTG TTACTGGTAT ATCTAGCCAA ACTACTTCTA CTGCGGCCTC AACGACCAGC GCCCCCAATA TTGAACTTAC ATCCTTTTTT AGCGGTATTT CCCTCGATGT TACTCCACAA ATTGCAGATA ACGGCGATGT GATTTTACAT GTGCACCCAA TCGTGAGTGA GGTAACAGAC CAAAACAAAG ATATAACGCT TGGTAACGAG AAGTTCTCTT TACCGTTGGC GCTGCGCGAA GTGCGTGAGT CCGACAGTAT TGTTCGCGCT CAAAGCGGTC AAATTATTGT GCTTGGCGGG TTGATGAAAG AGAAACTTAA CGATGTATAC AGCAAGCGCC CCGGCCTTGG GGATGTACCT GTACTAAACA CTCTGTTTAG AAAGCGCAGC AAGGTTTCAG AAAAAACCGA ACTGGTTATT TTGCTGCGCC CAATTGTAGT AGAAGACAAC ACTTTCGCAG ATGACATTAA TCAAAGTCGT CAGCGTGTTA ATTCCATGTC AGATGAATAC AGAGGCCGCT AG
|
Protein sequence | MSNKWILLKG CVLIATALLI SCSSVPQTET PAEQALETAS ATGTEPASAM PDSVTAALFK GANSRPQIEE EERFDISVNQ VPARDFFIGL VADSGVNVVT HPEVNGVISL DLKNVTLRNV LDVTRDVYGY EYKYNGGIYS IYPSKMRTEL FEINYIDVQR EGSTDTSVLI GEITSSGGNG SNQGGNSSNQ SQGKQSNQDS SSGSRVSTKN KTDFWQDLKL TLSAIVGGEA NGRNVVVNPQ AGLVVVRALP SEISAVREFL DRSELSVRRQ VILETKIVEV KLNDGFQAGV NWNEIRGQML LTKNVETFDV PVDIVSASEN VGEIFSSIFR IGDISQLLSL LETQGNVQVL SSPRVSTVNN QKAVIRVGSD EFFVTGISSQ TTSTAASTTS APNIELTSFF SGISLDVTPQ IADNGDVILH VHPIVSEVTD QNKDITLGNE KFSLPLALRE VRESDSIVRA QSGQIIVLGG LMKEKLNDVY SKRPGLGDVP VLNTLFRKRS KVSEKTELVI LLRPIVVEDN TFADDINQSR QRVNSMSDEY RGR
|
| |