Gene Sde_1135 details

Gene Information

Locus tag: Sde_1135
Symbol:
ID: 3968322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 1469959
End bp: 1471590
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 45%
IMG OID: 637920206
Product: MSHA biogenesis protein MshL
Protein accession: YP_526609
Protein GI: 90020782
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1450] Type II secretory pathway, component PulD
TIGRFAM ID: [TIGR02519] pilus (MSHA type) biogenesis protein MshL
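
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1469959
end_bp = 1471590

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length, "bp")      # 1632 bp
print(remainder)              # 0
print(protein_length, "aa")   # 543 aa
```

Under these assumptions the computed values agree with the reported Gene Length (1632 bp) and Protein Length (543 aa).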


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00664047
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACA AGTGGATTCT ACTAAAAGGC TGCGTGTTAA TTGCGACAGC ACTATTAATT 
AGCTGTTCGT CGGTGCCGCA AACAGAAACC CCTGCAGAGC AAGCGCTGGA AACCGCAAGT
GCAACTGGTA CAGAGCCCGC CAGCGCCATG CCCGATTCAG TTACTGCCGC CTTATTTAAA
GGAGCAAATA GTCGGCCTCA AATAGAAGAG GAAGAGCGGT TTGATATTTC TGTTAATCAA
GTGCCTGCAC GCGACTTCTT TATTGGCTTA GTGGCCGACT CTGGGGTGAA TGTTGTCACC
CACCCAGAAG TTAACGGCGT GATATCGCTC GATTTAAAAA ACGTAACGCT GCGCAATGTG
CTCGATGTTA CTCGCGATGT GTACGGTTAT GAATACAAAT ACAACGGCGG CATTTATAGC
ATTTACCCTA GTAAAATGCG CACAGAATTA TTTGAAATAA ATTATATAGA TGTACAAAGA
GAAGGCTCGA CAGATACTAG TGTGCTAATT GGTGAAATAA CGTCTAGCGG TGGCAATGGG
AGCAATCAGG GCGGTAACAG TAGCAATCAA TCACAGGGTA AGCAGAGTAA TCAAGATAGC
AGCTCAGGTT CTCGTGTATC GACAAAAAAC AAAACAGATT TTTGGCAAGA CTTGAAATTA
ACCTTATCGG CAATTGTGGG TGGCGAAGCC AATGGCCGCA ATGTGGTGGT TAATCCGCAA
GCCGGTTTGG TGGTGGTGCG AGCTTTGCCT AGCGAAATAA GTGCAGTACG AGAATTTCTA
GATCGATCTG AGCTTAGTGT TAGGCGCCAG GTTATTCTCG AAACAAAAAT AGTGGAAGTA
AAGCTAAACG ATGGCTTTCA GGCTGGTGTG AATTGGAATG AAATTCGCGG CCAAATGCTG
CTTACTAAAA ATGTAGAAAC CTTTGATGTA CCTGTAGATA TTGTTAGCGC CAGCGAAAAC
GTAGGGGAAA TATTCTCATC TATATTTCGC ATTGGCGATA TCTCACAGTT GCTATCTTTA
TTAGAAACAC AGGGCAACGT GCAGGTGTTA TCTAGCCCGC GCGTGTCGAC TGTAAACAAT
CAAAAGGCGG TAATTCGTGT AGGCTCTGAC GAGTTCTTTG TTACTGGTAT ATCTAGCCAA
ACTACTTCTA CTGCGGCCTC AACGACCAGC GCCCCCAATA TTGAACTTAC ATCCTTTTTT
AGCGGTATTT CCCTCGATGT TACTCCACAA ATTGCAGATA ACGGCGATGT GATTTTACAT
GTGCACCCAA TCGTGAGTGA GGTAACAGAC CAAAACAAAG ATATAACGCT TGGTAACGAG
AAGTTCTCTT TACCGTTGGC GCTGCGCGAA GTGCGTGAGT CCGACAGTAT TGTTCGCGCT
CAAAGCGGTC AAATTATTGT GCTTGGCGGG TTGATGAAAG AGAAACTTAA CGATGTATAC
AGCAAGCGCC CCGGCCTTGG GGATGTACCT GTACTAAACA CTCTGTTTAG AAAGCGCAGC
AAGGTTTCAG AAAAAACCGA ACTGGTTATT TTGCTGCGCC CAATTGTAGT AGAAGACAAC
ACTTTCGCAG ATGACATTAA TCAAAGTCGT CAGCGTGTTA ATTCCATGTC AGATGAATAC
AGAGGCCGCT AG
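
The gene sequence is printed in space-separated groups of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC fraction, assuming GC content is defined as (G + C) / total bases. The record does not state how its 45% figure was derived, and the function names `clean_sequence` and `gc_content` are hypothetical. Only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for grouping, and upper-case."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, exactly as grouped in the record.
first_line = "ATGAGCAACA AGTGGATTCT ACTAAAAGGC TGCGTGTTAA TTGCGACAGC ACTATTAATT"
seq = clean_sequence(first_line)

print(len(seq))                        # 60 bases on a full line
print(f"{gc_content(seq) * 100:.1f}% GC in the first line")
```

Running the same functions on the full 1632-base sequence would give the value to compare against the reported GC content.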
 
Protein sequence
MSNKWILLKG CVLIATALLI SCSSVPQTET PAEQALETAS ATGTEPASAM PDSVTAALFK 
GANSRPQIEE EERFDISVNQ VPARDFFIGL VADSGVNVVT HPEVNGVISL DLKNVTLRNV
LDVTRDVYGY EYKYNGGIYS IYPSKMRTEL FEINYIDVQR EGSTDTSVLI GEITSSGGNG
SNQGGNSSNQ SQGKQSNQDS SSGSRVSTKN KTDFWQDLKL TLSAIVGGEA NGRNVVVNPQ
AGLVVVRALP SEISAVREFL DRSELSVRRQ VILETKIVEV KLNDGFQAGV NWNEIRGQML
LTKNVETFDV PVDIVSASEN VGEIFSSIFR IGDISQLLSL LETQGNVQVL SSPRVSTVNN
QKAVIRVGSD EFFVTGISSQ TTSTAASTTS APNIELTSFF SGISLDVTPQ IADNGDVILH
VHPIVSEVTD QNKDITLGNE KFSLPLALRE VRESDSIVRA QSGQIIVLGG LMKEKLNDVY
SKRPGLGDVP VLNTLFRKRS KVSEKTELVI LLRPIVVEDN TFADDINQSR QRVNSMSDEY
RGR
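
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below is a minimal, hand-rolled translator and not the tool used by the database. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove 10-base grouping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGAGCAACA AGTGGATTCT ACTAAAAGGC"))  # MSNKWILLKG
print(translate("AGAGGCCGCT AG"))                      # RGR
```

Both outputs match the start (MSNKWILLKG) and end (RGR) of the protein sequence listed above, and the final codon TAG is a stop codon.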