Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_4576 |
Symbol | mshL |
ID | 3520701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | - |
Start bp | 4829825 |
End bp | 4831654 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637287016 |
Product | MSHA biogenesis protein MshL |
Protein accession | YP_271224 |
Protein GI | 71280237 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02519] pilus (MSHA type) biogenesis protein MshL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA CAAGTCCCCC TATGAATAAA CTAAAATTAA ACATTGCTTT GTCAGCGTTT TTCTTTGCCT TAGTAGGCTG TCAATCTATG CCGAATGATC CTGTAGACAT AAAAGCAGAT TTAGATGCGT CGATCAAAGA AACTAAACGT TTAAATGGAC CAAAGGCCTT AACGCAAGTG CCAAATTCAG TACAGCAAGA GTTAATGCTA AATAACATGG ATCAGGCTAA GCAAGGCATG TTAGCTGAAA AACGCTTGGA GGTCTCAGCG ACAGAAGTTG ATGCCAAAGA CTTTTTTCAA GCCATTGTTA GAGGCTCACG TTATAACGTA GTTATTCATC CAGATGTCAC AGGACAAATA TCTTTAAGCT TGAATAATGT TACGTTATCA GAGGCCTTAA CCGTCGTTGA AGATGTCTAT GGTTATGAAA TTATTCGCCG TGGCAATGTT GTAAAAGTTT TCCCGCCGGG CATACGTACT GAAACTATCG CACTAAATTA CTTATTCCTA AAACGTTTTG GTTCATCTAG TACTACTATC AATTCTGGTG GTGTTTCTGA AAACGATCCC AACAGTGGCA ATAGCAGTAA CGGCAATAGC AGCAATAGCA ATAATAGTAA TAGCGGCAGT AACAACAACC AAAGTGGTAA TAATGGCTCG AGTAATCAGA ATAGTGGCAT TAACTTGTAC ACCGAAAATG AGTCTAATTT CTGGGATGAA TTGAAAGAAT CATTAACCGC TTTCGTTGGC ACTGGGGAAG GTCGTTCAGT GATTGTTTCA CCTCAAGCAG GTTTAGTGAC AGTGCGTGCA TTACCACAAG AGCTTACCGC GGTTAAGAAG TTTATAACAG CTACTGAAAG TCATTTACAT CGCCAAGTCA TTATTGAAGC GAAGATAATG GAAGTGACTC TTAATGATGA TTTCCAACAG GGGATTAAGT GGAACAAAGT GCTTGATCAA GTCGGTAGTG CTGACATTAT TTACTCAACT ACAGGTAATG TTGTCGGTAA TGTTATTTCA AACACAATCG GTGGTGTTAA TAGTATAAAT TTTAGTAAAC AAACGAGTGG TAGCGATTTT TCAGGTGTTA TTGAACTTTT ACAAACACAA GGCAATGTTC AGGTACTTTC TAGTCCGCGT ATAACCGCCA CTAATAACCA AAAAGCGGTT ATTAAAGTAG GTGAAGATGA GTACTTTGTT ACTGAGGTTT CCAGCACTAC CACGACCGGT ACTTCAACAA CGACTACACC AGAGGTTGAA TTAACCCCAT TTTTCTCGGG TATTGCACTA GACGTCACAC CACAAATTAG TAAAGACGGC AGTGTTATTT TACATGTTCA TCCTTCAGTG ACTATTACTG AAGAACAAAG CAAAACTATT AAAATTGGTG ACACAACACT GGTATTGCCA CTAGCACAAA GTAGCGTTCG CGAATCAGAT ACCATTATCC GTGCTAACTC AGGGGAAGTT GTTGTTATTG GTGGCCTGAT TGAAACCTAT AACATTGATA TTGAATCTAA AACACCGATA TTAGGTGATA TCCCTTATCT TGGTGAGTTA TTTAAAAACA AGTCGCAAAA ATCGCAAAAA CGTGAATTAG TTATCATGCT AAAGCCCATT GTTGTTGGTC AAGATACTTG GAAAAACCAA TTACAAGATG CGCGCAGTTT GTTAACAAAA TGGTTTCCAG AAGAGGTTGA CGCGTGCGAG TTATCAAATG ACGGTACACT AACGCAAGAA TGTCAAGCGC AATGTGCTGA TGCAGAATAT GCAACAGAGC ATGGTGTTTG CCAAATGGCT GCACAAAATG CAGCTAGTAA TGATAATTAA
|
Protein sequence | MIKTSPPMNK LKLNIALSAF FFALVGCQSM PNDPVDIKAD LDASIKETKR LNGPKALTQV PNSVQQELML NNMDQAKQGM LAEKRLEVSA TEVDAKDFFQ AIVRGSRYNV VIHPDVTGQI SLSLNNVTLS EALTVVEDVY GYEIIRRGNV VKVFPPGIRT ETIALNYLFL KRFGSSSTTI NSGGVSENDP NSGNSSNGNS SNSNNSNSGS NNNQSGNNGS SNQNSGINLY TENESNFWDE LKESLTAFVG TGEGRSVIVS PQAGLVTVRA LPQELTAVKK FITATESHLH RQVIIEAKIM EVTLNDDFQQ GIKWNKVLDQ VGSADIIYST TGNVVGNVIS NTIGGVNSIN FSKQTSGSDF SGVIELLQTQ GNVQVLSSPR ITATNNQKAV IKVGEDEYFV TEVSSTTTTG TSTTTTPEVE LTPFFSGIAL DVTPQISKDG SVILHVHPSV TITEEQSKTI KIGDTTLVLP LAQSSVRESD TIIRANSGEV VVIGGLIETY NIDIESKTPI LGDIPYLGEL FKNKSQKSQK RELVIMLKPI VVGQDTWKNQ LQDARSLLTK WFPEEVDACE LSNDGTLTQE CQAQCADAEY ATEHGVCQMA AQNAASNDN
|
| |