Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0329 |
Symbol | |
ID | 4711275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 372323 |
End bp | 373939 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854789 |
Product | pilus (MSHA type) biogenesis protein MshL |
Protein accession | YP_001001925 |
Protein GI | 121997138 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02519] pilus (MSHA type) biogenesis protein MshL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.683129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAC CACTGAGGCC CGTGCGCCTG TGGCTGGGTG TCGCCGGCGC CGCCGTGCTC CTGGGGGCGT GTGCGTCATC GGGGGATCGG GCGCAGGAGC GCTCCGATCG CGCCGCCGAT CACCTGGCGC CGACGGAGCC CGACGAGCAG GTCCGCCAGG AGGATCAAGC CTCCACGCAG GCCCTGGACG ACGCGCTCGG TGCCCCCGGC CTGGGTGATC CGCTGGCGCT TCAGCCCGAG GATCCGCGCT TTGACATCAC CGCCGACGGC GTCGGTGCCC GCGAGTTCTT CCACGGCCTG GTGGAGGACA CCCCGTACAA CGTGGTGGTC CACCCGGATC TGGAGGGCGA GCTGTCGCTG ACCCTGCGCG ACGTCTCCGT CCCCGAGGTG ATGGACACCG TCCGCGAGAT CTACGGCTAC GAGTACAAGC AGGCGCGCAC CGGCTTCCTC ATCCTGCCCG CCCGCCCGCG GGCCGAGGTC TTCCACCTGG ACTACCTCAA CGTCCATCGT AGCGGCCACT CCGGCACGCG GGTGACCTCC GGCGAGATCA CCGGCGAGGA CGACGGCGAC GGCGTGATCG GCAGCCGGGT GGATACCGCC TCGAACTCGG ATCTGTGGGG GCAGGTCGAG GAGACGGTGA GCCGCATGAT CGACGACGAC GACACCGCCT CGGTGGTGGC CAGCCCCCAG GCGGGGACCC TGGCCGTGCG CGGCATGCCC GAGTCCCTGC GCCGCGTCGA GGCGTTTGTC GACCGTCTGC AGGGCAGCCT CAACCGGCAG GTGATCCTCG AGGCGCGCAT CCTCGAGGTC GAGCTCGGCG ACGACTTCCA GGCCGGGATC AGCTGGGAGT CGCTCGGCCG GCACGACGGC CAGGCCCTCG AGGGCTCCTT CCGCCCGGGC GATAACCTGG AGCTGAGTTC GGCCGGGGTG TTCAACATCG GGATCACCCG GGGCCAGGAC GGGCAGCGGG GCTTCTTCGA GGGCTTCCTG CGTGCCCTGG AGCAGCAGGG CGACGTCCAG GTCCTCTCGA CGCCGCAGGT CTCCACGCTC AACAATCAGA AGGCGGTGAT CAAGGCCGGA ACGGACTCCT TCTATCAGAC GGACTTCAGC ATCAACTACC GGACCGTGGA GGTGGGGGGG CAGACCACCA CCCAGCCGGA GCTGGATCCC GACTTCGAGC CGTTCTTCTC CGGCATCGCC CTGGACGTCA CCCCGCAGAT CGAGGAGAGC GGCTGGATCA ACCTGCACGT CCAGCCCTCG GTGACCGAGG TGCGCGAGCG GGAGCGCACC CTGAGAACCG GCCGGGGCGA TGACGAGGTC CGCTTCTCGC TGGCCGAGAG CGACGTCCGC CAGTCCGACT CCATCGTCCG CGCCCGCAGC GGCGAGATGA TCGTCATCGG CGGTCTGATC GAGGAGCGCG AGCAGCAGGA GACCTCCCGG GTGCCGCTGC TCGGGCGGGT GCCGCTGCTG GGCTGGCTGT TCACCCAGGA GCGACAGGAG TCGAGCAAGT ACGAGCTGGT GATCCTGCTG CGCCCCCGGG TCGTCGGGGA GGACACCTGG GCCGGCGAAC TCGAGGAGCA CTCCCGGCGC ATCCAGGGGC TGTACGACCA CTACTGA
|
Protein sequence | MNAPLRPVRL WLGVAGAAVL LGACASSGDR AQERSDRAAD HLAPTEPDEQ VRQEDQASTQ ALDDALGAPG LGDPLALQPE DPRFDITADG VGAREFFHGL VEDTPYNVVV HPDLEGELSL TLRDVSVPEV MDTVREIYGY EYKQARTGFL ILPARPRAEV FHLDYLNVHR SGHSGTRVTS GEITGEDDGD GVIGSRVDTA SNSDLWGQVE ETVSRMIDDD DTASVVASPQ AGTLAVRGMP ESLRRVEAFV DRLQGSLNRQ VILEARILEV ELGDDFQAGI SWESLGRHDG QALEGSFRPG DNLELSSAGV FNIGITRGQD GQRGFFEGFL RALEQQGDVQ VLSTPQVSTL NNQKAVIKAG TDSFYQTDFS INYRTVEVGG QTTTQPELDP DFEPFFSGIA LDVTPQIEES GWINLHVQPS VTEVRERERT LRTGRGDDEV RFSLAESDVR QSDSIVRARS GEMIVIGGLI EEREQQETSR VPLLGRVPLL GWLFTQERQE SSKYELVILL RPRVVGEDTW AGELEEHSRR IQGLYDHY
|
| |