Gene Hhal_0329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0329 
Symbol 
ID4711275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp372323 
End bp373939 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content70% 
IMG OID639854789 
Productpilus (MSHA type) biogenesis protein MshL 
Protein accessionYP_001001925 
Protein GI121997138 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID[TIGR02519] pilus (MSHA type) biogenesis protein MshL 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.683129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAC CACTGAGGCC CGTGCGCCTG TGGCTGGGTG TCGCCGGCGC CGCCGTGCTC 
CTGGGGGCGT GTGCGTCATC GGGGGATCGG GCGCAGGAGC GCTCCGATCG CGCCGCCGAT
CACCTGGCGC CGACGGAGCC CGACGAGCAG GTCCGCCAGG AGGATCAAGC CTCCACGCAG
GCCCTGGACG ACGCGCTCGG TGCCCCCGGC CTGGGTGATC CGCTGGCGCT TCAGCCCGAG
GATCCGCGCT TTGACATCAC CGCCGACGGC GTCGGTGCCC GCGAGTTCTT CCACGGCCTG
GTGGAGGACA CCCCGTACAA CGTGGTGGTC CACCCGGATC TGGAGGGCGA GCTGTCGCTG
ACCCTGCGCG ACGTCTCCGT CCCCGAGGTG ATGGACACCG TCCGCGAGAT CTACGGCTAC
GAGTACAAGC AGGCGCGCAC CGGCTTCCTC ATCCTGCCCG CCCGCCCGCG GGCCGAGGTC
TTCCACCTGG ACTACCTCAA CGTCCATCGT AGCGGCCACT CCGGCACGCG GGTGACCTCC
GGCGAGATCA CCGGCGAGGA CGACGGCGAC GGCGTGATCG GCAGCCGGGT GGATACCGCC
TCGAACTCGG ATCTGTGGGG GCAGGTCGAG GAGACGGTGA GCCGCATGAT CGACGACGAC
GACACCGCCT CGGTGGTGGC CAGCCCCCAG GCGGGGACCC TGGCCGTGCG CGGCATGCCC
GAGTCCCTGC GCCGCGTCGA GGCGTTTGTC GACCGTCTGC AGGGCAGCCT CAACCGGCAG
GTGATCCTCG AGGCGCGCAT CCTCGAGGTC GAGCTCGGCG ACGACTTCCA GGCCGGGATC
AGCTGGGAGT CGCTCGGCCG GCACGACGGC CAGGCCCTCG AGGGCTCCTT CCGCCCGGGC
GATAACCTGG AGCTGAGTTC GGCCGGGGTG TTCAACATCG GGATCACCCG GGGCCAGGAC
GGGCAGCGGG GCTTCTTCGA GGGCTTCCTG CGTGCCCTGG AGCAGCAGGG CGACGTCCAG
GTCCTCTCGA CGCCGCAGGT CTCCACGCTC AACAATCAGA AGGCGGTGAT CAAGGCCGGA
ACGGACTCCT TCTATCAGAC GGACTTCAGC ATCAACTACC GGACCGTGGA GGTGGGGGGG
CAGACCACCA CCCAGCCGGA GCTGGATCCC GACTTCGAGC CGTTCTTCTC CGGCATCGCC
CTGGACGTCA CCCCGCAGAT CGAGGAGAGC GGCTGGATCA ACCTGCACGT CCAGCCCTCG
GTGACCGAGG TGCGCGAGCG GGAGCGCACC CTGAGAACCG GCCGGGGCGA TGACGAGGTC
CGCTTCTCGC TGGCCGAGAG CGACGTCCGC CAGTCCGACT CCATCGTCCG CGCCCGCAGC
GGCGAGATGA TCGTCATCGG CGGTCTGATC GAGGAGCGCG AGCAGCAGGA GACCTCCCGG
GTGCCGCTGC TCGGGCGGGT GCCGCTGCTG GGCTGGCTGT TCACCCAGGA GCGACAGGAG
TCGAGCAAGT ACGAGCTGGT GATCCTGCTG CGCCCCCGGG TCGTCGGGGA GGACACCTGG
GCCGGCGAAC TCGAGGAGCA CTCCCGGCGC ATCCAGGGGC TGTACGACCA CTACTGA
 
Protein sequence
MNAPLRPVRL WLGVAGAAVL LGACASSGDR AQERSDRAAD HLAPTEPDEQ VRQEDQASTQ 
ALDDALGAPG LGDPLALQPE DPRFDITADG VGAREFFHGL VEDTPYNVVV HPDLEGELSL
TLRDVSVPEV MDTVREIYGY EYKQARTGFL ILPARPRAEV FHLDYLNVHR SGHSGTRVTS
GEITGEDDGD GVIGSRVDTA SNSDLWGQVE ETVSRMIDDD DTASVVASPQ AGTLAVRGMP
ESLRRVEAFV DRLQGSLNRQ VILEARILEV ELGDDFQAGI SWESLGRHDG QALEGSFRPG
DNLELSSAGV FNIGITRGQD GQRGFFEGFL RALEQQGDVQ VLSTPQVSTL NNQKAVIKAG
TDSFYQTDFS INYRTVEVGG QTTTQPELDP DFEPFFSGIA LDVTPQIEES GWINLHVQPS
VTEVRERERT LRTGRGDDEV RFSLAESDVR QSDSIVRARS GEMIVIGGLI EEREQQETSR
VPLLGRVPLL GWLFTQERQE SSKYELVILL RPRVVGEDTW AGELEEHSRR IQGLYDHY