Gene Hhal_0326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0326 
Symbol 
ID4711080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp368318 
End bp370156 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content69% 
IMG OID639854786 
Producttype II secretion system protein E 
Protein accessionYP_001001922 
Protein GI121997135 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.038954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGC AGAAGATTCG CCTCGGCGAT CTGCTCATCA AGCGCGGGGT GATCACCCAG 
GCGCAGATGC AGCAGGCCCT GGAAGCGCAG AAGCAGAGCG GCCGCAAGCT CGGTGCGCAG
CTCATCGCCA TGGACTTCGT CACCGAGGAC CGGATCCTCG CCGAACTCTC CCAGCAGCTC
GACGTGCCGT GGGTCAACCC CAACGACTAC TGGGTGGACC CGGATGTCGC CAGCAAACTG
CCGGAGAGCT ACGCCCGGCG CTTCTCCGCG CTGATCCTCG AGGAGACCGA GACCGACTTC
CTGGTGGCCA TGGCCGACCC CAGCGATCTC ATCGCCTACG ACGAGATCAC CCGGGTCCTG
CGCTGCCCGG TGCGCCTGGC CGTGGCCCGC GAGAAGACTC TCCAGGAGCT GATCAACCTC
GTCTACCGCC GCACCGACGA GATCAGCAAC ATCGCCGAGG AGCTCGGCCG CGAGATCACC
AGCGCCGACG ACATCGACCT CTCCACCCTG CCGGTGGGCG AGGGCGGCGC CAGCGCCCCG
GTGGTCCGGC TGCTCCAGTC GCTGTTCGAG GACGCGGTGC AGGTGGGCGC CTCGGACATC
CACATCGAGC CGGACGAGCA ATCCCTGCGC ATCCGCATCC GCCGCGACGG CGTCCTGCAG
GAGCAGCTCT TTCGCCAGCG CAATATCCAC GCCGCCATGG TCTCGCTGCT CAAGCTCATG
TGCGGGCTGA ATATCACCGA GCGCCGCCTG CCCCAGGATG GCCGCTTCCA GGTCCGCGTC
CACGGCCGCA GCATCGATGT GCGCCTGTCG ACCCTGCCGC TGCAGCACGG CGAGGGCGTG
GTCATGCGCC TGCTCGACCA CAGCGCCGGG GTCAGCTCAC TGGATCAGAC CGGCATGCCG
GCCGACATCC TTCAGCGCCT GCGCCGGCTG ATCAAGATGC CCTACGGCAT GGTGCTGGTC
ACCGGCCCCA CCGGTTCCGG CAAGTCCACC ACGCTCTACG GGGCGCTGTC CGAGATCAAC
CGCCCCGAGG TCAAGGTCAT CACCGTCGAG GACCCGGTGG AGTACCGCCT GCCGCGGGTC
AACCAGGTTC ATGTCCGCGA GAGCATCGGG CTGACCTTCG CCCGCGTGCT GCGCACCACC
CTCCGCCAGG ACCCGGACAT CATCATGGTC GGCGAGATGC GCGACGAGGA GACCGCCGAC
ATCGGCTTCC GCGCCGCCAT CACCGGCCAC CTGGTCTTCT CAACGCTGCA CACCAACGAC
GCCGTCTCCA CCGCCAACCG CCTGGTGGAT ATGGGCGTAG AGCCGTACAT GATCGCCGCC
GGCCTGCGCG CCGTGCTCGC CCAGCGCCTG CTGCGGCGGA TCTGCACCCA GTGCCGCGAG
CCCTACACCC CGGACGCCAG CGAACGGGCC TGGCTGCGCA CCATCTTCGG CGCCGACGAG
ACGGAATCCC TGTCGCTCTA CCAGGGCCGG GGCTGCGCCA GCTGCAGCAA GACCGGCTAC
AGCGGCCGCA TCGGCGTCTT CGAACTGTTG GAGATGGACG CCGATAAGAT CGACGCCCTG
CGCCGCGGCG ACCAGTCCGG GTTCGCCGCC GCCTGCCACG CCGACCTCAG CTACGAGCCC
ATGAGCCGGG TGGCCCTGCG CTACGCCCGC GAGGGCATCA CCACCCTCCA GGAGGTCTCC
CGCGTCCTCG GCGAGGCCGA CGAGAGCGCC CTGCGCGATG AGGTCCGGGC CGTGCGCGAG
GCGGCGCCGA CGGACCCCTC CGAGGCGGAG GGGGTGCTCA ACGACCCGGC GTCCACGGAC
ACCGACAGCG CCTCGACCGA GGCGTGGCGC CAGGGCTAG
 
Protein sequence
MKRQKIRLGD LLIKRGVITQ AQMQQALEAQ KQSGRKLGAQ LIAMDFVTED RILAELSQQL 
DVPWVNPNDY WVDPDVASKL PESYARRFSA LILEETETDF LVAMADPSDL IAYDEITRVL
RCPVRLAVAR EKTLQELINL VYRRTDEISN IAEELGREIT SADDIDLSTL PVGEGGASAP
VVRLLQSLFE DAVQVGASDI HIEPDEQSLR IRIRRDGVLQ EQLFRQRNIH AAMVSLLKLM
CGLNITERRL PQDGRFQVRV HGRSIDVRLS TLPLQHGEGV VMRLLDHSAG VSSLDQTGMP
ADILQRLRRL IKMPYGMVLV TGPTGSGKST TLYGALSEIN RPEVKVITVE DPVEYRLPRV
NQVHVRESIG LTFARVLRTT LRQDPDIIMV GEMRDEETAD IGFRAAITGH LVFSTLHTND
AVSTANRLVD MGVEPYMIAA GLRAVLAQRL LRRICTQCRE PYTPDASERA WLRTIFGADE
TESLSLYQGR GCASCSKTGY SGRIGVFELL EMDADKIDAL RRGDQSGFAA ACHADLSYEP
MSRVALRYAR EGITTLQEVS RVLGEADESA LRDEVRAVRE AAPTDPSEAE GVLNDPASTD
TDSASTEAWR QG