Gene Hore_18700 details


Gene Information

Locus tag: Hore_18700
Symbol:
ID: 7312684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 1997576
End bp: 1999120
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 41%
IMG OID: 643612317
Product: type II secretion system protein E
Protein accession: YP_002509614
Protein GI: 220932706
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
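
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1997576, 1999120)
print(length)                  # 1545
print(protein_length(length))  # 514
```

Both results match the Gene Length and Protein Length fields reported for Hore_18700.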


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAGA CAAACATGGA TTACGAAAAT ATTAATAGTC TAGAGGTCAG TGAGATAAAT 
AATTTTTATA TAGACAATAA GTTAATCAGA AATTTCCCTG GCTCAACTTT AAAAGAAAAC
AGGCTAATTC CCCTGACCAG ACATGGTGAC ACTGTAGTAG TTATTTCCGA TAATTACCCT
CCCCCTGAGG CAATCAATGA CCTGGAAGTA TATACAGGGT TTAAAATAGA TGTCAGGTTA
GAAAAATCTC ATGTGGTTCA ACAGCTTGTA AATGAACACC TTAAGGCTCC GGTGGATACA
GTAGAAGATA TGCTTGATGA TAATAATTTG CGGGGTCTGG ATGATTTCAA GATCCTAAAA
ATTGATAGCC GGGTTGAAAA CCTGGAGGAT TTAGCCCAGG AAGCCCCAAT AATCAGACTG
GTTAATGCCA TCATCACTAC TGCCTTGAAG AAGGGAGCAA GTGATATCCA TATTGAACCC
TTTGAAGACA AATTAAGGTT AAGGTACAGA ATAGACGGTG TTTTATATGA GAATCCTGCC
CCGCCACTGG AGTTGTTACC GGCTATCATT ACCAGGATAA AAATTATGTC TGAATTAAAT
ATTGCTGAAA GACGGCTTCC TCAGGAAGGT AGAATCAGAA TCAGGGTATC GGGTCGGGAG
CTTGATATAA GGGTGTCTAT TATTCCTGCT CTCCATGGGG AAGGAGTTGT CCTCAGGTTA
CTTGATAAAG CAGCCCGGTT ACTTGATATC AAAAACCTCG GCTTTAGCGA GGCTATGTTA
AAAAGGTATT TGAACCTTAT TAATATTCCT CATGGCATTA TTCTGGTGAC AGGTCCGACC
GGAAGTGGTA AGACAACAAC CCTTTATGCA ACCTTACAGT ATTTGAATTC ATCAAGTCGA
AAGATAATTA CCATTGAAGA CCCGGTTGAA TACCAGTTAG AGGGAATTAA CCAGATACAG
GTAAAACCGG AGATTAACTT CGATTTTGCC TCTGGTCTAC GATCTATTTT GCGACATGAT
CCCGATATTA TTATGATAGG TGAAATCAGA GATGTGGAAA CAGCTAAAAT TGCCATTCAG
GCAGCCCTGA CAGGCCATCT GGTCCTGGCA ACTCTCCATA CCAATGATGC TGCCGGTGCT
GTCTCGAGAT TACTTAATAT GGGGGTTGAA GATTACCTGC TGGCAGCAAC CCTGAAAGGG
ATACTGGCCC AACGTCTTGT CAGGGTTTTA TGCCCCAGGT GTAAGGAGTC CTATCAACCT
ACATCTTCAG AAGTTAACCT GATTGATGAT GATGTAGAGT TATTATACCG TCCGGCGGGT
TGTACCTTCT GTAATAATAT AGGTTTTAAG GGGCGGACGG GTATTTATGA ATTATTAACT
GTAACCCCCC AGATTGAATC AATGATTGTC CAGAGGGCCA GCTCAAGTGA GATTAAAGAA
GAGCTTAAAA AAACAGGTTA TACCAGTCTG TTCACCGACG GTTGTATCAA GGTAAAAGAC
GGTTTAACTT CCATAGATGA AGTGGTCAGG GTTACAACCC AGTAA
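
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be stripped before any computation. As a minimal sketch, the helpers below join the blocks and compute the GC fraction; the function names are hypothetical and not part of the source database. Run on the full sequence, the result can be compared against the GC content of 41% given in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = join_blocks(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence: 3 G, 0 C out of 10 bases.
print(gc_fraction("ATGGATAAGA"))  # 0.3
```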
 
Protein sequence
MDKTNMDYEN INSLEVSEIN NFYIDNKLIR NFPGSTLKEN RLIPLTRHGD TVVVISDNYP 
PPEAINDLEV YTGFKIDVRL EKSHVVQQLV NEHLKAPVDT VEDMLDDNNL RGLDDFKILK
IDSRVENLED LAQEAPIIRL VNAIITTALK KGASDIHIEP FEDKLRLRYR IDGVLYENPA
PPLELLPAII TRIKIMSELN IAERRLPQEG RIRIRVSGRE LDIRVSIIPA LHGEGVVLRL
LDKAARLLDI KNLGFSEAML KRYLNLINIP HGIILVTGPT GSGKTTTLYA TLQYLNSSSR
KIITIEDPVE YQLEGINQIQ VKPEINFDFA SGLRSILRHD PDIIMIGEIR DVETAKIAIQ
AALTGHLVLA TLHTNDAAGA VSRLLNMGVE DYLLAATLKG ILAQRLVRVL CPRCKESYQP
TSSEVNLIDD DVELLYRPAG CTFCNNIGFK GRTGIYELLT VTPQIESMIV QRASSSEIKE
ELKKTGYTSL FTDGCIKVKD GLTSIDEVVR VTTQ
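
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two differ only in permitted start codons). It reads the sequence in frame from its first base and stops at the first stop codon; this is a simplified illustration, not the pipeline that produced this record, and it does not handle alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGATAAGA CAAACATGGA TTACGAAAAT"))  # MDKTNMDYEN
```

The output matches the first ten residues of the protein sequence above.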