Gene Hore_05970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_05970 
Symbol 
ID7314502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp648991 
End bp650667 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content42% 
IMG OID643611027 
ProductTfp pilus assembly protein PilB 
Protein accessionYP_002508349 
Protein GI220931441 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGAA CACATATAAA GAAATTAGGA GAATTATTAC TTGATTTCAA TTTTATTACT 
GAAAAACAGC TTAATGAGGC CCTTAAAAAA CAGAATAAGT CAGGAAAAAA ACTGGGAGAA
ATCCTGGTTG AATCAGGCTA TCTCAATGAA AATGATTTAA TACAGGTCCT GGAGTTTCAG
CTGGGTATTC CCCATGCTGA CCTGAACAAA TATGTAATCA ACCCTCATCT GGCCCAGTAT
ATACCTGAAA ATATAGCCCG GCGTCATAAT GTTGTTCCCC TTGAAAAGAA AAATGGCAAG
CTTAAGGTTG CCATGGTTGA TCCAACCAAT CTTGTCGCCA TAGAGGATAT TGAAATGACT
TCAGGGTTAA AGGTAGAACC CCTGATTGCT TCCCGGAAGA ATATAAAAAT GGCTTTAAAC
CAGATTTATT CAGTTAATGA TTCAGATGCG GCCGAAGTAT TTGCCAGCTT AAATGAGGTT
ACTACCAAAA CTAATGAAGA ACCTGAATTA AATGAATTAA AAGAAATGAT AGAGGATGCC
CCTATCGTTA GACTGGCCAA TTTAATTATT AATCAGGCTA TCCAGATGAA GGCCAGTGAT
ATTCACATTG AACCCCAGGA GGACCAGGTC AGGGTCAGGT ACAGGGTTGA TGGTGTTTTA
CGGGAAAATA TGACGGTTCC CAAACACAGT CAGGCGGCCC TGATTTCAAG GTTGAAGATA
ATTGCTGACC TTGATATTAC CGAGAGAAGG GTTCCCCAGG ATGGTAGGAT AGAGCTCAAT
GTCAGCGGGG TCAAAATAGA TATGAGGGTT TCAACCCTTC CCACAGTTTA TGGGGAAAAG
GTTGTTATCA GACTTTTAAA TAAAGAAGAG AAACTATTAC AGATTGAACA ACTGGGATTC
AGCGAGACCA ATTTAAGCAG GTTTATGAAA CTTATTAAGC AACCCCACGG AATTATCCTG
GTTACCGGAC CCACCGGAAG TGGTAAATCA ACTACCCTTT TTGCTGCTTT GAATAAACTC
AATACACCTG AAAAGAACAT AATTACTGTT GAGGATCCGG TTGAATACCA GCTTCGTGGT
ATTAATCAGG TTCAGGCCAA TTCCAGGGTT GGGTTAACTT TCGCCAGTGC CTTACGGTCA
ATCTTGAGGC AGGACCCGGA TATCGTAATG GTCGGTGAGA TCAGGGATGA GGAGACGGCT
CGCATAGCGG TCCGGGCTGC CCTGACCGGA CATCTGGTTC TGAGTACCCT CCATACCAAT
GATGCCGTAA GCTCAGTTAC GCGACTTATT GATATGGGCA TTCCACCTTA TCTGGTGGCC
TCATCTGTAA TCGGGGTTGT GGCTCAGAGA CTGGTCCGGA GGTTATGTAC CTGTAAGGAA
GAGTATGTTC CTGGACCAGA AGAAATGGAA TTTCTCCAGG TTAATGATAT TGGAAAATTG
CAACGTCCTG AAGGCTGTAA AAAATGCCAT TCTACAGGCT ACAGGGGTCG TCTTCCGGTC
CATGAAATAT TAATTATGGA TAGGAAATTG AGAGAAATGA TTGTAAATGG AGAGGGGGAA
TCTGTTATAA AGAAATATGC CCGGAAAGCA GGAATGCTAA CTTTAAAAGA AGATGGGGTT
AATAAGGTAA TAGAAGGGTT AACTTCATAT GAAGAACTGG CAAGGGTAGT TAGTTAG
 
Protein sequence
MTRTHIKKLG ELLLDFNFIT EKQLNEALKK QNKSGKKLGE ILVESGYLNE NDLIQVLEFQ 
LGIPHADLNK YVINPHLAQY IPENIARRHN VVPLEKKNGK LKVAMVDPTN LVAIEDIEMT
SGLKVEPLIA SRKNIKMALN QIYSVNDSDA AEVFASLNEV TTKTNEEPEL NELKEMIEDA
PIVRLANLII NQAIQMKASD IHIEPQEDQV RVRYRVDGVL RENMTVPKHS QAALISRLKI
IADLDITERR VPQDGRIELN VSGVKIDMRV STLPTVYGEK VVIRLLNKEE KLLQIEQLGF
SETNLSRFMK LIKQPHGIIL VTGPTGSGKS TTLFAALNKL NTPEKNIITV EDPVEYQLRG
INQVQANSRV GLTFASALRS ILRQDPDIVM VGEIRDEETA RIAVRAALTG HLVLSTLHTN
DAVSSVTRLI DMGIPPYLVA SSVIGVVAQR LVRRLCTCKE EYVPGPEEME FLQVNDIGKL
QRPEGCKKCH STGYRGRLPV HEILIMDRKL REMIVNGEGE SVIKKYARKA GMLTLKEDGV
NKVIEGLTSY EELARVVS