Gene Hore_06030 details


Gene Information

Locus tag: Hore_06030
Symbol:
ID: 7314508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 656041
End bp: 657591
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 42%
IMG OID: 643611033
Product: prepilin-type N-terminal cleavage/methylation domain protein
Protein accession: YP_002508355
Protein GI: 220931447
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4967] Tfp pilus assembly protein PilV
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
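
The start and end coordinates, gene length, and protein length in the record above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon; both are assumptions made for illustration, although they agree with the listed values. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 656041
end_bp = 657591

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1551 0 516
```

A remainder of 0 indicates that the gene length is a whole number of codons, as expected for an unspliced CDS.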


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAT TAAAAAAAGA GAGAGGGTTA TCCTTAATAG AAGTGATGGT TTCCCTGGTT 
ATATTTGCGG TAATTGTTCT TGCTTTTGGT TCCTTTATTA CCTCAAATTA TAAAGGTATC
CAGGAGGCCG GAGAAATGAC CAGGTCAGCC CATGAAAATA AAAAGACAGT AGAGAGAATG
ATTGCTTCAG GACAGGTCTC CCGGGGACAT TCTTTAGAGC TTGACTTCGG GGAAGACAAT
ATTGTGATAG ATGGTGGTAT AGCTGAATCG GGGAAACTAA GAACATTTAT TCCCACGGTC
CCGGCCATTG TCAGTGTTAC TTCAGACCCT GAGTTTCATG TAATGGGTGA AGGACCGGTA
ATTATCGAGG TTGTGGTTAC TACCAGAATG GTACCTGATG ACACCGCGGT TGAAGTTGAG
TTGCGAACCC CTGACGATAT ACTGGTTGAT ACTGCAGTGG GGCAAATTCA GGATAACCAG
GATACTCTTT ATCTTAATGC TGGGGAACAT CTCTCAGATG GCATTTATAA TATTGTAACC
AGGGTAGATG GTATCTGGTC TCCTTTTGTT ATTAATTATG TCATCAGGCC CATTGTTTAT
GTAGTGGTAG GGGAGGATAG CACCGTTCTC TGCTCAAATG ATGGGGAAAA CTGGACAGAC
CACAGTGAAG AGTTGCCGGT AGATGGTGTT GATTTAAATG CTATTATCTG GGGTGGACGT
CCTGATGACC GGAAATTTAT TATAGTCGGG GATGATGGTT ATATATTTAC CTCGGAGGAC
GGGGTTAACT GGCAGGAGGA AATAACTCCT ACTGGTTCGG ACCTCTATGA TATTTGCTGG
GCTAAAGAAA GGTATCTCGC CGTGGGTGAA GGGGGTATAA TTCTTACTTC CGACAGTGGT
ACTGACTGGA ATAAGATATC TTTTGATGAT AATGTTAACC TCTATGGGGT TACATATGGT
GGTACTTCAG AAGATAGCTT TTCAGTAGCA GTTCCGGAGG CAACTCCGAA TTATACTGTT
GTTAAAATTG AAGGGGAAGA CCCTACCAAA AAAAATCTGA CACCATCAGA TAATCTTTAC
AGTGCTACCT GGGGAAGTCT ACCCTCCAGT GGAGAAGGAG TGTTTATGGC TGCCGGTGTC
CAGGATATTA TCAGTTTTGA TCATAATATT AAACTATTGA CCGATAATGG TTATTATAAT
GAGGATTATA TTTTCAATGA TATTGTTCCG GCTTTAATAG CAGAGACCAG TACTTTTCTG
GCAGCCGGTT CTGATGGGAA AGATGGTGTA ATTATGATAT TAAGGAAAGT GGATAGTGGT
GGACTTATCT GGGATTACCT GCATAATGTT GACGAGCTTC CTGAAATCCC CTCAAATCTG
GCAGGTTTTG ATGCAATAGT CTGGTTTAAT GATAGATTAG TAGCCACCGG TGTTAATAAA
AGCGGGAGAG AAGTAATTAT TAATCTCCAT TATAACGGGG ATAGCTGGGA ATGGCAGGAT
GTTTATACCG GTAGTGGGTA TGTGAGACTA AATGATGTGG TGGCCCGGTA G
 
Protein sequence
MKLLKKERGL SLIEVMVSLV IFAVIVLAFG SFITSNYKGI QEAGEMTRSA HENKKTVERM 
IASGQVSRGH SLELDFGEDN IVIDGGIAES GKLRTFIPTV PAIVSVTSDP EFHVMGEGPV
IIEVVVTTRM VPDDTAVEVE LRTPDDILVD TAVGQIQDNQ DTLYLNAGEH LSDGIYNIVT
RVDGIWSPFV INYVIRPIVY VVVGEDSTVL CSNDGENWTD HSEELPVDGV DLNAIIWGGR
PDDRKFIIVG DDGYIFTSED GVNWQEEITP TGSDLYDICW AKERYLAVGE GGIILTSDSG
TDWNKISFDD NVNLYGVTYG GTSEDSFSVA VPEATPNYTV VKIEGEDPTK KNLTPSDNLY
SATWGSLPSS GEGVFMAAGV QDIISFDHNI KLLTDNGYYN EDYIFNDIVP ALIAETSTFL
AAGSDGKDGV IMILRKVDSG GLIWDYLHNV DELPEIPSNL AGFDAIVWFN DRLVATGVNK
SGREVIINLH YNGDSWEWQD VYTGSGYVRL NDVVAR
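
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotide blocks codon by codon. The following is a minimal sketch, not the database's own method: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to print blocks of ten."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
first_bases = "ATGAAATTAT TAAAAAAAGA GAGAGGGTTA"
last_bases = "GTGGCCCGGTA G"

print(translate(first_bases))  # MKLLKKERGL
print(translate(last_bases))   # VAR
```

The two outputs match the first ten and last three residues of the protein sequence; passing the full gene sequence to `translate` extends the same check to the whole protein.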