Gene Hore_06040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_06040 
Symbol 
ID7314509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp657612 
End bp659366 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content39% 
IMG OID643611034 
Productprepilin-type N-terminal cleavage/methylation domain protein 
Protein accessionYP_002508356 
Protein GI220931448 
COG category 
COG ID 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTTCAA GGTTTTTTAC CAGCAATAAA GGATTAACCC TTATTGAGAT ATTAGTGGCC 
CTGGCTCTGG TAGGGATTGC ATCTACAGCT ATCTATGGCT TTTTAAATTT TACTGTTAAT
AACTATCAGG ATGGAGAAGA AAAAATAAGG GTTCAGGATT ATACCAGACT GGTAGCGGAT
GAGATTACTG AAGAACTGAG GGGAGTTACT TCAGCAAGAC TAATAGAAGA TGTAGACAGT
AGTGAAATTG ATTCAGAGAA TTATAACTAT ATGTATGTAA AACCGGATAT TGATCTGGTG
ATAAAGAGGG TAAAAAACGA ACTGGGTCTT ATAGTTGAAG AAAAAATCCC CGGAGTATCG
GGGGTAAAAT ATGTGGATCA GGCTGGCAAT GTAATAAAAG ATTTTACTCC TGATTATGAA
CTAACCTTTT TCATAGATGA TGATAACCCC GGAGTGCTTC ATTTTAATTT AGTGGAAGTT
AATAGTGGTT TTGAAGTGGA ATCAGCTGTA TATCTGATTA ATATAGAAGG TGATATCAGT
GGGGTCACTG AAGGAAAAAT GCTTGAATAC ACATCACTTT ATGATACGGG TGACCCCTTT
GCCCATCTGG ATTTCAATAA ATTCTGGTAT GAATGGTTGC AGGAAAATTA TCAGGATGAG
GGTATGATTG GTGAGGGTGA TTATGGTGTC AATTTTCCCC TGGATGGTGG GGAACTTACC
CTGACTATTA CGGGTTCGGG TAATGCTGCA GCTGGAGGGG CTATGCTTTT AAAGGAATTA
AATTCTGACC ATTTTCCCGA AGGGGCAGAT ATAACAAGCT TTGCGGTTGT TGTCGATGCC
AGAAACCTTG ATGTTGGTGA AGGCGGTTAC GGGGTTCTAC TAAGGGGGGA GGTAATTCAG
GCCAGGGATA AAGATGGAGA AATACTTGAA AATCAGTATA ATGATTATGG TTATATGTTC
CAGTTTGACC CCGGGGCCAG GGGGTTTGTG ATCAGGAGGA TAAAAGGAGG CTTTCATGAT
GTTGAAAATA ATATTGGGGC CTCTCTGATA ACAGGTAATG GTTATAACAC CCGTTTCGGT
GCACCTTATG CTCCGGAACA CCTGGTTAAT GATGTCTTTC ACTGGAGTGG TTATCAGGAC
TGGTTTAAAA GGTATAAAAC TGTAATAAAA GTTCAGACCC AGCCCGGTGG TGATTTAATA
TTGAGGGCAC ATCTTATTGA CGAGGATGGT CACAGATCAG ATGAAATGAT CTTTGGTGAT
TTTAATAAAC TGACGCTTAT CGGTAAATAT GGGGAAGAGA ATATATTTGA TGGCAGAAAG
CTCGATTATG ATTACTGGAG TAATGAAGAT GTAACCCTCC CTGGTAATAT TATCGGGCTC
AGGAGCTGGG ATATGCATAA TAATGAACAT ACAACCGAGT TTTATGAAAT TTCAATTGCC
CCGGCTGAAC CCGGTGTGAT AGATATAGAT TATGATAATA AAGTAATTAC ATTAACTTTT
GATGAAGAGG TAATAGCCGA TGATTTATCA TTATTAACTG GTAATTTTAT AATTACAAGG
GTTACTAATA ATATGGAATT TACTGTTTTC AGTATTGAAC AGGGAAACAC CACCAGGTCT
ATCGAATTAA ATCTCAGTAG TGCCCTGGGC AAGGGAACCT ATCTTATCAG TTATAGCAGA
CCAGCTTCAG GGGGACTTGC CGATTCGGAA GGAAATTTAG TTGAAGATTT TGATAGATAT
CTGGAAATTA ACTAA
 
Protein sequence
MFSRFFTSNK GLTLIEILVA LALVGIASTA IYGFLNFTVN NYQDGEEKIR VQDYTRLVAD 
EITEELRGVT SARLIEDVDS SEIDSENYNY MYVKPDIDLV IKRVKNELGL IVEEKIPGVS
GVKYVDQAGN VIKDFTPDYE LTFFIDDDNP GVLHFNLVEV NSGFEVESAV YLINIEGDIS
GVTEGKMLEY TSLYDTGDPF AHLDFNKFWY EWLQENYQDE GMIGEGDYGV NFPLDGGELT
LTITGSGNAA AGGAMLLKEL NSDHFPEGAD ITSFAVVVDA RNLDVGEGGY GVLLRGEVIQ
ARDKDGEILE NQYNDYGYMF QFDPGARGFV IRRIKGGFHD VENNIGASLI TGNGYNTRFG
APYAPEHLVN DVFHWSGYQD WFKRYKTVIK VQTQPGGDLI LRAHLIDEDG HRSDEMIFGD
FNKLTLIGKY GEENIFDGRK LDYDYWSNED VTLPGNIIGL RSWDMHNNEH TTEFYEISIA
PAEPGVIDID YDNKVITLTF DEEVIADDLS LLTGNFIITR VTNNMEFTVF SIEQGNTTRS
IELNLSSALG KGTYLISYSR PASGGLADSE GNLVEDFDRY LEIN