Gene Hore_22040 details


Gene Information

Locus tag: Hore_22040
Symbol:
ID: 7313752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2399595
End bp: 2401148
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 40%
IMG OID: 643612656
Product: polysaccharide biosynthesis protein
Protein accession: YP_002509944
Protein GI: 220933036
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID: [TIGR02900] stage V sporulation protein B
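
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch; the variable and function names are illustrative and not part of the database record. It assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon not counted in the protein length.

```python
# Values copied from the Gene Information record.
START_BP = 2399595
END_BP = 2401148
GENE_LENGTH_BP = 1554
PROTEIN_LENGTH_AA = 517


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # prints: 1554 517
```

Both computed values agree with the Gene Length and Protein Length fields of the record.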


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAC AATCACTGTT AAAAGGGGCT TTTATCTTAA TAATTGCGGG TTTTATAAAC 
AGGGTGCTGG GTTTCATACT AAGGATAATA CTCGTCCAGA TGATCGGTGA TGAAGGACTG
GGCCTTTTTC AGATGGTTTA TCCCTTTTTT ATAACCCTCC TTCTTATCAG TACCGCCAGT
TTTCCCACGG CAATATCAAA ACTTATACCG GAAAGGCTGG CCCGAAATGA TAAAAAAGGA
GTCTATCAAT TACTCAAGAC TTCATTACTT TTTGTTGGAG GGATGGGGCT ACTTACCGGC
ACTCTTTTAT ACTTTTTATC TGGTTTTGTA TCACAGAACA TATTCGGTGA TCCCAGAACC
AGGATTATTT TAATGACGTT AACCCCGGCC CTTTTTATAA CCCCTCTGGC CTCAAGCCTC
AGGGGGTTTT TTCAGGGACA CCATACTATG ATTCCTACTG CTGTATCACA GATTACAGAA
CAAATTAACA GGATGGGCTC CACCCTGGTT ATGGTCAGTA TAACCGGCTA TCTCGGTCTT
AAATATCAGG CAGCCAGTAT TGGGCTGGGA ATCAGTATCG GGGAACTATC CGGGTTAATT
ATCCTCCTAT ACTTTTTTGT TACACATATC AAGACCGATA ATAAAAAGAT AACCCCCCTG
AGAATCAAAA CCGGGGTCTT CCACTGCTTT AAAGAAATTA CTAAAATTGC CTTTCCAATC
ACAGCAGGGC GTCTAATCAA CTCTCTGATG TTAAGTGTGG AAGCTATTCT AATTCCCAGA
CAACTTCAAA ACAGTGGTCT GGGGGTCAGA GAAGCCACCT CCCTGTTCGG TCAATTGAGT
GGTATGGTAG AACAGATTAT CTTCTTTCCC ACAGTGGTAA CCATCGGTCT TACTACCAGC
CTTATTCCAA ATATATCAGA TGCCCATGCC CGGAATAATA TAACTAAAAT AAGGAAAAAC
TATCAGGATG TTATCAGGGT AACAACGTAT CTTGGTTTTC CACTGACGGT AATCTTTTTT
CTAAGGGGAC GGGAAATATG TAATCTTTTA TTCAACTTCC CTGCTGCTGG CCCTATTCTA
TCAGCTATGG CTCTGACCGC CACCTTTATC TATTATCTTC ATGTCTCTTC AGGAATGCTC
AATGGCCTCG GAAAACCTCA ACTGGCCTTA TTAAATCTGG GAATCGGCTC TGCCATAAAA
CTTACTGGAA TTTACTTTTT AACCCCCAGA CCAGAGCTTA GAATAATTGG CTCTATAATA
AGTATAACTC TGGGTTATAT TGCAGCTGCC ATCCTTAATT TCTTTACCAT AGGAAATACA
ATTGGTTATG ACCTTGATAT TAAACAGACC CTGGTAAAAC CACTATTTTC CAGTTTTCTT
ATCTTCATAA TAACTCCGTA CCTGTCCCGT ATTTTACATC CCCTTTATAA CCTTTATAAT
ATCCGGTTGG TTACACTATT AGAACTTGTA ATACTCGGCT TTGTTTACCT GATAACCATG
TTTGCCATAA AAGCTATCAC GGCAGATGAT ATCAAGAGGT TTACCGGTAA CTAG
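
The GC content reported in the Gene Information section is the percentage of G and C bases in the gene sequence. A minimal Python sketch of that calculation follows. The helper names are hypothetical, and the example input is only the first line of the sequence above, so its result is not expected to match the whole-gene figure.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of 10."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGGAAAAAC AATCACTGTT AAAAGGGGCT TTTATCTTAA TAATTGCGGG TTTTATAAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

To reproduce the whole-gene value, the same two helpers can be applied to the full sequence block instead of its first line.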
 
Protein sequence
MEKQSLLKGA FILIIAGFIN RVLGFILRII LVQMIGDEGL GLFQMVYPFF ITLLLISTAS 
FPTAISKLIP ERLARNDKKG VYQLLKTSLL FVGGMGLLTG TLLYFLSGFV SQNIFGDPRT
RIILMTLTPA LFITPLASSL RGFFQGHHTM IPTAVSQITE QINRMGSTLV MVSITGYLGL
KYQAASIGLG ISIGELSGLI ILLYFFVTHI KTDNKKITPL RIKTGVFHCF KEITKIAFPI
TAGRLINSLM LSVEAILIPR QLQNSGLGVR EATSLFGQLS GMVEQIIFFP TVVTIGLTTS
LIPNISDAHA RNNITKIRKN YQDVIRVTTY LGFPLTVIFF LRGREICNLL FNFPAAGPIL
SAMALTATFI YYLHVSSGML NGLGKPQLAL LNLGIGSAIK LTGIYFLTPR PELRIIGSII
SITLGYIAAA ILNFFTIGNT IGYDLDIKQT LVKPLFSSFL IFIITPYLSR ILHPLYNLYN
IRLVTLLELV ILGFVYLITM FAIKAITADD IKRFTGN
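
The protein sequence above is the conceptual translation of the gene sequence under translation table 11. As a rough illustration, the Python sketch below translates the first 60 bases of the gene sequence. The function and constant names are hypothetical. The sketch relies on table 11 assigning the same amino acids to codons as the standard code and differing only in the permitted start codons, which it ignores. It also assumes the input contains only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop grouping spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_60 = "ATGGAAAAAC AATCACTGTT AAAAGGGGCT TTTATCTTAA TAATTGCGGG TTTTATAAAC"
print(translate(first_60))  # prints: MEKQSLLKGAFILIIAGFIN
```

The output matches the first 20 residues of the protein sequence listed above.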