Gene Hore_11910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_11910 
Symbol 
ID7313889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1285219 
End bp1286403 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content30% 
IMG OID643611627 
ProductTPR repeat-containing protein 
Protein accessionYP_002508936 
Protein GI220932028 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000000944223 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AGTATTATTT ATTAATATTA TCAATTTTTA TTTTTATAAT GGCATTTAGT 
ACTACAGGAC TGGCTATTGA AATTAATGAT AATTCTTCAT TTATCAAGGG AATAACATCT
TATTATAATC AAAACTTTGA TGAAGCTATT GTTAATTTAA ATAAAGCAAT TAAAAATAAT
AATAATGAAC AAATAATAGT TGATTCCCTT TACTATCAAA CCCTGTGCTA TATTGGGAAA
AATGATATTG TCAAAGCTAA AGAAAACATC TTAAAATTAA AAAAAATGGG ATATGAATTT
GGAATTATTC ACTGGAAACT TGGTGAGGTT TACTTAAATA AACATAAACA ATTTGATAGT
CCATTTTATA ATGAAGCAAA AAAAGAACTG GAAAAAGCTT ATTTACTTGG TATTAATACT
ACTCTTTTCC ACAGGAGCCT TGCTCTTGTC TATAAAAATC TTGACGAATT AGATAAAGCA
GAAGAGGAGT ATGAAATGAT AATAGCCCAG AATGGACAAG CCGAAGATTA TATTAATCTG
GCTTCTATTT ATAAAAAACT GGGTAAGGTG AATTTGGCCA TTAATACTTA TGAAAAGGCT
CTTGATATGA CTGAAAGCAC TCAGAATAGG ATATCAATTT ATTCAAACCT GGCTGAACTT
TACATGAATG ATAAAGATTA TGAGAAGGCG ATAAAAGTTC TTGAGGAAAG TAAAAAGATT
AATCCTGATT TTGTTGCTAT AAGTACTAAA CTTGGTGAGG CCTATTATTT AAGTGGCAAT
TATGAACTGG CCAGAGAGGA ATTTGAAAAA GTAGTATCAA TCAATGATAA ATCCTATAAA
GCCTATTATT ATCTTGGAAA AATTCATGAA ATTAACCACA ATGAGGATAA AGCCATTTAT
TATTATAAAC AGGCCCTCAA ATATAACCCC GAATATGCCT CAGCATATAT CGCTCTGGGA
GATATTTATA TTAGACAGGA TAAACCCTAT TTAGCCATTT CCCATTATTC AACTGCAATT
GAAAAAAACC CGAATTACCC TGATAGCCAT TTCCACCTTG CGGTAACTTA TTATATACTA
GAAATGGAAG ATGCAGCCAT TGCTGAATTA AAGAAAACAT TACACTTAAA TCCAAATCAC
CGCGGTGCTC AAAAACTGTT AGAGAAATTA ACAGAGGGTG AGTAA
 
Protein sequence
MKRKYYLLIL SIFIFIMAFS TTGLAIEIND NSSFIKGITS YYNQNFDEAI VNLNKAIKNN 
NNEQIIVDSL YYQTLCYIGK NDIVKAKENI LKLKKMGYEF GIIHWKLGEV YLNKHKQFDS
PFYNEAKKEL EKAYLLGINT TLFHRSLALV YKNLDELDKA EEEYEMIIAQ NGQAEDYINL
ASIYKKLGKV NLAINTYEKA LDMTESTQNR ISIYSNLAEL YMNDKDYEKA IKVLEESKKI
NPDFVAISTK LGEAYYLSGN YELAREEFEK VVSINDKSYK AYYYLGKIHE INHNEDKAIY
YYKQALKYNP EYASAYIALG DIYIRQDKPY LAISHYSTAI EKNPNYPDSH FHLAVTYYIL
EMEDAAIAEL KKTLHLNPNH RGAQKLLEKL TEGE