Gene Hore_05840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_05840 
Symbol 
ID7313556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp639099 
End bp640319 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content41% 
IMG OID643611014 
Productpeptidase U32 
Protein accessionYP_002508336 
Protein GI220931428 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000158903 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAG TTGAACTACT GGCTCCGGCA GGAAATCTAG AAAAATTAAA ACTGGCTATT 
TTATATGGTG CCGATGCTGT TTACTGTGGT GGTTTGCGTT TTGGACTTCG TTATGGGGCT
GATAACTTTA CTCCTGAAGA ACTGGAAGAA GGAACCAGAT TTGTCCATAA CCACGGAGGC
AAAATTTATA TAACGGTAAA TATATATCCC CATAACAATG ATCTGGCAAA GCTGCCAGAC
TATTTACATA AACTGGAAGA AATCGGGGTT GATGGTTTAA TCGTGTCAGA TCCCGGTGTA
ATTGAATTTA TAAACAGGGA AAAAATAGAG ATACCTCTCC ATCTCAGTAC TCAGGCCAAT
ACTGTTAACT GGGCCAGTGC CAGCTTCTGG CATAAACAGG GTATTGAAAG AATAATTCTG
GCCCGGGAAT TGAGCCGTGA GGAAATAAAG GAAATTAGAG ACAGAACCAG TATTTCCCTG
GAAATGTTCG TTCATGGTTC AATGTGTATT TCCTATTCTG GTCGGTGTTT ATTGAGCAAT
TATATGGTCG GGCGTGATGC CAACCGGGGG AAATGTGCCC ATCCCTGTCG CTGGAAGTAT
CATCTGGTTG AGGAACAGCG ACCCGGGGAA TACTATCCAG TATATGAGAA TGAACAGGGA
ACTTTTATTA TGAATTCAAA AGACCTCTGC CTTATTGAGT ATTTACCAGA CGTTATTTCA
ACCGGGGTAG ATAGTTTAAA GATAGAGGGA CGAATGAAGA GCCTTCATTA TGTAGCTACT
GTAACCAGGG TTTACCGTAA GGCCATAGAC TCTTATTATC ATGATCCTGA AAATTTTAAG
GTTAAACCTG AGTGGCTTGA TGAGCTAAAG AAAGTAAGCC ACCGGGGTTA TACAACAGGG
TTTTTTATCT CTCCTCCGAC TGGAGAAGAC CATAATTATA ATTCTTCAGT ATATATAAGG
GATCATGACT TTATGGGGAT TATCAGGGAT TATGACAAAA AGAAAAATGA GGCTGTAGTT
GAAGTCAGGC ATAAATTCTT TAAAGGTGAC AGGGTTGAGG TGATGGGACC GGATACAACT
AATTTTGAAA CAACTGTAAA TTATATAATC AATGAAAACG GGGAAGAAGT GGATGAAGCT
CCCCATCCCA GGGAGCTAAT ACGGATACCG GTAACCCATA AGGTTAAACC CTATTACCTT
GTAAGGAGGA AAAAGTCATG A
 
Protein sequence
MKKVELLAPA GNLEKLKLAI LYGADAVYCG GLRFGLRYGA DNFTPEELEE GTRFVHNHGG 
KIYITVNIYP HNNDLAKLPD YLHKLEEIGV DGLIVSDPGV IEFINREKIE IPLHLSTQAN
TVNWASASFW HKQGIERIIL ARELSREEIK EIRDRTSISL EMFVHGSMCI SYSGRCLLSN
YMVGRDANRG KCAHPCRWKY HLVEEQRPGE YYPVYENEQG TFIMNSKDLC LIEYLPDVIS
TGVDSLKIEG RMKSLHYVAT VTRVYRKAID SYYHDPENFK VKPEWLDELK KVSHRGYTTG
FFISPPTGED HNYNSSVYIR DHDFMGIIRD YDKKKNEAVV EVRHKFFKGD RVEVMGPDTT
NFETTVNYII NENGEEVDEA PHPRELIRIP VTHKVKPYYL VRRKKS