Gene Hore_06140 details

Gene Information

Locus tag: Hore_06140
Symbol:
ID: 7314519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 668058
End bp: 669128
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 39%
IMG OID: 643611044
Product: peptidase M24
Protein accession: YP_002508366
Protein GI: 220931458
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
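
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against one another. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but that convention reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 668058
end_bp = 669128

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon
# and contributes no amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1071 bp) and Protein Length (356 aa).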


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGA GAATACAGCA ATTAAGAAAA GCTATGAAAA AAATGGATCT GGACAGTTTG 
ATTATTGATT CCAGCCATAA CCGATTTTAT TTAACCGGTT TTACTGGCAC AGCCGGGAGG
GTATTATTCA CACCAGAAAA TAATTATTTT ATAACAGATT TTCGTTATAC AGAACAGGCC
CATGAGCAGA TATCGGGTTT TGAAATACTG GAGGTTAACC AGAAAGCAGT TGTGGAAATT
TCTGATATTC TTGATCAGGA TAATTCTAGC CGGGTCGGTT TTGAAGCAAG AAGTGTTACC
TATGATGTTT TTCAAAAATA CAAAAAGACA TTTAATGAAA GTATCAAGCT TGTACCAACG
GCCGGGCTTG TTGAGAAAAT CAGGGTTGTT AAAGATAGAA GTGAAGTTGA GACTATTAAA
AAGGCAGCTG AGATAGCCGA TAGTGCTTTT AAACACATCC TTGATTTTAT AAAACCGGGG
GTAACAGAAA GGGAGGTTGC CCTGGAATTA GAATACTTTA TGAAAAAAAA CGGTGGAGAA
GGAAATGCCT TTGATTTCAT AGTTGCTTCT GGTAAAAGGT CTTCTTTGCC CCATGGGGTA
GCCAGTGATA AGGTTATTGA AGATGGGGAT TTTGTTACCA TGGATTTTGG AACCTATTAT
AAGGGTTATT GTTCTGATAT GACACGGACA GTGATAGTGG GAGAACCGAC CCCTGAACAA
AAGGAGATTT ATAATATTGT GCTAAAAGCT CAGAATGAGG TCATAAAAAA TATCAGAGCA
GGTATGACCT GTAAAGAGGC AGATGCTATT GCCCGTGATA TAATAGCTGA ACATGGGTAT
AAGGATAATT TTGGTCACAG CCTCGGTCAT GGCCTCGGAG TTGAAGTTCA TGAGGATCCC
CGTGTTTCTT ATGCTTCAGA TGAGGTATTA AAACCGGGGA TGGTAGTTAC TGATGAACCC
GGTATTTATA TTGCTGACTG GGGTGGAGTC AGGATAGAAG ATGACCTGTT GATAACTGAA
GATGGATGTG AAGTTCTGAC CAGTTCCCCT AAAGATCTTA TTTCGGTGTA G
 
Protein sequence
MEKRIQQLRK AMKKMDLDSL IIDSSHNRFY LTGFTGTAGR VLFTPENNYF ITDFRYTEQA 
HEQISGFEIL EVNQKAVVEI SDILDQDNSS RVGFEARSVT YDVFQKYKKT FNESIKLVPT
AGLVEKIRVV KDRSEVETIK KAAEIADSAF KHILDFIKPG VTEREVALEL EYFMKKNGGE
GNAFDFIVAS GKRSSLPHGV ASDKVIEDGD FVTMDFGTYY KGYCSDMTRT VIVGEPTPEQ
KEIYNIVLKA QNEVIKNIRA GMTCKEADAI ARDIIAEHGY KDNFGHSLGH GLGVEVHEDP
RVSYASDEVL KPGMVVTDEP GIYIADWGGV RIEDDLLITE DGCEVLTSSP KDLISV
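
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the first 60 bases of the gene sequence above and a hand-built codon table. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the standard assignments are used here. The helper name `translate` is hypothetical, not part of any library.

```python
# Build the codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, spaces allowed).
first_60_bases = (
    "ATGGAAAAGA GAATACAGCA ATTAAGAAAA "
    "GCTATGAAAA AAATGGATCT GGACAGTTTG"
)
print(translate(first_60_bases))  # MEKRIQQLRKAMKKMDLDSL
```

The result matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would stop at the terminal TAG codon.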