Gene Hore_21090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_21090 
Symbol 
ID7313343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2289494 
End bp2290837 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content45% 
IMG OID643612557 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_002509849 
Protein GI220932941 
COG category 
COG ID 
TIGRFAM ID[TIGR01319] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTGG ATCTATTAAT AGCCGAAATA GGGAGCACAA CGACCGTAGT TAACGGGTTT 
ACCGGGATAA AAAAGGGCAC TCCCCGCTTA AAAGGACAGG GTCTGGCTCC GACTACAGTA
CTCGAGGGAG ATGTTACGGT AGGTTTGAAT AATGCTATTG AAAACCTGAA GGAAGTCCTG
GGAGCAGATG ACCTGACCTG GTCTGAGTTT ATGGCTACCA GCAGTGCTGC CGGGGGGTTG
AAGATGACTG TTCACGGACT GGTTAAAGAT ATGACTGTTA AAGCTGCCCG GGAGGCAGCC
CTCGGGGCCG GTGCTGTTAT CAAAATGGTT ACAGCCGGTA AACTCCGGGA ATCTAAATTA
AATAAGATAA AAGAGATCAA CCCCAACATT ATCCTGCTGG CCGGTGGGGT TGACTATGGG
GAAGAAGAGG TTATTCTCCA CAATGCCAGG CTTCTGGCTC AAATGAAATT AAATGTTCCT
ATAATTTATG CCGGGAATAA AGCCATTCAG GACGAAGTCA GGGATATCTT TGACAATAGC
CAGCAGGATA TTCTTATAAC TGATAATGTT TATCCAGAAA TTGATACCCT GAATATCGAA
CCAACCCGGA GACTGATTCA ACAGGTCTTT GCCCGTCATA TTATTAAAGC TCCGGGTATG
GAAAAGATAA AACCGATGTT AAGTTCTGAA ATGATGCCAA CTCCGGGGGC TGTTATGAAG
TCTGCCCGGT TGTTACGGGA GGATATCGGT AATCTGGTGG TTATTGATAT CGGAGGGGCC
ACTACTGATG TTCATTCGGT GACAGAAGAT ACCCCCGGGG TTAAACAGGT TTTAATTAAC
CCTGAACCGG TAGCCAAAAG AACAGTGGAA GGGGATCTGG GGGTTTTTAT TAATGCCCCC
CATGTTTTTA AATTAATAGA AGAAACCCGG AAGAATGGGT ATGAATATAA AGAAGTTAAA
GCTATACCAG GTAATGAAAA AGAGAGGGGC TTTGTCCGAC TTCTGGCAGA AACAGCAGCC
CTGACTGCAA TAAAAAGGCA TGCCGGCCGG CTCAGGGATT TTTTTGGACC CCAGGGCAGG
CAAACGGTGG CTGAAGGTAA GGATTTAAGT GGTGTTAGAT GGGTTATCGG AACCGGTGGC
GCTTTAACCA GGCTTGGCCG GGGTGATGAA GTATTAACCA GAGTTATTGA ATACCGGGGT
TCGAGACTTT TTCCTCCTGA AGATGCTAAA ATTCTTATTG ACAGGAATTA TATTATGGCT
GCTATGGGCA TATTGAGTGA AAAATACCCC GATAAAGCCC TGAGATTATT AAAAGAAAGC
CTGGGGTTGG ATATTGAGGA TTAA
 
Protein sequence
MDLDLLIAEI GSTTTVVNGF TGIKKGTPRL KGQGLAPTTV LEGDVTVGLN NAIENLKEVL 
GADDLTWSEF MATSSAAGGL KMTVHGLVKD MTVKAAREAA LGAGAVIKMV TAGKLRESKL
NKIKEINPNI ILLAGGVDYG EEEVILHNAR LLAQMKLNVP IIYAGNKAIQ DEVRDIFDNS
QQDILITDNV YPEIDTLNIE PTRRLIQQVF ARHIIKAPGM EKIKPMLSSE MMPTPGAVMK
SARLLREDIG NLVVIDIGGA TTDVHSVTED TPGVKQVLIN PEPVAKRTVE GDLGVFINAP
HVFKLIEETR KNGYEYKEVK AIPGNEKERG FVRLLAETAA LTAIKRHAGR LRDFFGPQGR
QTVAEGKDLS GVRWVIGTGG ALTRLGRGDE VLTRVIEYRG SRLFPPEDAK ILIDRNYIMA
AMGILSEKYP DKALRLLKES LGLDIED