Gene Hore_22420 details


Gene Information

Locus tag: Hore_22420
Symbol:
ID: 7312994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 2443423
End bp: 2444766
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 46%
IMG OID: 643612694
Product: hypothetical protein
Protein accession: YP_002509982
Protein GI: 220933074
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
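
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2443423
end_bp = 2444766

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1344
print(protein_length)  # 447
```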


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAATACT ACCGTTTTAA GCAGGAAGGG AGAATGAACT GTGATGTCCG GGTCTTTGCC 
ACCCGGGACC TCTATAACCA GATAGAAAAA ACAGCCTTAA AACAGCTTTT TAATGCAGCC
AGCCTTCCAG GAGTAGTCGG GGTTATAGGA CTCCCCGATA TCCATCAGGG GTATGGATTA
CCCATCGGTG GAGTAATGTG TTCCAGTTTA AAAAAAGGAG TCATCTCTCC CGGGGCGGTA
GGTTTTGATA TAAACTGTGG AGTCCGGCTT TTAGTTGCCG GTTTAAGGCT CGAAGATATA
ATTGATAAAC TGGATGATAT AATGTCCGGA CTGAAAAATG AAATCCCGGC CGGGTTAGGG
GTTAATTCAA CATTAACATT TACCGACCAG CAATTTGAAC GGGTAGTTGA GGAGGGACTA
CCCTTTTTAA TAACCAGGCT GGGGTATGGA CAGACCATTG ATATAGCAGC CTGTGAGGAA
AACGGCCACC TAAAGGGGGC AGATCTTACG GGGGTCTCAA AAAAGGCCAT AAACAGGGGT
AAAAAACAGC TGGGAACCCT CGGTTCAGGC AATCATTTTC TTGAAATTCA GGTAATTGAT
AAGGTCTATA ATCATAACTC CGGTCTCGAA GAGGGCCAGA TCAGTATCAT GATCCACACC
GGATCCCGGG GTTTTGGCCA CCAGATTGCT GAAGATTATA TCAACATTGC CAAAAAAAGG
GCCAAAAAAT ATAATTTTGA TTTCCCCACT AAAAACCTGG CCTCCTTCCC CATTAATTCC
CCGGAAGGGG AAGACTACTA CCGGGCCATG GCCTGTGCCG CTAACTTTGC TTTTGCCAAC
CGGCAGATAT TAACCCATTT TGTAAGACAG GTAATAAACC ACTTTATACC GGGAACCTTT
ATTACTGTAT ATTATGACCT CGCCCATAAT ATCTGCAAAA AGGAAATTCA CCAGATAAAT
GGCAAGAAAA AAGCCCTTCT GGTCCACCGT AAAGGGGCTA CCAAGCTATC CCCTGACGGC
ATTGCCCTTG TTCCAGGATC TATGGGAACA GACAGTTATA TTGTCAGGCC GAAAAATCAG
GAGGCCCTGA AAGCTGCCTT TGAATCTGTT TCCCATGGAG CCGGTCGGAA AATGGGGAGA
AGGCAGGCCA GAAAGAAACT ATCATACCGG GAACATTTAA AGAGTCTGGG GGAAGTCAGA
GTGACCTCGG CCACCAATGA CAACCTCCTG GATGAATCAC CACTGGCCTA TAAGGATATT
AGTGAGGTCA TAAGGTCCCT TAAAGAAACC GGGCTGGCAG AACCGGTGGT CCGTCTTAAA
CCCCTGGCTG TTTTAAAGGG ATAG
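
A GC content figure like the one in the Gene Information table could be recomputed from a sequence printed in this layout. The following is a minimal sketch: the function name `gc_content` is hypothetical, it assumes the sequence contains only A, C, G and T, and it strips the spaces and line breaks used to group bases into blocks of ten. How the database rounds its percentage is not stated, so no value for the full gene is claimed here.

```python
def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (block separators, newlines) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = "TTGAAATACT ACCGTTTTAA"
print(gc_content(example))  # 25.0
```

Passing the complete gene sequence as one string gives the figure for the whole gene.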
 
Protein sequence
MKYYRFKQEG RMNCDVRVFA TRDLYNQIEK TALKQLFNAA SLPGVVGVIG LPDIHQGYGL 
PIGGVMCSSL KKGVISPGAV GFDINCGVRL LVAGLRLEDI IDKLDDIMSG LKNEIPAGLG
VNSTLTFTDQ QFERVVEEGL PFLITRLGYG QTIDIAACEE NGHLKGADLT GVSKKAINRG
KKQLGTLGSG NHFLEIQVID KVYNHNSGLE EGQISIMIHT GSRGFGHQIA EDYINIAKKR
AKKYNFDFPT KNLASFPINS PEGEDYYRAM ACAANFAFAN RQILTHFVRQ VINHFIPGTF
ITVYYDLAHN ICKKEIHQIN GKKKALLVHR KGATKLSPDG IALVPGSMGT DSYIVRPKNQ
EALKAAFESV SHGAGRKMGR RQARKKLSYR EHLKSLGEVR VTSATNDNLL DESPLAYKDI
SEVIRSLKET GLAEPVVRLK PLAVLKG
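
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates this with a small hand-written translator; `translate_cds` and the constant names are hypothetical, and the start-codon set is taken from the NCBI definition of table 11.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the standard amino acid assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # any table 11 start codon initiates with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGAAATACT ACCGTTTTAA GCAGGAAGGG AGAATGAACT GTGATGTCCG GGTCTTTGCC"
print(translate_cds(first_line))  # MKYYRFKQEGRMNCDVRVFA
```

The output matches the first 20 residues of the protein sequence. Without the start-codon rule, the first residue would come out as L rather than M.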