Gene Hore_12420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_12420 
Symbol 
ID7313563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1336540 
End bp1337547 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content40% 
IMG OID643611682 
ProductDeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_002508987 
Protein GI220932079 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCA GGACAAGAAT AGAACAGATT GAAGAAGAGG TTTTATCTCC CTATGCCTGT 
TTAAGTAAAA ACAGCAGGGG GCGATTGAAA ACAGAAGATG AGTGTAATAT CAGGACAATA
TTTCAACGGG ACCGGGATAG AATAATTCAT TCTAAAGCCT TCAGGCGTTT AAAACATAAA
ACTCAGGTCT TTATCGCTCC GGAGGGTGAT CACTACCGGA CCAGATTAAC CCATACCCTG
GAGGTATCCC AGATTGCCAG AACTATTGCC AGGGGTTTAG GGCTTAACGA AGACCTGACA
GAAGCAATTG CTCTGGGGCA TGACCTGGGA CACACACCTT TTGGACATGC CGGGGAAGAG
GTTCTGGATG AAATAAGTTC AAATGGTTTT TCCCATAATG TCCAGAGCCT CAGAGTGGTA
GATTACCTTG AGGTCAGGAA TTCAAATTTA AGGGGATTAA ATTTGAGCTA TGAAGTACGG
GATGGAATTT TAACCCATAC TGGAAAACAA GATCCCTCTA CCCTGGAGGG ACAAATTGTG
AAAATTGCAG ATAGAATTGC CTATATTAAC CATGATATTG ATGATGCCCT CAGAGGGGGT
ATTATTAGTG AAAAAGATTT GCCTCATACA GCTATCAAGA TACTTGGTAA TACCCATTCT
GACCGGATAG ATACAATGGT GAGGGATATT ATCAGGGAAA GCTGGAATAA ATCAGTTATA
AAAAGAAGTC GAGATGTTGC AGAAGCAACC ACTGAACTAA GGCAATTTCT TTTTGAAAAT
GTATATATCG GTTCCAAGGC AAAAAAAGAA GAGAATAAAG CTAAAAATCT GGTCAAAAAA
CTATATTATT ATTATCTGGA CCATCCCCGG GAGATCCCTG ATGAATTTAA ACAAAAAGAA
GGAAATGAAG ATATAGAACA AATGGTTATT GATTATATTG CAGGTATGAC TGACCGTTAT
GCTATTAAGA TGGGACAGGA ATTATTTCTT CCTTCTCCAT GGGGGTAA
 
Protein sequence
MNGRTRIEQI EEEVLSPYAC LSKNSRGRLK TEDECNIRTI FQRDRDRIIH SKAFRRLKHK 
TQVFIAPEGD HYRTRLTHTL EVSQIARTIA RGLGLNEDLT EAIALGHDLG HTPFGHAGEE
VLDEISSNGF SHNVQSLRVV DYLEVRNSNL RGLNLSYEVR DGILTHTGKQ DPSTLEGQIV
KIADRIAYIN HDIDDALRGG IISEKDLPHT AIKILGNTHS DRIDTMVRDI IRESWNKSVI
KRSRDVAEAT TELRQFLFEN VYIGSKAKKE ENKAKNLVKK LYYYYLDHPR EIPDEFKQKE
GNEDIEQMVI DYIAGMTDRY AIKMGQELFL PSPWG