Gene Hhal_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2025 
Symbol 
ID4710378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2227836 
End bp2228804 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content74% 
IMG OID639856498 
Productmutator MutT protein 
Protein accessionYP_001003591 
Protein GI121998804 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase
[COG1051] ADP-ribose pyrophosphatase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.153674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATACGG CATCGCCCGC CGCGCCGATC CACGTGGCCG CCGCCGTGGT CCGCGGCGAA 
GACCAGCGCG TACTGGTCCA GTGCCGGCCG GATCACCTCG ATCACGGCGG TCTGTGGGAG
TTCCCCGGGG GCAAGATTGA GCCCGGTGAG TCGGTCGCCG ACGCCTTGGT CCGCGAGCTG
GACGAGGAGC TGGGTATCCG TGTCCGTCCG GGGGCGCTGC GCATCCGCGT GCCGTGGGAC
TACGGCCACC GGCGCGTGGT GCTCCACGTC CTCGATGTGA ACGAGTGGAC GGGGCGTCCC
ATCGGCCGCG AGGGCCAGGC GGTGGACTGG CTCACCCCCG AGGCGATGGC CGAGCGGGCC
TGGCCCGCCG CCAACTGGCC GATCATCCGT TCGCTGCAGC TCCCCGACCG GTATCTGATC
ACCCCCGTGG AGCCAGCGGA TGCCGATGCC TGGCTGGCCC GACTGGATGC GGCCCTGGCG
CGCGGTGTGC GCCTGGTTCA GCTGCGTCGC CCAGATCTCG ACGTGGAGGC CTGGGTGCGT
CTGGGGCGCG CCCTGCGCCG GCGCTGTGAC GCCCACGGTG CGTGGCTGCT AGCCAACGGA
CCGGCGGAAC AGGCCCGGGC GGTGGGCGCC GACGGGGTGC ACTGGAGCAG TCGCGTGCTG
GCCGAGGGGC CGCAACGCCC GGGGTGGGCG CGGTGGGTGG GCGCTTCTTG CCACAACGGC
GACGAGCTGG AGCGCGCCGC CGCCTGCGGG GCCGATTTCG CGCTGTTGTC ACCGGTGCAG
TGGACGGCCA GCCATCCGGA ACAGAGCGGC ATGGGGTGGG AGCGTTTCGC CGCCTGGGTG
GCCGGGGCGC GCCTGCCGGT CTACGCCCTC GGCGGCGTGG GCCCGGCGGA TATCCACCGG
GCTCGGGCCT GCGGCGGGCA GGGAGTGGCG GCTATCCGCG GCCTGCTGGC GGAGGCGGGC
CGCCGATAG
 
Protein sequence
MHTASPAAPI HVAAAVVRGE DQRVLVQCRP DHLDHGGLWE FPGGKIEPGE SVADALVREL 
DEELGIRVRP GALRIRVPWD YGHRRVVLHV LDVNEWTGRP IGREGQAVDW LTPEAMAERA
WPAANWPIIR SLQLPDRYLI TPVEPADADA WLARLDAALA RGVRLVQLRR PDLDVEAWVR
LGRALRRRCD AHGAWLLANG PAEQARAVGA DGVHWSSRVL AEGPQRPGWA RWVGASCHNG
DELERAAACG ADFALLSPVQ WTASHPEQSG MGWERFAAWV AGARLPVYAL GGVGPADIHR
ARACGGQGVA AIRGLLAEAG RR