Gene Hhal_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1302 
Symbol 
ID4710794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1412420 
End bp1413349 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content70% 
IMG OID639855771 
Producthelix-hairpin-helix DNA-binding motif-containing protein 
Protein accessionYP_001002873 
Protein GI121998086 
COG category[L] Replication, recombination and repair 
COG ID[COG1555] DNA uptake protein and related DNA-binding proteins 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTCAGT CTACGTGGCC ACTAACAGCG AAGCTCTTCG GCGTCCAGGC CCGGGGGCAG 
AAGGCGGCGC AGCTGCAGCT GATCTACATG GTCTTCGCTG CCATCGTCGG CGGTGTGCAG
CTGCTCTTGA TGCTCATTGG CGGGCTGCTG CGCGGCGCCA TCGCCCTCGG GCAGTGGGTC
GGTGACAACG TGCACGCCCT GCCGCGGTGG TGGCGCAATG CCCGGCTGTG CCGGCAGCGC
CGGCGGATCG ACGAGGCCGT CGAGCAGGCG ATCAACCGCT GGGACGCGGA CCGGCTCAGC
GCCGCCTTCG CCCTGGCGGT GGCCGTCTGG GGGCGGGGTG AGGCGCTCCA GGACGGCCAG
CGGACGCACC GACGGGTCAG CGAGAAGGCG GGATGGGTCG CGCTTCCGCG CTCCCCGGAG
GTCTTCGCGG AGGCGGCACA GGGACTTGAA CATTGCCGCG CCGCCGTTCA GCCGAAGGAG
GATGCGCACC GGATCCTGAT CGCACTGCTC GCCGAAGTGG CGGCCGAGAA ACTGGAGGGG
TCCCGGCGGG CGGCGCTGCT CTTTGAGGCC GATGACCTGG CCCTGATCCA GGGCCCACGG
ACGGTGCTCC AGGAGCAGAT GCTGGAGATC TTCGCTGATC ACGCCCAGCT ACAGATCGAG
CCGGCCCGGC CGGTGGATGA GGCGACGAAG CCATCCTCAG CGCGGTCGGC TCGTGGTGCA
CCGGGGGCGG GGCAGGGCGA CGAGCCGACG GGGCGCATCG ACCTCAACAC GGCCTCCATC
GAAGAACTCC AGGCGATCCC CCACATCGGC CCGGAGCGCG CCGAGGCGAT CGTCGCCCTG
AGACCGATCC GGCGGATCGA GCAGCTTGAG GAAGTCGACG GGATCGGCAC GAGCCGCCTG
GCGGAGATCG CCGATCAGGT GAAGGTGTGA
 
Protein sequence
MGQSTWPLTA KLFGVQARGQ KAAQLQLIYM VFAAIVGGVQ LLLMLIGGLL RGAIALGQWV 
GDNVHALPRW WRNARLCRQR RRIDEAVEQA INRWDADRLS AAFALAVAVW GRGEALQDGQ
RTHRRVSEKA GWVALPRSPE VFAEAAQGLE HCRAAVQPKE DAHRILIALL AEVAAEKLEG
SRRAALLFEA DDLALIQGPR TVLQEQMLEI FADHAQLQIE PARPVDEATK PSSARSARGA
PGAGQGDEPT GRIDLNTASI EELQAIPHIG PERAEAIVAL RPIRRIEQLE EVDGIGTSRL
AEIADQVKV