Gene Hhal_1067 details


Gene Information

Locus tag: Hhal_1067
Symbol:
ID: 4709845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1159009
End bp: 1160445
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 71%
IMG OID: 639855538
Product: GntR family transcriptional regulator
Protein accession: YP_001002645
Protein GI: 121997858
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
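
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends in a single stop codon (the record does not state either convention explicitly). The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp values taken from the record above.
length_bp = gene_length(1159009, 1160445)
print(length_bp, protein_length(length_bp))  # prints: 1437 478
```

Under these assumptions the result agrees with the listed gene length (1437 bp) and protein length (478 aa).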


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.315299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACAC CACTCTACGA ACGGGTGGCC GAGACGCTGG CTCATCAGAT CGAGCAGGGC 
GTCTACGCGC CCGGGGATCG TCTGCCCGGA CTGCGGCGTC TGCGCCGGCA CTTCGGGGTG
AGCATGGCCA CCATCGTCGC CGCCTGCGAA CTCCTTGAGC AGCGCGAGGT CCTCGAGGCC
CGACCCCGCT CGGGGTTCTT CGTCCGCGAA CGGGCGACAG CGAGCGCAGC CCCCAGCGAG
CAAGTGGACG AGGTGGATGC GCCGAGTCTG GTGCTCGGCC AGGAGCGCGT CCTCGACCTG
GTGCGCGCCC ACCACGAGCC GGATGTGGTC TCGCTGGGGG CGGCCGCAGC GCCGGCGGCG
TTCCTCCCGG CCGCGGCCAT GGACCGCAGC TTCGCCCGGG TGCGTCGCCG CCAGCGCGAG
CGGGTCAATG CCTACGACTT CCCGCCGGGG TGTGCCGAGC TGCGCACCCA GATCGCCCGC
CGCATGGCCT ACGCCGGCTG CAGCCTGTCT CCGGACGACA TCGTGCTCAC CGCCGGGTGC
CAGGAAGCGA TCACCCTGAG CCTGCGCGCC GTGGCCGAGC CCGGCGACGT GGTGGCCATC
GAGTCGCCGA CCTTCTACGG CATCCTTCAG GCCATCGAGT CCCAGGGCAT GCAGGCCCTG
GAGGTGGCCA CGGACCCGCA TTCCGGGATG ATCCCCGAGG CGCTGGAACG GGCCCTGTCG
CGTTGGTCGA TCCGCGCCTG TGTGCTGATG CCCACCTTCG GCAATCCGTT GGGGCACGCC
ACCCCGGAGT CGCGCAAGCA GGAGCTGGTG GGGGTGCTCA GCGCCCACGG CGTGCCCTTA
ATCGAGGACG ATGTCTACGG TGAGCTGGCC TTCGACGGTA CCCGGCCGTG GGCAGCGAAG
GCCTTCGACG CCCACGGCGA GGTGCTCTAC TGCAGTTCCT TTTCCAAGGT CCTCGGTGCC
GGGCTGCGGG TCGGCTGGGT CGCGCCGGGC CGTTACCGGG ACCGCCTGGT CTATCTCAAG
TACGCGACCA GTCAGGCCAC TTGTACCCTG AGTTCGTTGG CGGTGGCCGA CTACCTGGAG
CAGGGCGGCT ACGACCGCTT CCTGCGCCGG GCCCGGCGCA ACTACGAGCG GCATGTGCGC
TGGGTGGCCG GCCTGGTACG CCGCCACTTC CCGGAGGGCA CCCGGGTGAC GCGGCCGCGG
GGCGGTTTCG TGGTGTGGGT GGAGCTCCCG GCCGGCTGCG ATAGCCTGGA GCTGCAGCGC
CGCGCCCTGG CCGAGGGAAT CAGCATCGCC CCCGGTCCCG TCTTCTCGCC CACCGGCCGG
TACGGACGCT GCCTGCGGCT CAACTGCGCC CAGGCCGACG TGCCGGCGAC AGAGTGGGCG
CTGCAGCGGC TCGGCGCCCT GGCCATCGGC GAGGATCGCG GCGACGTGCG CGGCTGA
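
The GC content listed for this gene is presumably the share of G and C bases in the sequence above. A hedged Python sketch of that calculation follows. The helper `gc_percent` is a hypothetical name, and the rounding used by the source database is not known, so no claim is made here about reproducing the listed figure exactly.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters; this drops the spaces and line breaks
    # used to group the sequence into blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence, used only as a short demo.
print(gc_percent("ATGACTACAC"))  # prints: 40.0
```

Pasting the full gene sequence into `gc_percent` gives a value that can be compared with the GC content field of the record.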
 
Protein sequence
MTTPLYERVA ETLAHQIEQG VYAPGDRLPG LRRLRRHFGV SMATIVAACE LLEQREVLEA 
RPRSGFFVRE RATASAAPSE QVDEVDAPSL VLGQERVLDL VRAHHEPDVV SLGAAAAPAA
FLPAAAMDRS FARVRRRQRE RVNAYDFPPG CAELRTQIAR RMAYAGCSLS PDDIVLTAGC
QEAITLSLRA VAEPGDVVAI ESPTFYGILQ AIESQGMQAL EVATDPHSGM IPEALERALS
RWSIRACVLM PTFGNPLGHA TPESRKQELV GVLSAHGVPL IEDDVYGELA FDGTRPWAAK
AFDAHGEVLY CSSFSKVLGA GLRVGWVAPG RYRDRLVYLK YATSQATCTL SSLAVADYLE
QGGYDRFLRR ARRNYERHVR WVAGLVRRHF PEGTRVTRPR GGFVVWVELP AGCDSLELQR
RALAEGISIA PGPVFSPTGR YGRCLRLNCA QADVPATEWA LQRLGALAIG EDRGDVRG