Gene Hhal_2245 details

Gene Information

Locus tag: Hhal_2245
Symbol:
ID: 4709500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2458603
End bp: 2460999
Gene Length: 2397 bp
Protein Length: 798 aa
Translation table: 11
GC content: 71%
IMG OID: 639856721
Product: RNA-binding S1 domain-containing protein
Protein accession: YP_001003811
Protein GI: 121999024
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
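
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2458603
end_bp = 2460999
gene_length_bp = 2397
protein_length_aa = 798

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

def record_is_consistent():
    """Return True if the reported lengths agree with the coordinates."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )

print(span_bp, codon_count, expected_protein_aa, record_is_consistent())
```

Under these assumptions the span works out to 2397 bp, or 799 codons, which leaves 798 residues once the stop codon is set aside.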


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.659597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAGTG AGTCGATCCC GCAGCGCATC GCCCAGGAGC TGGGCGTGGG CAACCGGCAA 
GTCGAGGCCG CCATCGAGCT GCTCGACGGG GGCGCCACGG TCCCCTTCAT CGCCCGCTAC
CGCAAGGAGG TCACCGGCGG CCTCGACGAC AGCCAGCTCC GCTCGCTCGA GGAACGGCTG
ACCTACCTGC GTGAGCTCGA CGAGCGCAAG GAGACGGTGC TCAAGGCCAT CGACGAACAG
GGCAAGCTCA CCGACGAGCT GGCCGAGGCC ATCCGGACCG CCGAGACCAA GACCCGGGTC
GAGGACCTCT ACCGCCCCTA CCAGAAGAAG CGCCGCACCA AGGCGCAGAT CGCCCGCGAG
GCCGGGCTGG AGCCCCTCGC CGACACCCTC CTCGCCGACC CGAGCCAGGT CCCCGAGGAG
GCGGCGGCGC CCTACGTCAC CGGACCCGCC GAGGAAGGGC AGGAGACCCC GGCCGTGGCC
GATGTCGAGG CCGCCCTCGA CGGCGCGCGG CAGATCCTCA TGGAGCGCTT CGCCGAGGAT
GCCGACCTGC TTGAGGAACT GCGCGACTAC GGCTGGCGCA AGGGCTATCT GGTCAGCCGC
GTGGCCGAGG GCAAAGAGAA GGACGGCGCC CGCTTCCGCG ACTACTTCGA GCACGCCGAG
CCGCTGCGCA AGATCCCCTC CCACCGCGCC CTGGCCATGC TCCGCGGGCA GAGCGAGGAG
ATCCTGCGCC TGGAGATCGC CTGGTCCGAC GCCGAGGCCC GCGGCGAGAG CGACGAGCGC
AGCGTCGGTG AGGCGGCCAT CGCCCGGCGG TTCGCCATCG CCGACCACGG CCGACCGGCC
GACGCCTGGC TGGCGCGCAC CGTTCGCCTG GCCTGGCGGG CGAAACTCTC CACCCACCTC
GACCTGGCCC TCAAGCGCCA GCTGCGCGAG CAGTCCGAAG AGGAAGCCAT CCGCGTCTTC
GGCGCCAATC TGGAGGATCT GCTGCTGGCG GCGCCGGCCG GCCCCAGGCC GACCATCGGC
CTCGATCCGG GGCTGCGCAC CGGGGTCAAG GTCGCGGTCA TCGACGCCAC CGGGGCCGTG
GTCGATACCG CCACGATCCA CCCCTTCACC AGCCGCAACA AAGACCCCGA GGGATCGCTC
AAGAGCCTCG CTGACCTGGC CCGCAAGCAC GAGGTGGGGC TGGTGGCCAT CGGCAACGGC
ACCGCCTCCC GCGAGACCGA CGCCCTGGTC GGTGAGCTGA TCAAGCGCCA CCCGGAGCTG
GGCCTCCACA AGGTGGTGGT CTCGGAGGCC GGCGCCTCGG TCTACTCCGC CTCCGAGCAC
GCCTCGCGAG AACTCCCCGA GCTGGACGTC TCCCTGCGCG GCGCGGTCTC CATCGCCCGA
CGCCTGCAGG ACCCCCTGGC CGAGCTGGTG AAGATCGAGC CCAAGTCCAT CGGCGTCGGT
CAGTACCAGC ACGACGTCAA TCAGAGCCAC CTCGGGCGAA AGCTCGATGC GGTCGTGGAG
GACTGCGTCA ACGCCGTCGG CGTGGACGCC AACACCGCCT CGGCGCCGCT GCTCGCCCGC
GTCTCCGGCC TCGGCCCGGG CCTGGCCGAG AAGATCGTCC AGCACCGCTT CGACAACGGC
CGCTTCCGCA CGCGCAAGGA TCTCCAGGGG GTCCCGCGAC TCGGCCCCAA GGCCTTTGAA
CAGGCCGCCG GCTTCCTGCG CATCCCCCAG GGCGACAACC CGCTGGACGC CTCCGCCGTC
CACCCGGAGG CGTACCCGGT GGTCGAGCGG ATCTGCGCCG AGACCGGCCG GAGCGTGGCC
GATCTGATCG GCGACGAGGG CTTCCTCGGC GGCCTCGACC CGAAGGCGTA CACCGACGAG
CGCTTCGGCG AGCCCACGGT GCGCGACATC CTCGGCGAGC TCGCCAAACC GGGCCGGGAC
CCGCGCCCCG AGTTCCGTAC CGCCGCCTTC CGCGAAGGGG TGGAGAAAAT CCAGGATCTG
GAACCGGGCA TGGTCCTCGA GGGCACGGTG ACCAACGTCG CCAACTTCGG CGCCTTCGTC
GATATCGGGG TCCACCAGGA CGGGCTGGTG CACATCTCCG CCCTCGCCCA CGAATTCGTC
CGCGACCCGC GCGACAAGGT CCGCACCGGG GACGTCGTCC AGGTCAAGGT CATGGAAGTC
GACCTGGAGC GCCAGCGGAT CGGCCTGTCC ATGCGCCTCG ACGACGACCC CAACGCCCAG
GCCGAGGGGG GCCGCAAGGG GGCCAATGGC AAGGGCGCCT CGGCCGCCCG GGGCAAGGGC
GATGGCTCCG GCAACAAGGC GACCGGGCGC GGATCGAAGA AGGGCAAGAA GCAGGAAAAG
GCCGAACCCG CCACCGCCAC AGCGCTGGCC GAGGCCTTCC GCAAGGCCCG CTCCTGA
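
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of computing a GC fraction from text in that layout, for comparison with the reported GC content field. The function name `gc_fraction` is hypothetical, and only the first line of the sequence is used here as sample input; the full sequence would need to be pasted in to reproduce the whole-gene value.

```python
def gc_fraction(seq_text):
    """Return the fraction of G and C bases in a block-formatted DNA string."""
    # Remove spaces and line breaks used for layout, and normalise case.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# Sample input: the first line of the gene sequence above (60 bases).
first_line = "ATGGTCAGTG AGTCGATCCC GCAGCGCATC GCCCAGGAGC TGGGCGTGGG CAACCGGCAA"
print(f"GC fraction of first 60 bases: {gc_fraction(first_line):.3f}")
```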
 
Protein sequence
MVSESIPQRI AQELGVGNRQ VEAAIELLDG GATVPFIARY RKEVTGGLDD SQLRSLEERL 
TYLRELDERK ETVLKAIDEQ GKLTDELAEA IRTAETKTRV EDLYRPYQKK RRTKAQIARE
AGLEPLADTL LADPSQVPEE AAAPYVTGPA EEGQETPAVA DVEAALDGAR QILMERFAED
ADLLEELRDY GWRKGYLVSR VAEGKEKDGA RFRDYFEHAE PLRKIPSHRA LAMLRGQSEE
ILRLEIAWSD AEARGESDER SVGEAAIARR FAIADHGRPA DAWLARTVRL AWRAKLSTHL
DLALKRQLRE QSEEEAIRVF GANLEDLLLA APAGPRPTIG LDPGLRTGVK VAVIDATGAV
VDTATIHPFT SRNKDPEGSL KSLADLARKH EVGLVAIGNG TASRETDALV GELIKRHPEL
GLHKVVVSEA GASVYSASEH ASRELPELDV SLRGAVSIAR RLQDPLAELV KIEPKSIGVG
QYQHDVNQSH LGRKLDAVVE DCVNAVGVDA NTASAPLLAR VSGLGPGLAE KIVQHRFDNG
RFRTRKDLQG VPRLGPKAFE QAAGFLRIPQ GDNPLDASAV HPEAYPVVER ICAETGRSVA
DLIGDEGFLG GLDPKAYTDE RFGEPTVRDI LGELAKPGRD PRPEFRTAAF REGVEKIQDL
EPGMVLEGTV TNVANFGAFV DIGVHQDGLV HISALAHEFV RDPRDKVRTG DVVQVKVMEV
DLERQRIGLS MRLDDDPNAQ AEGGRKGANG KGASAARGKG DGSGNKATGR GSKKGKKQEK
AEPATATALA EAFRKARS
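
The protein sequence can be spot-checked against the gene sequence by translating codons. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code for sense codons. It translates only the first 60 bases of the gene and compares the result with the first 20 residues of the protein. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in the conventional TCAG order.
# Assumption: translation table 11 amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(seq_text):
    """Translate block-formatted DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGGTCAGTG AGTCGATCCC GCAGCGCATC GCCCAGGAGC TGGGCGTGGG CAACCGGCAA"
protein_start = "MVSESIPQRI AQELGVGNRQ".replace(" ", "")

print(translate(gene_start) == protein_start)
```

Translating the full gene the same way would stop at the terminal TGA codon.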