Gene Hhal_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0422 
Symbol 
ID4711520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp490262 
End bp491926 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content66% 
IMG OID639854880 
Producthypothetical protein 
Protein accessionYP_001002013 
Protein GI121997226 
COG category[R] General function prediction only 
COG ID[COG3972] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCATG TTCACCCTTC CAACATCGAC ACCCTTCGGC TGGCCGGTGC GCCGGAGCGC 
GAGCTTAGGA CGCTGGAGTG GCTGGGCGAC TCGCTGCCGC AGAGCTACAC CGTCTACCAC
GGCGTGCACT GGAGTGCCGG GTCCGGGCGG GGCGCCGTGT TCGGTGAGGT GGATTTTGTC
GTCGTTAACG CCGCCGGCGA GGTCCTGCTG ATTGAGCAGA AGAACGGGGC TTTGGCTGAG
AGCGACGGCC TGCTCGGCAA GGACTACGGG CATGAGCGCG GCCCCAAGGA CGTGGTCCGG
CAGCTGCACC GCAGCCGCGA AGGGTTGCTC GGAGCCCTCG AGCGCGGTCT CGGTGGCCGC
AAGCCGCCGG GTATGAGCCT GCTGCTGTAC TGCCCGGCGC ACCGGCTGCA GGGGGACGCA
CCGGCTGGGC TCTCTCGCGA GCAGATCGTC GATGCTTCAA GAGTCCAAGA ACTCCCCGCA
TCCGTCGAGG CGCAGCTCGG GCCGGGGCAG GACGCGCCGG ATACAGCCCG GACGGTTCGG
CGCGTGCTCG CGCAGGAGCT GGATTTGGCA CCGGATCTCG GCGACGAGGT CACGCTTCAG
GAGCAGACCT TCCAGCGTCT CGCTGGTGGC ATCACCGAGC TGGTCCAGGG GCTGGAGATG
AGCCCATGGC GCCTTCGGGT TATCGGTGCT GCGGGTAGCG GGAAGACGAT TGCCGCGATC
GAGTTTTTCG AGGCTGCGCA AGCCCGGGGA GAACGACCGG CGCTGGTCTG TTTCAACCGC
GTCCTGGGCG ACCGGCTGCG CGCACGACTG GAGGGCAACG CTGACGTCGG CAATTTCCAC
CGCCTTTGTC ATGCCTGGCT TGAGGCCGTT GGTGAGAGCT TCGATGCGCA GCGGGCGCGG
CGGGAGCCCC AGGAGTATTG GAGCGAGGTG GCGGACCGGC TCATCGAGCA TTCCGAGCGA
TTGCCCTGTT TCGACCGATT GATCGTCGAT GAAGGGCAGG ACTTCTCCGA AGAGTGGTGG
GAGCTCCTGC GCATCTGTCT CGTCGACGAT GATGCACCGG TCTTGTGGCT GGAAGACCCC
CAGCAGGATC TCTACGGGCG CAACGACCAG CAACAGTCCG CGTTCGTCAC CTACCGAACG
GGCAAGGCTT TTCGGACGCC ACGCCGGATT GCGCAGTTCG TCCGTCGCCT GCTCGAGGTC
GATATCGACT GGCGTAATCC GCTCGACGGC CATAAGCCGC GGGTTACCCG GTACGCAACG
GCCGATGAGC AGCGCGAGGC CCTTCTTCAG GCTGTGGAAC ACCTCGAGAG CGAGGGCTTT
CGCAAGGATC AGATGGTGTT GCTGAGCCTG CACGGTCACG GCCGTGATCC ACTGGCGGAG
ACGGCCCGGC TGGGTCGTTA CCGCCTCAAG CGGTTTACGG GGGATTTCAC CGAAGACGGC
CAGCCGGTGT ACTCGAAGGG CGATTTACGC GTCGAGACGG TCTATCGCTT CAAGGGCGAG
CAGCGCCCCG CCGTGATCCT GATGGACGTC GATTTCGACG GTAGCCGGCC CGAGCGTGAG
CAGCGTCTGC TCTACTGTGC GCTCACCCGG GCCTCGGTGG CCTGCGAGGT GCTGGTCGCT
GAGGGTTCCG CGTGGCGGAA ACGGCTGGAG AACGCGGCAT CGTGA
 
Protein sequence
MAHVHPSNID TLRLAGAPER ELRTLEWLGD SLPQSYTVYH GVHWSAGSGR GAVFGEVDFV 
VVNAAGEVLL IEQKNGALAE SDGLLGKDYG HERGPKDVVR QLHRSREGLL GALERGLGGR
KPPGMSLLLY CPAHRLQGDA PAGLSREQIV DASRVQELPA SVEAQLGPGQ DAPDTARTVR
RVLAQELDLA PDLGDEVTLQ EQTFQRLAGG ITELVQGLEM SPWRLRVIGA AGSGKTIAAI
EFFEAAQARG ERPALVCFNR VLGDRLRARL EGNADVGNFH RLCHAWLEAV GESFDAQRAR
REPQEYWSEV ADRLIEHSER LPCFDRLIVD EGQDFSEEWW ELLRICLVDD DAPVLWLEDP
QQDLYGRNDQ QQSAFVTYRT GKAFRTPRRI AQFVRRLLEV DIDWRNPLDG HKPRVTRYAT
ADEQREALLQ AVEHLESEGF RKDQMVLLSL HGHGRDPLAE TARLGRYRLK RFTGDFTEDG
QPVYSKGDLR VETVYRFKGE QRPAVILMDV DFDGSRPERE QRLLYCALTR ASVACEVLVA
EGSAWRKRLE NAAS