Gene Hhal_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0653 
Symbol 
ID4709915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp733481 
End bp736777 
Gene Length3297 bp 
Protein Length1098 aa 
Translation table11 
GC content65% 
IMG OID639855114 
ProductDnaB domain-containing protein 
Protein accessionYP_001002237 
Protein GI121997450 
COG category[L] Replication, recombination and repair 
COG ID[COG0305] Replicative DNA helicase 
TIGRFAM ID[TIGR00665] replicative DNA helicase
[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGGGTG ACGGATCCGC CTCGGACTCC GAGGCCCTGA AGGTCCCGCC GCACGACCTG 
GAGGCCGAAC AGGCGGTGCT GGGCGGGCTG ATGCTCGACA ACGCCGCCTG GGACCAGGTT
GCCGACCGTC TCCATGAGGA GGATTTCTAC CGCCGCGAGC ATCGCCTGGT CTATCGGGCC
ATGGGCGAGC TCGCGGAGGG GAATCACCCG ATGGATGTGG TCACCCTCTC CGGGTGGCTG
CGCCAGCAGG GCAAGCTTGA GGAGGCCGGT GGCCTCAGTT ACCTCGGCGG GATCGCCCGC
GAGACCCCAT CGGCGGCTAA CATCCGCGCC TATGCGGACA TCGTCCGCGA GCGCTCGGTT
CTCCGGCAGC TGATCCGGGC CGGCTCGGAT GTCGCCGAGG CGGCTTTCCG ACCCCAGGGC
CGTAACAGCG AGGACCTGCT CGACTACGCC GAGCAGACCA TCTTCCAGAT CGCCGAGCAG
ACCGGCCGGA ATCGGCAGGG CTTTGTCGGC ATGCGGCAGC TCATGCCGCA GGTCATCGAC
CGCATCGACA CCCTCTACCA CACCCAGGAG GCCGTCACCG GGCTGGCGAC CGGTTTCGAC
GACCTCGACC ACATGACCTC CGGCCTGCAG GACGGCGATC TCGTCATCGT CGCCGGGCGT
CCGTCCATGG GGAAGTGCCT CGCGTACGAC GCCGAGATTG TCCAGGCCGA TGGTGGCGTG
AAGACCATCG AGCAGATTGT CCGTGAGCGG CGAGCCCACC TGGCGACGGT GGGGGCCGAC
TGGCGACTGA CCTGGACCGA GCCCTGTGAT TATGTCGATG ACGGCCACAA GCCGGTGTTC
GAGGTGACGA CACGGCTCGG GCGGCGGATC GAGACCACCC TGACCCACCC GTTTCTGACG
GTCCACGGCT GGCAGCGGCT CGAGGATCTT GCCGAGGGCG ACGCCATCGG TGTGCCCCGT
CAACTGCCCG TCTTTGGGCA GGAGCCGATA CGCGACTGCG AGGTCCGGCT GCTTGGGCAC
CTGATTGGCG ATGGCGGGTT GACCGGTTCT CCGCCCCGGT TGACCAGTGG TCAAGAGGCG
ATGACCGCCG ACTTCCTGGA AGCCGTGGAC GCCTTTGGCG GCGTTGAGGC GAAGCCGATC
CGGGCCAGTC GCCGGACGCA GAGTTGGGTC GTGGTCGGTG CCGCGCAGGC ACGCGCAGCG
GCCCGGTCCA GCTTTGCGTC GCTTGTCGAC GCCCTGATTC GACGGTCGCC GCTCACGGGT
CGGGCTATCG CCCGAAACCT CGGGGTCGCC CCGGCAACAC TGACGTACTG GCGGCAGGGC
GTGAATGTGC CGGATGCCGC GATGGTCGGG CTGCTTGCTG GAGAACTCGG CGTCGATGTC
GGGGAACTGC GGCCAGAGCC CGTGGCCCGG CGCAACGATC GGAATCCGCT TCAGGCCTGG
CTTGATCGGC TTGGGTTGGC GGGCAAGTCG GCTCATGAAA AGACCGTCCC GGACTGCGTA
TTCCGGTTGC CGCGTGAGCA GCTGGCCCGT TTCCTCAATC GGCTGTTTTC ATCGGATGGG
TGGGTTACGC ACCTAGCCAG CGGCCAGGGG CAGATCGGAT ACACCACCGT TAGTGAGGCG
CTGGCGCGAC AGATCCAGCA TCTGCTGCTG CGCTTCGGGG TGCTGGCCAA ACTGCGGCAT
CGCTCCGTGC GCTACCAAGA CGGCCGACGG CCAGCCTGGC AACTGGATAT TACCCACGCC
GAATCGATCC TCACGTTCGC GGAGCAGATC GGCATCCTGG GTAAGGAACA GCGCCTGGCT
TCTGTGGCGG CCTCGGTGCG CGGCCGGCGC CGGCAGAGTC ATACCGATCA TATCCCGTGC
GAGATCTGGC AGTTCATTGA TCGAGCCCGG GGCGAATGGA CCTGGGCTGA ATTGGCCCGG
CGAGCCGGGG TGGCGTCGTC GAATATCCAT GCGTATCGGC GCGGCATGAG TCGCCAGCGT
CTGGCAGCGT TCGCCGATGC ACTGGGTTCT CGTGAGTTGA GGCAACTGGC TAGCAGCGAT
CTCTACTGGG ATCGCATTGC TTCTATTCGG CCGCTGGGTC ACAAACAGGT CTACGACCTG
ACCATTCCCG AGACCCACAA CTTCATCGCC AATGACGTGT GCGTGCACAA CACGACCGTG
GCGATGAACA TGGTCGAGCA CGTGGCGATG CAGCTGAAGA AGCCGGTGGC CGTGTTCTCC
ATGGAGATGC CAGCGGACGC CTTGGCGATG CGCATGCTGG CCTCGCTTGG GCGCGTGCAC
CTGCAGCGGG TGCGCTCGGG CAAGCTGCAG GACGACGACT GGCCCCGGTT GACCTCGACC
ATGAGCCTGC TCGCCGAGGC ACCACTTTTT ATCGACGATT CACCGGGGCT GACCCCGACG
GAGATCCGTG CACGTGCGCG GCGCCTGCAG CGCGAGCACG ACGAGCTCGG GTTGATTGTC
ATCGACTACC TGCAGCTCAT GCAGATCCCC GGCTTCCGCG AGAACCGCGC CGGCGAGCTC
TCGGAGATCT CCCGCGGGCT CAAGGCGCTG GCCAAGGAGC TGAACACGCC GGTGATCGCC
CTTTCGCAGC TCAACCGCTC GCTGGAACAG CGGCCGAACA AGCGGCCGAT TATGTCGGAT
CTGCGTGAGT GCGTGACCGG GGACACGCTG GTACTCCTCG CTGATGGACA GCGGAGACCC
ATTTCAGAGT TGGTCGGGAG CGCGCCAGAA GTTATCGCTA TTGATGATCG CCACCGCTTG
GTGCCCGCGC AGGCCGAGCG GGTCTGGCGA GTCGGTCGGC GCCCGGTGTA TCGCGTACAA
CTGGCGAGCG GCAGGTTGCT GCGTGCAACG GCGCGGCACC GTCTTCTAAC GGGGAGTGGC
TGGAAACGCG TTGATGAGCT CCGTGATGAA GACCGCATCG CGATTGCTCG GACGGTGCCC
GAGCCCGGAA GTGTAATGGA GTCCGAAAGC GATGTATTTT GGGACCATCT AGTCGCTGTC
GAACCAGATG GGGAAGAGGA CGTCTACGAC CTGACGGTTC CGGGGCCTGC ATCTTGGGTG
GCAGACAGTA TTATCAGTCA CAACTCTGGC GCCATCGAAC AGGACGCGGA CCTGATCGCG
TTCATCTACC GCGATGAGGT CTACAACGAG GACACCCCCG ACAAGGGCGT GGCCGAGCTG
ATCATCGCCA AGCAGCGGCA GGGGCCGATC GGGACGGTCA AGCTCACCTT CCTGGGTGAA
TACACCCGGT TCGAGAACTA CATCGAAGAC GTCTACGGCG GAGGGATCCC GGGGTGA
 
Protein sequence
MQGDGSASDS EALKVPPHDL EAEQAVLGGL MLDNAAWDQV ADRLHEEDFY RREHRLVYRA 
MGELAEGNHP MDVVTLSGWL RQQGKLEEAG GLSYLGGIAR ETPSAANIRA YADIVRERSV
LRQLIRAGSD VAEAAFRPQG RNSEDLLDYA EQTIFQIAEQ TGRNRQGFVG MRQLMPQVID
RIDTLYHTQE AVTGLATGFD DLDHMTSGLQ DGDLVIVAGR PSMGKCLAYD AEIVQADGGV
KTIEQIVRER RAHLATVGAD WRLTWTEPCD YVDDGHKPVF EVTTRLGRRI ETTLTHPFLT
VHGWQRLEDL AEGDAIGVPR QLPVFGQEPI RDCEVRLLGH LIGDGGLTGS PPRLTSGQEA
MTADFLEAVD AFGGVEAKPI RASRRTQSWV VVGAAQARAA ARSSFASLVD ALIRRSPLTG
RAIARNLGVA PATLTYWRQG VNVPDAAMVG LLAGELGVDV GELRPEPVAR RNDRNPLQAW
LDRLGLAGKS AHEKTVPDCV FRLPREQLAR FLNRLFSSDG WVTHLASGQG QIGYTTVSEA
LARQIQHLLL RFGVLAKLRH RSVRYQDGRR PAWQLDITHA ESILTFAEQI GILGKEQRLA
SVAASVRGRR RQSHTDHIPC EIWQFIDRAR GEWTWAELAR RAGVASSNIH AYRRGMSRQR
LAAFADALGS RELRQLASSD LYWDRIASIR PLGHKQVYDL TIPETHNFIA NDVCVHNTTV
AMNMVEHVAM QLKKPVAVFS MEMPADALAM RMLASLGRVH LQRVRSGKLQ DDDWPRLTST
MSLLAEAPLF IDDSPGLTPT EIRARARRLQ REHDELGLIV IDYLQLMQIP GFRENRAGEL
SEISRGLKAL AKELNTPVIA LSQLNRSLEQ RPNKRPIMSD LRECVTGDTL VLLADGQRRP
ISELVGSAPE VIAIDDRHRL VPAQAERVWR VGRRPVYRVQ LASGRLLRAT ARHRLLTGSG
WKRVDELRDE DRIAIARTVP EPGSVMESES DVFWDHLVAV EPDGEEDVYD LTVPGPASWV
ADSIISHNSG AIEQDADLIA FIYRDEVYNE DTPDKGVAEL IIAKQRQGPI GTVKLTFLGE
YTRFENYIED VYGGGIPG