Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0653 |
Symbol | |
ID | 4709915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 733481 |
End bp | 736777 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639855114 |
Product | DnaB domain-containing protein |
Protein accession | YP_001002237 |
Protein GI | 121997450 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0305] Replicative DNA helicase |
TIGRFAM ID | [TIGR00665] replicative DNA helicase [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGGGTG ACGGATCCGC CTCGGACTCC GAGGCCCTGA AGGTCCCGCC GCACGACCTG GAGGCCGAAC AGGCGGTGCT GGGCGGGCTG ATGCTCGACA ACGCCGCCTG GGACCAGGTT GCCGACCGTC TCCATGAGGA GGATTTCTAC CGCCGCGAGC ATCGCCTGGT CTATCGGGCC ATGGGCGAGC TCGCGGAGGG GAATCACCCG ATGGATGTGG TCACCCTCTC CGGGTGGCTG CGCCAGCAGG GCAAGCTTGA GGAGGCCGGT GGCCTCAGTT ACCTCGGCGG GATCGCCCGC GAGACCCCAT CGGCGGCTAA CATCCGCGCC TATGCGGACA TCGTCCGCGA GCGCTCGGTT CTCCGGCAGC TGATCCGGGC CGGCTCGGAT GTCGCCGAGG CGGCTTTCCG ACCCCAGGGC CGTAACAGCG AGGACCTGCT CGACTACGCC GAGCAGACCA TCTTCCAGAT CGCCGAGCAG ACCGGCCGGA ATCGGCAGGG CTTTGTCGGC ATGCGGCAGC TCATGCCGCA GGTCATCGAC CGCATCGACA CCCTCTACCA CACCCAGGAG GCCGTCACCG GGCTGGCGAC CGGTTTCGAC GACCTCGACC ACATGACCTC CGGCCTGCAG GACGGCGATC TCGTCATCGT CGCCGGGCGT CCGTCCATGG GGAAGTGCCT CGCGTACGAC GCCGAGATTG TCCAGGCCGA TGGTGGCGTG AAGACCATCG AGCAGATTGT CCGTGAGCGG CGAGCCCACC TGGCGACGGT GGGGGCCGAC TGGCGACTGA CCTGGACCGA GCCCTGTGAT TATGTCGATG ACGGCCACAA GCCGGTGTTC GAGGTGACGA CACGGCTCGG GCGGCGGATC GAGACCACCC TGACCCACCC GTTTCTGACG GTCCACGGCT GGCAGCGGCT CGAGGATCTT GCCGAGGGCG ACGCCATCGG TGTGCCCCGT CAACTGCCCG TCTTTGGGCA GGAGCCGATA CGCGACTGCG AGGTCCGGCT GCTTGGGCAC CTGATTGGCG ATGGCGGGTT GACCGGTTCT CCGCCCCGGT TGACCAGTGG TCAAGAGGCG ATGACCGCCG ACTTCCTGGA AGCCGTGGAC GCCTTTGGCG GCGTTGAGGC GAAGCCGATC CGGGCCAGTC GCCGGACGCA GAGTTGGGTC GTGGTCGGTG CCGCGCAGGC ACGCGCAGCG GCCCGGTCCA GCTTTGCGTC GCTTGTCGAC GCCCTGATTC GACGGTCGCC GCTCACGGGT CGGGCTATCG CCCGAAACCT CGGGGTCGCC CCGGCAACAC TGACGTACTG GCGGCAGGGC GTGAATGTGC CGGATGCCGC GATGGTCGGG CTGCTTGCTG GAGAACTCGG CGTCGATGTC GGGGAACTGC GGCCAGAGCC CGTGGCCCGG CGCAACGATC GGAATCCGCT TCAGGCCTGG CTTGATCGGC TTGGGTTGGC GGGCAAGTCG GCTCATGAAA AGACCGTCCC GGACTGCGTA TTCCGGTTGC CGCGTGAGCA GCTGGCCCGT TTCCTCAATC GGCTGTTTTC ATCGGATGGG TGGGTTACGC ACCTAGCCAG CGGCCAGGGG CAGATCGGAT ACACCACCGT TAGTGAGGCG CTGGCGCGAC AGATCCAGCA TCTGCTGCTG CGCTTCGGGG TGCTGGCCAA ACTGCGGCAT CGCTCCGTGC GCTACCAAGA CGGCCGACGG CCAGCCTGGC AACTGGATAT TACCCACGCC GAATCGATCC TCACGTTCGC GGAGCAGATC GGCATCCTGG GTAAGGAACA GCGCCTGGCT TCTGTGGCGG CCTCGGTGCG CGGCCGGCGC CGGCAGAGTC ATACCGATCA TATCCCGTGC GAGATCTGGC AGTTCATTGA TCGAGCCCGG GGCGAATGGA CCTGGGCTGA ATTGGCCCGG CGAGCCGGGG TGGCGTCGTC GAATATCCAT GCGTATCGGC GCGGCATGAG TCGCCAGCGT CTGGCAGCGT TCGCCGATGC ACTGGGTTCT CGTGAGTTGA GGCAACTGGC TAGCAGCGAT CTCTACTGGG ATCGCATTGC TTCTATTCGG CCGCTGGGTC ACAAACAGGT CTACGACCTG ACCATTCCCG AGACCCACAA CTTCATCGCC AATGACGTGT GCGTGCACAA CACGACCGTG GCGATGAACA TGGTCGAGCA CGTGGCGATG CAGCTGAAGA AGCCGGTGGC CGTGTTCTCC ATGGAGATGC CAGCGGACGC CTTGGCGATG CGCATGCTGG CCTCGCTTGG GCGCGTGCAC CTGCAGCGGG TGCGCTCGGG CAAGCTGCAG GACGACGACT GGCCCCGGTT GACCTCGACC ATGAGCCTGC TCGCCGAGGC ACCACTTTTT ATCGACGATT CACCGGGGCT GACCCCGACG GAGATCCGTG CACGTGCGCG GCGCCTGCAG CGCGAGCACG ACGAGCTCGG GTTGATTGTC ATCGACTACC TGCAGCTCAT GCAGATCCCC GGCTTCCGCG AGAACCGCGC CGGCGAGCTC TCGGAGATCT CCCGCGGGCT CAAGGCGCTG GCCAAGGAGC TGAACACGCC GGTGATCGCC CTTTCGCAGC TCAACCGCTC GCTGGAACAG CGGCCGAACA AGCGGCCGAT TATGTCGGAT CTGCGTGAGT GCGTGACCGG GGACACGCTG GTACTCCTCG CTGATGGACA GCGGAGACCC ATTTCAGAGT TGGTCGGGAG CGCGCCAGAA GTTATCGCTA TTGATGATCG CCACCGCTTG GTGCCCGCGC AGGCCGAGCG GGTCTGGCGA GTCGGTCGGC GCCCGGTGTA TCGCGTACAA CTGGCGAGCG GCAGGTTGCT GCGTGCAACG GCGCGGCACC GTCTTCTAAC GGGGAGTGGC TGGAAACGCG TTGATGAGCT CCGTGATGAA GACCGCATCG CGATTGCTCG GACGGTGCCC GAGCCCGGAA GTGTAATGGA GTCCGAAAGC GATGTATTTT GGGACCATCT AGTCGCTGTC GAACCAGATG GGGAAGAGGA CGTCTACGAC CTGACGGTTC CGGGGCCTGC ATCTTGGGTG GCAGACAGTA TTATCAGTCA CAACTCTGGC GCCATCGAAC AGGACGCGGA CCTGATCGCG TTCATCTACC GCGATGAGGT CTACAACGAG GACACCCCCG ACAAGGGCGT GGCCGAGCTG ATCATCGCCA AGCAGCGGCA GGGGCCGATC GGGACGGTCA AGCTCACCTT CCTGGGTGAA TACACCCGGT TCGAGAACTA CATCGAAGAC GTCTACGGCG GAGGGATCCC GGGGTGA
|
Protein sequence | MQGDGSASDS EALKVPPHDL EAEQAVLGGL MLDNAAWDQV ADRLHEEDFY RREHRLVYRA MGELAEGNHP MDVVTLSGWL RQQGKLEEAG GLSYLGGIAR ETPSAANIRA YADIVRERSV LRQLIRAGSD VAEAAFRPQG RNSEDLLDYA EQTIFQIAEQ TGRNRQGFVG MRQLMPQVID RIDTLYHTQE AVTGLATGFD DLDHMTSGLQ DGDLVIVAGR PSMGKCLAYD AEIVQADGGV KTIEQIVRER RAHLATVGAD WRLTWTEPCD YVDDGHKPVF EVTTRLGRRI ETTLTHPFLT VHGWQRLEDL AEGDAIGVPR QLPVFGQEPI RDCEVRLLGH LIGDGGLTGS PPRLTSGQEA MTADFLEAVD AFGGVEAKPI RASRRTQSWV VVGAAQARAA ARSSFASLVD ALIRRSPLTG RAIARNLGVA PATLTYWRQG VNVPDAAMVG LLAGELGVDV GELRPEPVAR RNDRNPLQAW LDRLGLAGKS AHEKTVPDCV FRLPREQLAR FLNRLFSSDG WVTHLASGQG QIGYTTVSEA LARQIQHLLL RFGVLAKLRH RSVRYQDGRR PAWQLDITHA ESILTFAEQI GILGKEQRLA SVAASVRGRR RQSHTDHIPC EIWQFIDRAR GEWTWAELAR RAGVASSNIH AYRRGMSRQR LAAFADALGS RELRQLASSD LYWDRIASIR PLGHKQVYDL TIPETHNFIA NDVCVHNTTV AMNMVEHVAM QLKKPVAVFS MEMPADALAM RMLASLGRVH LQRVRSGKLQ DDDWPRLTST MSLLAEAPLF IDDSPGLTPT EIRARARRLQ REHDELGLIV IDYLQLMQIP GFRENRAGEL SEISRGLKAL AKELNTPVIA LSQLNRSLEQ RPNKRPIMSD LRECVTGDTL VLLADGQRRP ISELVGSAPE VIAIDDRHRL VPAQAERVWR VGRRPVYRVQ LASGRLLRAT ARHRLLTGSG WKRVDELRDE DRIAIARTVP EPGSVMESES DVFWDHLVAV EPDGEEDVYD LTVPGPASWV ADSIISHNSG AIEQDADLIA FIYRDEVYNE DTPDKGVAEL IIAKQRQGPI GTVKLTFLGE YTRFENYIED VYGGGIPG
|
| |