Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0463 |
Symbol | |
ID | 4711510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 530284 |
End bp | 533418 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639854922 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_001002053 |
Protein GI | 121997266 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTG TACAAACAAC CACATCTCCG GCGTACGCCG AACTCCACTG CCTGAGCGCC TTCACCTTCC GCCGCGGCGC CTCGCAGCCC GAGGAGCTGG TCGCCCGGGC CGCACAACTG GACTACCACG CCCTGGCCAT CACCGACGAG TGCTCGGTGG CCGGGGCCGT ACGCGCCTGG CAGGCGGCAC GGAGCCACGG CCTGCACTTG ATCGTCGGCA CCGAACTGCG CCCGGCCGAT GGCCCGCGGG TCGTCCTCCT CGCCCCCACC CGTCAGGCCT ACGGACAGCT CTGCGCCCTG ATCAGCCTGG CCCGCGGCCG CGCGGCCAAG GGCGGCTACC ACCTCACCCG GGAGGACCTG GACAGCGGCG CACCGGACTG CCTGGGGCTG CTCATCCCGG ACGCCCCCCA CCAGGAAGGG ACCGAGACCG AGGCCCAGAC CCGCTGGTTC GCCCGCCGCT TTGCCGGACG CGGTTACCTG GCGGTGGAAC TACTCGGCGG CCCCGACGAT GCCGCCCGCT GCCGGATGCT TGAGGCCCGT GCCGCCGCCG CGGGCCTGCC GGCGGTCGCG GCCACCGGCG CCGAGATGCA CACCCGGGGC CGCCGGGCCC TGCACGACAC CCTGACCGCC ATCCGCCACG GCCGCCCGGT CGCCGAGGTC ACCGCCCACC TGGCCAGTAA CGGCGAGCAC CACCTGCGCC GGCGTCCGGC CCTGGCCGAA CGCTACCCCC ACCACCTGCT CGCACAGAGC ACCGCCATCG CCGACCGCTG CACCTTCTCG CTGGACGAGT TGCGCTACGA GTACCCCGAG GAAGTCGTTC CCGAGGGCAC GACACCGACG GCACACCTGC GCCAGCTGGT CGAGACCGGG GCCCGCCGGC GCTATCCCGA CGGGGTGCCG GCAGCCGTGC AGGCCGGCTA CGAACGCGAA CTGGCGATCA TTGCCGAGCT CGGCTACGAG CACTACTTCC TGACCGTCCA CGATCTCGTC GCCTTCGCCC GCGGGCGGGG CATCCTCTGC CAGGGGCGCG GCTCGGCGGC CAACTCCGTC GTCTGCTTCT GCCTGGGGAT CACCGAAGTG GATCCGGCGC ACCAGTCGGT GCTCTTCGAG CGCTTCGTCT CCCGCGAACG TGGCGAGCCG CCGGATATCG ACGTGGACTT CGAGCACCAC CGGCGCGAAG AGGTCATCCA GTACATCTAC CAGCGCTACG GCCGCCACCG GGCCGCTCTG GCGGCCACGG TCATCGCCTA CCGGCCCCGC TCGGCCGTGC GGGACGTGGC CAAGGCCCTG GGCCTGGAGC GCGAGAGCGT CGAGCGCCTG GCCAAGGGGC TGCAGCCCTG GGACCACTCC GCCGCCAACG ACGAGCAGCT CCGCCAGGCC GGGCTCAACC CCGACGGCCC CATTGCGCGC CGACTGCGGA TACTGGCCGG GGAGCTCCTC GGTTTCCCGC GCCACCTCTC GCAGCACGTC GGCGGGTTTG TCATCGCCCG CTGCCCAATC ACCGAGCTGG TACCGGTGGA ACCAGCCGCC ATGGACGGGC GCACGGTGAT CCAGTGGGAC AAGGACGACC TGGAGGCGCT CGGACTGCTC AAGGTGGATG TCCTGGCCCT GGGCATGCTC AGCGTCATCC GCCGCGCCTT CGAGCTCATC CACCACTGGC GCGGCCATGC CTGGACCCTG GCCACTCTGC CGCCGGCCGA TCCGGCCACC TACGCCATGA TCCAGCGCGC TGACACCCTG GGGGTCTTCC AGATCGAGTC GCGGGCGCAG ATGGCCATGC TGCCGCGGAT GCGCCCGCGC TGCTTCTACG ACCTGGTCAT CGAGATCTCC ATCGTCCGCC CCGGCCCCAT CCAGGGCGAT ATGGTCCACC CCTACCTGCG CCGGCGCGAG GGCAGCGAGC CGGTGGACTA CCCATCCGAG GAGGTCCGTG GAGTTCTGGA GCGCACCCTC GGGGTACCGA TCTTCCAGGA GCAGGCCATG GAGTTGGCCG TCGTGGCCGC CGGCTTCACC CCGGACGAGG CCGACCGGCT GCGTCGCTCC ATGGCCGCCT GGCGCAAGAA GGGCGGACTG GCACCGCTGC GTGAGCGGCT CATCGAGGGG ATGCGCGAGC GCGGCTACTC CAGCGCCTAC GCCGAACGCC TCTACCGGCA GATCGAGGGC TTCGGCGAGT ACGGCTTCCC GGAGTCCCAC GCCGCCAGCT TCGCGCTGCT CGTCTACGCC TCGGCCTGGC TCAAGTGCCA TGAGCCGGCG GCCTTCGCCG CCGCCCTGCT CAACAGCCAG CCGATGGGCT TCTACGCCCC GGCGCAGATC ATCCGCGACA CCCGGGAGCA CGACGTGGCG GTGCGCCCGG TGGACGTGCG CTTCAGCGAG CCGGCGTGCA CGCTCGAACC CCCGCAGCCG CCGGACAGCC CATCAGGCGC CGGCTCGCAA CCGGCCCTCC GCCTGGGGCT GAACCAGATC CGCGGCCTGT CCACCGCCGG CGCCGAGCGG ATCGCCGCCG CCCGCTCCCG GGCGCCCTTC ACCGGGGTCA ACGACCTCAA GCGCCGCGCG CACCTGTCCC GGGCCGATAT TCACGCCCTG GCCCGCGCCG ACGCGCTGCG CGGCATCGCC GGGCACCGCC GGGCGGCCAG CTGGGCAGCC CTGGGCGCCG ACGAGGGGAC CGCCCTGATC CCTCCCCCGC CGGCCGAGGC CGGCCGACCG AGCCTCGCCC CCGCCCGCGA GTCCCAGGCA GTCCTGGCCG ACTACGCCAG CACCGGCTTC ACCCTGCGCC GCCATCCGCT CGCGTTCCTG CGCCGGCAGC TCCGGGCTCG GCGCTACCGC AGTGCCGCCG AGCTCGCCAC CGCAGAGGAC GGCCGTTCGA TACGGGCTGC CGGGCTGGTG ATCAACCGCC AGCGCCCCGG CTCGGCCGGG GGGATCACCT TCGTTACGCT GGAGGATGAG ACCGGTCGGA TCAATCTGGT GGTGCGCCGC GCCACCGCCG AGGCCCAGTC CCGCCCCTTG CTCGAGGCCC GGCTTCTGGG GGTGGCCGGC ACCTGGCAGT ACCGGCACGG CGTCGGGCAC CTGATCGCCG GCCGGCTGGA GGATCTGTCG GGACTGATCG GCGAACTGGA CGTCCGCTCC CGCGACTTCG GCTAA
|
Protein sequence | MTTVQTTTSP AYAELHCLSA FTFRRGASQP EELVARAAQL DYHALAITDE CSVAGAVRAW QAARSHGLHL IVGTELRPAD GPRVVLLAPT RQAYGQLCAL ISLARGRAAK GGYHLTREDL DSGAPDCLGL LIPDAPHQEG TETEAQTRWF ARRFAGRGYL AVELLGGPDD AARCRMLEAR AAAAGLPAVA ATGAEMHTRG RRALHDTLTA IRHGRPVAEV TAHLASNGEH HLRRRPALAE RYPHHLLAQS TAIADRCTFS LDELRYEYPE EVVPEGTTPT AHLRQLVETG ARRRYPDGVP AAVQAGYERE LAIIAELGYE HYFLTVHDLV AFARGRGILC QGRGSAANSV VCFCLGITEV DPAHQSVLFE RFVSRERGEP PDIDVDFEHH RREEVIQYIY QRYGRHRAAL AATVIAYRPR SAVRDVAKAL GLERESVERL AKGLQPWDHS AANDEQLRQA GLNPDGPIAR RLRILAGELL GFPRHLSQHV GGFVIARCPI TELVPVEPAA MDGRTVIQWD KDDLEALGLL KVDVLALGML SVIRRAFELI HHWRGHAWTL ATLPPADPAT YAMIQRADTL GVFQIESRAQ MAMLPRMRPR CFYDLVIEIS IVRPGPIQGD MVHPYLRRRE GSEPVDYPSE EVRGVLERTL GVPIFQEQAM ELAVVAAGFT PDEADRLRRS MAAWRKKGGL APLRERLIEG MRERGYSSAY AERLYRQIEG FGEYGFPESH AASFALLVYA SAWLKCHEPA AFAAALLNSQ PMGFYAPAQI IRDTREHDVA VRPVDVRFSE PACTLEPPQP PDSPSGAGSQ PALRLGLNQI RGLSTAGAER IAAARSRAPF TGVNDLKRRA HLSRADIHAL ARADALRGIA GHRRAASWAA LGADEGTALI PPPPAEAGRP SLAPARESQA VLADYASTGF TLRRHPLAFL RRQLRARRYR SAAELATAED GRSIRAAGLV INRQRPGSAG GITFVTLEDE TGRINLVVRR ATAEAQSRPL LEARLLGVAG TWQYRHGVGH LIAGRLEDLS GLIGELDVRS RDFG
|
| |