Gene Hhal_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0463 
Symbol 
ID4711510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp530284 
End bp533418 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content73% 
IMG OID639854922 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_001002053 
Protein GI121997266 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTG TACAAACAAC CACATCTCCG GCGTACGCCG AACTCCACTG CCTGAGCGCC 
TTCACCTTCC GCCGCGGCGC CTCGCAGCCC GAGGAGCTGG TCGCCCGGGC CGCACAACTG
GACTACCACG CCCTGGCCAT CACCGACGAG TGCTCGGTGG CCGGGGCCGT ACGCGCCTGG
CAGGCGGCAC GGAGCCACGG CCTGCACTTG ATCGTCGGCA CCGAACTGCG CCCGGCCGAT
GGCCCGCGGG TCGTCCTCCT CGCCCCCACC CGTCAGGCCT ACGGACAGCT CTGCGCCCTG
ATCAGCCTGG CCCGCGGCCG CGCGGCCAAG GGCGGCTACC ACCTCACCCG GGAGGACCTG
GACAGCGGCG CACCGGACTG CCTGGGGCTG CTCATCCCGG ACGCCCCCCA CCAGGAAGGG
ACCGAGACCG AGGCCCAGAC CCGCTGGTTC GCCCGCCGCT TTGCCGGACG CGGTTACCTG
GCGGTGGAAC TACTCGGCGG CCCCGACGAT GCCGCCCGCT GCCGGATGCT TGAGGCCCGT
GCCGCCGCCG CGGGCCTGCC GGCGGTCGCG GCCACCGGCG CCGAGATGCA CACCCGGGGC
CGCCGGGCCC TGCACGACAC CCTGACCGCC ATCCGCCACG GCCGCCCGGT CGCCGAGGTC
ACCGCCCACC TGGCCAGTAA CGGCGAGCAC CACCTGCGCC GGCGTCCGGC CCTGGCCGAA
CGCTACCCCC ACCACCTGCT CGCACAGAGC ACCGCCATCG CCGACCGCTG CACCTTCTCG
CTGGACGAGT TGCGCTACGA GTACCCCGAG GAAGTCGTTC CCGAGGGCAC GACACCGACG
GCACACCTGC GCCAGCTGGT CGAGACCGGG GCCCGCCGGC GCTATCCCGA CGGGGTGCCG
GCAGCCGTGC AGGCCGGCTA CGAACGCGAA CTGGCGATCA TTGCCGAGCT CGGCTACGAG
CACTACTTCC TGACCGTCCA CGATCTCGTC GCCTTCGCCC GCGGGCGGGG CATCCTCTGC
CAGGGGCGCG GCTCGGCGGC CAACTCCGTC GTCTGCTTCT GCCTGGGGAT CACCGAAGTG
GATCCGGCGC ACCAGTCGGT GCTCTTCGAG CGCTTCGTCT CCCGCGAACG TGGCGAGCCG
CCGGATATCG ACGTGGACTT CGAGCACCAC CGGCGCGAAG AGGTCATCCA GTACATCTAC
CAGCGCTACG GCCGCCACCG GGCCGCTCTG GCGGCCACGG TCATCGCCTA CCGGCCCCGC
TCGGCCGTGC GGGACGTGGC CAAGGCCCTG GGCCTGGAGC GCGAGAGCGT CGAGCGCCTG
GCCAAGGGGC TGCAGCCCTG GGACCACTCC GCCGCCAACG ACGAGCAGCT CCGCCAGGCC
GGGCTCAACC CCGACGGCCC CATTGCGCGC CGACTGCGGA TACTGGCCGG GGAGCTCCTC
GGTTTCCCGC GCCACCTCTC GCAGCACGTC GGCGGGTTTG TCATCGCCCG CTGCCCAATC
ACCGAGCTGG TACCGGTGGA ACCAGCCGCC ATGGACGGGC GCACGGTGAT CCAGTGGGAC
AAGGACGACC TGGAGGCGCT CGGACTGCTC AAGGTGGATG TCCTGGCCCT GGGCATGCTC
AGCGTCATCC GCCGCGCCTT CGAGCTCATC CACCACTGGC GCGGCCATGC CTGGACCCTG
GCCACTCTGC CGCCGGCCGA TCCGGCCACC TACGCCATGA TCCAGCGCGC TGACACCCTG
GGGGTCTTCC AGATCGAGTC GCGGGCGCAG ATGGCCATGC TGCCGCGGAT GCGCCCGCGC
TGCTTCTACG ACCTGGTCAT CGAGATCTCC ATCGTCCGCC CCGGCCCCAT CCAGGGCGAT
ATGGTCCACC CCTACCTGCG CCGGCGCGAG GGCAGCGAGC CGGTGGACTA CCCATCCGAG
GAGGTCCGTG GAGTTCTGGA GCGCACCCTC GGGGTACCGA TCTTCCAGGA GCAGGCCATG
GAGTTGGCCG TCGTGGCCGC CGGCTTCACC CCGGACGAGG CCGACCGGCT GCGTCGCTCC
ATGGCCGCCT GGCGCAAGAA GGGCGGACTG GCACCGCTGC GTGAGCGGCT CATCGAGGGG
ATGCGCGAGC GCGGCTACTC CAGCGCCTAC GCCGAACGCC TCTACCGGCA GATCGAGGGC
TTCGGCGAGT ACGGCTTCCC GGAGTCCCAC GCCGCCAGCT TCGCGCTGCT CGTCTACGCC
TCGGCCTGGC TCAAGTGCCA TGAGCCGGCG GCCTTCGCCG CCGCCCTGCT CAACAGCCAG
CCGATGGGCT TCTACGCCCC GGCGCAGATC ATCCGCGACA CCCGGGAGCA CGACGTGGCG
GTGCGCCCGG TGGACGTGCG CTTCAGCGAG CCGGCGTGCA CGCTCGAACC CCCGCAGCCG
CCGGACAGCC CATCAGGCGC CGGCTCGCAA CCGGCCCTCC GCCTGGGGCT GAACCAGATC
CGCGGCCTGT CCACCGCCGG CGCCGAGCGG ATCGCCGCCG CCCGCTCCCG GGCGCCCTTC
ACCGGGGTCA ACGACCTCAA GCGCCGCGCG CACCTGTCCC GGGCCGATAT TCACGCCCTG
GCCCGCGCCG ACGCGCTGCG CGGCATCGCC GGGCACCGCC GGGCGGCCAG CTGGGCAGCC
CTGGGCGCCG ACGAGGGGAC CGCCCTGATC CCTCCCCCGC CGGCCGAGGC CGGCCGACCG
AGCCTCGCCC CCGCCCGCGA GTCCCAGGCA GTCCTGGCCG ACTACGCCAG CACCGGCTTC
ACCCTGCGCC GCCATCCGCT CGCGTTCCTG CGCCGGCAGC TCCGGGCTCG GCGCTACCGC
AGTGCCGCCG AGCTCGCCAC CGCAGAGGAC GGCCGTTCGA TACGGGCTGC CGGGCTGGTG
ATCAACCGCC AGCGCCCCGG CTCGGCCGGG GGGATCACCT TCGTTACGCT GGAGGATGAG
ACCGGTCGGA TCAATCTGGT GGTGCGCCGC GCCACCGCCG AGGCCCAGTC CCGCCCCTTG
CTCGAGGCCC GGCTTCTGGG GGTGGCCGGC ACCTGGCAGT ACCGGCACGG CGTCGGGCAC
CTGATCGCCG GCCGGCTGGA GGATCTGTCG GGACTGATCG GCGAACTGGA CGTCCGCTCC
CGCGACTTCG GCTAA
 
Protein sequence
MTTVQTTTSP AYAELHCLSA FTFRRGASQP EELVARAAQL DYHALAITDE CSVAGAVRAW 
QAARSHGLHL IVGTELRPAD GPRVVLLAPT RQAYGQLCAL ISLARGRAAK GGYHLTREDL
DSGAPDCLGL LIPDAPHQEG TETEAQTRWF ARRFAGRGYL AVELLGGPDD AARCRMLEAR
AAAAGLPAVA ATGAEMHTRG RRALHDTLTA IRHGRPVAEV TAHLASNGEH HLRRRPALAE
RYPHHLLAQS TAIADRCTFS LDELRYEYPE EVVPEGTTPT AHLRQLVETG ARRRYPDGVP
AAVQAGYERE LAIIAELGYE HYFLTVHDLV AFARGRGILC QGRGSAANSV VCFCLGITEV
DPAHQSVLFE RFVSRERGEP PDIDVDFEHH RREEVIQYIY QRYGRHRAAL AATVIAYRPR
SAVRDVAKAL GLERESVERL AKGLQPWDHS AANDEQLRQA GLNPDGPIAR RLRILAGELL
GFPRHLSQHV GGFVIARCPI TELVPVEPAA MDGRTVIQWD KDDLEALGLL KVDVLALGML
SVIRRAFELI HHWRGHAWTL ATLPPADPAT YAMIQRADTL GVFQIESRAQ MAMLPRMRPR
CFYDLVIEIS IVRPGPIQGD MVHPYLRRRE GSEPVDYPSE EVRGVLERTL GVPIFQEQAM
ELAVVAAGFT PDEADRLRRS MAAWRKKGGL APLRERLIEG MRERGYSSAY AERLYRQIEG
FGEYGFPESH AASFALLVYA SAWLKCHEPA AFAAALLNSQ PMGFYAPAQI IRDTREHDVA
VRPVDVRFSE PACTLEPPQP PDSPSGAGSQ PALRLGLNQI RGLSTAGAER IAAARSRAPF
TGVNDLKRRA HLSRADIHAL ARADALRGIA GHRRAASWAA LGADEGTALI PPPPAEAGRP
SLAPARESQA VLADYASTGF TLRRHPLAFL RRQLRARRYR SAAELATAED GRSIRAAGLV
INRQRPGSAG GITFVTLEDE TGRINLVVRR ATAEAQSRPL LEARLLGVAG TWQYRHGVGH
LIAGRLEDLS GLIGELDVRS RDFG