Gene Hhal_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0402 
Symbol 
ID4711497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp467277 
End bp470720 
Gene Length3444 bp 
Protein Length1147 aa 
Translation table11 
GC content68% 
IMG OID639854864 
Productpyruvate carboxylase 
Protein accessionYP_001001997 
Protein GI121997210 
COG category[C] Energy production and conversion 
COG ID[COG1038] Pyruvate carboxylase 
TIGRFAM ID[TIGR01235] pyruvate carboxylase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCGT TTCAGAAGAT CCTGATCGCC AATCGCGGTG AGATCGCCAT TCGGGTGATG 
CGTGCGGCCA ATGAGCTCGG CAAGCGCACC GTCGCCATCT ACGCCCAGGA GGACAAGCTC
GGGCTGCACC GCTTCAAGGC AGACGAGGCT TATCAGGTGG GCGAGGGCAT GGGGCCGGTG
GAGGCGTACC TCTCCATCGA CGAGGTCATC CGGGTCGCGA AGATGGCCGG GGCGGACGCC
GTGCACCCGG GCTACGGGCT GCTCTCCGAG AATCCGGAAC TGGTCGACGC CTGCGAGGCC
GCCGGCATCA CCTTCATCGG GCCGCGTGCG GACACCATGC GCGCGCTCGG CGACAAGGCG
AGTGCGCGCC GTGTGGCTAT CGAAGCGGGC GTGCCGGTCA TTCCCGCCTC CGAGGTGCTG
GGCGACGACA TCGAGGCCGC CCGGCGCTGG GCCGACGAGA TCGGTTACCC GATGATGCTC
AAGGCCTCCT GGGGCGGGGG CGGGCGCGGT ATGCGCCCGA TCCGTGAGCC CGAGGAGCTG
GAGGCCCGGG TCCTGGAGGG ACGGCGCGAG GCGGAAGCGG CCTTCGGCAG TGGCGAGGGC
TACCTCGAGA AGATGATCGA GCGGGCTCGC CACGTCGAGG TGCAGGTGCT GGGGGATACC
CACGGCGGTC TCTACCACCT CTTTGAGCGG GACTGCACGG TGCAGCGGCG CAACCAGAAG
GTGGTGGAGC GGGCCCCCGC GCCCTATCTC ACCGAGGCGC AGCGCGCCGA GGTCTGCGAT
CTGGGGCTGA AAGTGGCCCG CCACGTCGGC TACCAGAACG CCGGAACGGT CGAGTTCCTC
ATGGACATGG ACACCGACAC GTTCTACTTC ATTGAGGTGA ACCCGCGCAT CCAGGTGGAA
CACACCGTGA CCGAAGAGGT TACGGGGATC GACATCGTCA AGGCGCAGAT CCGTATCGCG
GAAGGCGAGC ACCTGGCCGC CGCCACCGGC AAGGCCGATC AGGGGGCGCT CTGGCTGAAC
GGTCACGCCA TGCAGTGCCG GGTGACCACC GAGGATCCTC AGAACAACTT CATCCCCGAT
TACGGTCGCA TCACCGCCTA TCGCTCGGCC ACCGGCATGG GGATCCGGCT GGATGGGGGC
ACGGCGTACG CCGGCGGCGT CATCACCCGC TACTACGACT CCCTGCTGGT CAAGGTCACC
GCCTGGGCCC CCACGCCGCA GGAGGCCATC TCGCGGATGG ATCGCGCCCT TCGCGAGTTC
CGCATCCGCG GCGTGTCCAC CAACATCCCC TTCGTCGAGA ATCTGCTCAA GCACCCGGCA
TTCCTGGACA ACAGCTACAC CACGCGCTTC ATCGACACCG CGCCGGAGCT GTTCGATTTC
GACAAGCGCC GGGACCGGGC CACCCGGCTG CTCACCTACC TGGCGGAGAT CACCGTCAAC
GGGCACCCGG AGACCCTCGG TCGTCCCCAG CCGGCGGCGG GTCTGCCGCT GCCGGTGCCG
CCCGAACCCC GGGACGAGCC GGCGCCGGGC ACCCGCAATC TGCTGGAGGC CCAGGGCCCC
CAGGCGGTGG CCGACTGGCT GGCCGGTCGC AAGGAGCTGT TGCTGACCGA CACCACCATG
CGCGACGCCC ACCAGTCGCT GCTGGCGACG CGCATGCGCA GCTTCGACAT GGCGCGGGTC
GCGCCTGCGT ACGCGGCCAA CCTGCCGCAA CTGTTCAGCG TGGAGTGCTG GGGCGGGGCC
ACCTTCGATG TCGCGTACCG TTTCCTGCAG GAGTGTCCCT GGCAGCGGCT GCGGCAGATC
CGCGAGGCCA TGCCGAACGT GATGACGCAG ATGCTCCTGC GCGGTTCCAA CGGCGTGGGC
TACACCAATT ACCCGGACAA CGTGGTCCGT GCCTTCGTCC ACCAGGCCGC CGATTCCGGC
GTGGACGTCT TCCGCGTGTT CGACAGCCTC AACTGGGTCG AGAACATGCG CGTGGCCATG
GATGCGGTGC TGGAGTCGGG GAAGGTGTGC GAGGGGACGC TGTGCTACAC CGGCGATATC
CTCGATCCGG GCAGGGACAA GTACGACCTC AAGTACTATG TGGCGATGGG CAAGGCGCTC
CGCGATGCCG GCGCCCACAT CCTCGGGGTC AAGGACATGG CGGGCCTGCT CAAGCCGGCG
GCGGCGCGGG TGCTGTTCCG GGCGCTCAAG GAGGAGGTGG GGCTGCCCAT CCACTTCCAC
ACCCACGACA CCAGCGGCAT TGCCGGCGCC ACCGTGCTGG CGGCGGCCGA TGCCGGGGTG
GATGTGGCCG ATGTGGCCAT GGACGCCTTC TCCGGGAATA CCTCCCAGCC GGTCTTCGGC
TCCATCGTCG AGGCGCTGCG GCACACGGAG CGGGATACCG GCCTGGACAT GAGCGCGGTC
CGCGAGATCT CCAATTACTG GGAACAGGTG CGTGCCCATT ACGCCGCCTT CGAGACCGGC
CAGCAGTCGC CGTCGTCCGA GGTCTACCTC CACGAGATGC CGGGCGGGCA GTTCACCAAT
CTCAAGGCCC AGGCCCGCTC CCTGGGACTG GAGGAGCGCT GGCACGAGGT GGCCCAGGCC
TACGCCGATG CCAACCAGAT CTTCGGTGAC ATCGTGAAGG TCACCCCCTC ATCCAAGGTG
GTGGGGGACA TGGCGCTGAT GATGGTGAGC CAGGGCCTGA CCCGCGAGCA GGTCGAAGAC
CCGGCAGTGG ACGTGAACTT CCCGGATTCG GTGATCGACA TGCTCCGCGG CAACCTGGGG
CATCCGCCGG GCGGCTGGCC GGAAGGGATC CAGAAGAAGG CGCTCAAGGG GGAACAGCCC
CTGCAGGATC GCCCCGGCAA GTACCTGGAG CCCCTGGACC TGGAGGCGGT TCGGCAGCAG
GCGAGCGACG AGCTCGACGG CGCCGAGATC GACGACGAGG ACCTCAACGG CTACCTCATG
TATCCGAAGG TCTTCACCGA GTATAAGCGC CGTCGTGAGC GCTTCGGTCC GGTGCGCACC
CTGCCGACGC GCAACTTCTT CTACGGCATG GAGGCCGGAG AGGAGATCAG CGTGGACATC
GATCCCGGCA AGACGCTGGA GATCCGTCTG ATGACCGTCA GCGAGCCGGG CGAGGATGGC
GATCGCCGCG TGTTCTTCGA GCTGAACGGC CAGCCCCGCA CCGTGCGCGT GGCGGACAAC
CAGGCCAAGG CGCAGGTGGT GCAGACCCCC AAGGCCGAGG AGGGCAACCC GGCCCACGTC
GGTGCCCCGA CGCCCGGCGT GGTGGCCTCG GTGGCGGCCA CCCCGGGGCA GAACGTGAAG
GCCGGGGATG TGCTGCTGAT CATCGAGGCC ATGAAGATGG AAATGGGGCT GCACGCCGAG
CGGGACGGTG TGGTGAAGGC CGTGCACGTG CAGCCGGGCA GCCAGATCGA GGCCAAGGAT
CTGCTGGTGG AGTTCGAAGC CTAG
 
Protein sequence
MVPFQKILIA NRGEIAIRVM RAANELGKRT VAIYAQEDKL GLHRFKADEA YQVGEGMGPV 
EAYLSIDEVI RVAKMAGADA VHPGYGLLSE NPELVDACEA AGITFIGPRA DTMRALGDKA
SARRVAIEAG VPVIPASEVL GDDIEAARRW ADEIGYPMML KASWGGGGRG MRPIREPEEL
EARVLEGRRE AEAAFGSGEG YLEKMIERAR HVEVQVLGDT HGGLYHLFER DCTVQRRNQK
VVERAPAPYL TEAQRAEVCD LGLKVARHVG YQNAGTVEFL MDMDTDTFYF IEVNPRIQVE
HTVTEEVTGI DIVKAQIRIA EGEHLAAATG KADQGALWLN GHAMQCRVTT EDPQNNFIPD
YGRITAYRSA TGMGIRLDGG TAYAGGVITR YYDSLLVKVT AWAPTPQEAI SRMDRALREF
RIRGVSTNIP FVENLLKHPA FLDNSYTTRF IDTAPELFDF DKRRDRATRL LTYLAEITVN
GHPETLGRPQ PAAGLPLPVP PEPRDEPAPG TRNLLEAQGP QAVADWLAGR KELLLTDTTM
RDAHQSLLAT RMRSFDMARV APAYAANLPQ LFSVECWGGA TFDVAYRFLQ ECPWQRLRQI
REAMPNVMTQ MLLRGSNGVG YTNYPDNVVR AFVHQAADSG VDVFRVFDSL NWVENMRVAM
DAVLESGKVC EGTLCYTGDI LDPGRDKYDL KYYVAMGKAL RDAGAHILGV KDMAGLLKPA
AARVLFRALK EEVGLPIHFH THDTSGIAGA TVLAAADAGV DVADVAMDAF SGNTSQPVFG
SIVEALRHTE RDTGLDMSAV REISNYWEQV RAHYAAFETG QQSPSSEVYL HEMPGGQFTN
LKAQARSLGL EERWHEVAQA YADANQIFGD IVKVTPSSKV VGDMALMMVS QGLTREQVED
PAVDVNFPDS VIDMLRGNLG HPPGGWPEGI QKKALKGEQP LQDRPGKYLE PLDLEAVRQQ
ASDELDGAEI DDEDLNGYLM YPKVFTEYKR RRERFGPVRT LPTRNFFYGM EAGEEISVDI
DPGKTLEIRL MTVSEPGEDG DRRVFFELNG QPRTVRVADN QAKAQVVQTP KAEEGNPAHV
GAPTPGVVAS VAATPGQNVK AGDVLLIIEA MKMEMGLHAE RDGVVKAVHV QPGSQIEAKD
LLVEFEA