Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4237 |
Symbol | |
ID | 5736091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5400885 |
End bp | 5403674 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281392 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001546997 |
Protein GI | 159900750 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.582668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATACAC CTGAAACCGA CCGCACGCCG CTTTCAATGC TGATCCATTC CCTCGGGAAT GTGCTTGGCG ATGTGATTGT GGCCCAAGAT GGGGTGTCAG CGTTTGAGCT TGAAGAAGAT GTGCGCCAAC GCACCAAGCA ACGCCGAAGC GATGGAACAT TGCAAGAGAC TCAAACCCTG ACTGAGTTGA TTAGTCAATT GCCAGTTGCC CAACTCATGG GGCTGATTAA GGCCTTTACT CATTATTTTG GCTTGGTCAA TTTGGCCGAA AGCGTTGAGC GCCTACGAGT GCTGGCCGAA CGCGACCGCC AAAATGGCGA TGCACCACGC TCTGAATCGG TCGAATTGGC ATTGCAAGAG TTGCGTGATC GTGGCATTAC CGCCCAGCAA GTGCAAGATT TGCTTGATCA TGCCGAGATT CGGCCAGTTT TTACTGCCCA CCCGACCGAG GCCAAACGCC GAACGACGCT CAAAAAGCAC CATCGCATTG CAGGGGCGGC GCGGCAATTA ACCGCCGATA CGACCTTTCA ACGCCAGCGC GAACGCTTGC TCGAATCAAT TCGTGAGGAA GTGATCTCGC TCTGGCAAAG CGATGAGGTG CGGATTATCA AGCCAACCGT GATCGACGAA GTGAAGAATA ATCTCTATTA CTTCGAAGAA TCGCTGTTCG ATATGATTCC GCAACTCTAC CGTGATACCG AGGCCTCGTT GCGCCAAATT TACCCTGAGC ACGAATGGCG TGTGCCAGCC TTCCTGCGCT TTGGCTCATG GGTTGGCGGC GATCGCGATG GCAATCCCTT TGTAATTCCC TCGGTCACGG TTGAAACGCT TAAACTGTTG ATGGGCCGTT CGTTGCGTGA GCATATTCAT TCAGTTGAGC GCTTGAGTCA TCGTTTAAGC CAATCGTCGC GCCAAGTACC AATTAGCGAA GAATTAGCCC AATCGCTAAT CCATGATGCG CCGTTGTTCC CCGAATTGGC CCAAGTGCTG GAGCGGCGCA ATCCGCATGA GCCATATCGC CAAAAATGCT CCTACATTCA CGCCAAATTG CATGCCACCT TGGCCTATGT TGAGCGCTAC GAGCCAGATT GGGCACGCGG CGGCCATCGC CCAGCTGAAG GCACCTGGTA TGCCAATGCC AACCAATATC TCGCCGATTT AGCAACCATG GAATATAGCT TGCGCACCAA TGGCGCGGCC TCAGTCGCCG ATGGCTTTTT GCGCGATATC CAATGCTCGG CCAAAGTCTT TGGTTTGCAC ACCGCCACCC TCGATATTCG CCAACACAGC AGCCGCCACA CCAACGCCCT GAGCGAAATT TTTGAATATG CAGGCATCTG CGACGATTAC GCCAGCCTGA GCCAAGCCGA ACGCACGGCT GTATTGGAAC GCGAGCTAGC CAATAATCGT CCGTTGATTC CAACTCATCT CTACTACAGC CCCGAAACCG TTGAGATTAT CGAAACCTTC CGCACAATTC GCGCAGTGCT TTCCGATTTA AATGCTGAGG CCATCGAAAC TTACATCATT TCGATGACCG AAGGCCCAAG CGATATTTTG GCGGTGCTGT TGCTGGCTCG CGAGGCGGGC ATTTATCAGC CAGGTGAGCA TAGTTGGCTG AATATTGTGC CATTATTTGA AACCGGAGCC GACCTCATCG CCGCGCCGGA GATAATGCAC ACGCTGCTTT CGAGCGAAGC CTATCGCCAA CATTTGGTGT TGCGCAACGA TGTGCAAGAA ATTATGTTGG GCTACAGCGA TTCCAACAAA GATGGTGGTT TTGCCGATGC GCACTGGGCG CTCTATCTCG CTCAAGTGGC CTTGGCCGAA ACCTGTTTCC GACATCGAGT GGCCATGCGG CTGTTCCATG GCCGTGGTGG GGCGGTTGGC CGTGGTGGCG GGCCTGCCAA CCGTGCGATT TTGGGTCAAC CACCAGGCAC AGTCGGTGGG CGGATCAAAA TCACTGAACA AGGCGAAGTG ATTAGCGATC GTTATGCCGA GCCAGAAACG GCCTATCGCC ATCAAGAGCA AATTATCAAC GCAGTATTGC GCTCATCGTT AGGCGTGAGC ATCGCGCATA TCAGCCAAGA ATGGCACGAC GCGATGAGTA GTTTGGCCAA GGTTTCGCGT AAAGTCTATC GCGGCTTGGT CTACGATCAT CCGCACTTCT TGGAATACTT CCGCAATGCT ACGCCGATTA CCGAAATTAG CCGCTTGAAC ATTGGCTCAC GCCCAGCCAG CCGCAAAGCC AGTGACCGGA TCGAAGATTT GCGAGCGATT CCCTGGGTTT TTAGTTGGAT GCAAAGTCGG CATACCTTGC CAGGTTGGTA TGGCTTGGGC AGTGCCTTGG AGCATTTAAT CCAAGCTGAT GCCAATGGCT TGACCACCTT GCAGGGAATG TACAACGATT GGCCATTTTT CCGCACCATG CTGGATAATG CCCAAATGAT TTTATCCAAG GCTGATATGG ATATTGCGGC GCAATATGCC CTGCTTGTGC CCGACCAAGC CTTAGCCAAC GAAATCTTTG GCCTGATCAA AGCTGAATAC ACCCGCACCG TTAAATGGAT TTGCGAGGTG GCGCAAATTA ATGAGCTGCT GGATACTAGC CCAATTTTGC AGCACTCAAT TAAGCAGCGC AACCCGTATG TTGACCCATT AAGTTTCGTA CAAATCGAAT TGCTCCGGCG TTTGCGCACC GATCCCGATG GACTTGAGCA TAGCGATCTT GAAGATGCAA TTTTGTTAAG TATCAACGGG ATTGCCGCAG GCTTGAAAAA TACGGGTTAG
|
Protein sequence | MYTPETDRTP LSMLIHSLGN VLGDVIVAQD GVSAFELEED VRQRTKQRRS DGTLQETQTL TELISQLPVA QLMGLIKAFT HYFGLVNLAE SVERLRVLAE RDRQNGDAPR SESVELALQE LRDRGITAQQ VQDLLDHAEI RPVFTAHPTE AKRRTTLKKH HRIAGAARQL TADTTFQRQR ERLLESIREE VISLWQSDEV RIIKPTVIDE VKNNLYYFEE SLFDMIPQLY RDTEASLRQI YPEHEWRVPA FLRFGSWVGG DRDGNPFVIP SVTVETLKLL MGRSLREHIH SVERLSHRLS QSSRQVPISE ELAQSLIHDA PLFPELAQVL ERRNPHEPYR QKCSYIHAKL HATLAYVERY EPDWARGGHR PAEGTWYANA NQYLADLATM EYSLRTNGAA SVADGFLRDI QCSAKVFGLH TATLDIRQHS SRHTNALSEI FEYAGICDDY ASLSQAERTA VLERELANNR PLIPTHLYYS PETVEIIETF RTIRAVLSDL NAEAIETYII SMTEGPSDIL AVLLLAREAG IYQPGEHSWL NIVPLFETGA DLIAAPEIMH TLLSSEAYRQ HLVLRNDVQE IMLGYSDSNK DGGFADAHWA LYLAQVALAE TCFRHRVAMR LFHGRGGAVG RGGGPANRAI LGQPPGTVGG RIKITEQGEV ISDRYAEPET AYRHQEQIIN AVLRSSLGVS IAHISQEWHD AMSSLAKVSR KVYRGLVYDH PHFLEYFRNA TPITEISRLN IGSRPASRKA SDRIEDLRAI PWVFSWMQSR HTLPGWYGLG SALEHLIQAD ANGLTTLQGM YNDWPFFRTM LDNAQMILSK ADMDIAAQYA LLVPDQALAN EIFGLIKAEY TRTVKWICEV AQINELLDTS PILQHSIKQR NPYVDPLSFV QIELLRRLRT DPDGLEHSDL EDAILLSING IAAGLKNTG
|
| |