Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0808 |
Symbol | |
ID | 5732708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 912985 |
End bp | 915711 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277939 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001543584 |
Protein GI | 159897337 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAATT TGCCAAGCTA CGATTTTGCC GATTTAATCA AAGCTGACCA TGATGCTGAA TTTGTTTTGA CATGTTTCGC TGAAGTATTA CAAGAATTAG GCGAGGCTGA ACTCGCTGCC TATCTTGGGT TGCCCACAAC CGCCTCAATT AACCAAACAA TTACGCCTGA ACGAGCGATT CAAGCGGCAT CGCTCAGCTT TCAATTGCTG AATATGGTCG AGGAAAATGC CGCTGCCCAA CAACGCCGCA TCCGCGAAGC TGCCGAAGGC TTTAATGCTG AAGTTGGGCT TTGGGGCAAT ACCTTGCAAA GCCTGAGCCA AGCTGGCTTC AGCGCCGAGC AGATTGGCGC TGCCTTAGGC CAGATTTATG TTGAACCAGT CCTGACCGCC CACCCCACCG AGGCCAAACG GGCGACCATG CTTGAGCATT ATCGACGTTT ATATTTGCTG TTGGTCAAGC GCGAAAACCC CATTTGGACT CCGCTGGAGC AACAGGCGCT GCGCGATGAG ATTAAAGTTG AGCTTGAGCG GCTTTGGCGC ACTGGTGAGA TTTTCCTTGA AAAGCCGAGC GTCGCCGATG AATTGCGCAA TATTTTGCAT TATTTGCGTC ATGTGTTTCC CGCCAGTTTG GTAACGCTCG ATTTGCGTTT GCAACAGGCT TGGCAATTGC AAGGCTTTGA TCGGCGGTTG TTACCAACAA CCGAGCAATT ACCGCATGTG CAGTTTGGTA CATGGGTTGG CGGCGATCGC GATGGACATC CCTTGGTTAC GGCGGCGGTC ACGCGCTTTG CTTTGCAGGA GTTGCGCCGC AATGCGCTTG AGGTTTTGCA CGAACAATTA GTTCAACTAG TGATCAAATT AAGCTTATCT GATCGCTTGC AACCACCGCA CGCCGCCTTG CTTGAAGCCC TCGATGCTAG TGCGGCTGCG TTGGGCCAAC GGGGGCAAAT GGCCTTGGCA CGGAATCCCG AGGAGCCATG GCGACAATGG ATCAACTTAA TCATAGCGCG ATTGCCCGAA TCAGGCCAAC TACGCCAGCC TTGGCAGTAT CGCAGTAGCA GCGAAACTGT GGCTGACCTG CAATTTTTGG CCGAGCAACT GCATGCTGTT GGCGCACAAC GGCTTGTTTT GAATGATCTT CAGCCTGTGA TTCGCAGTGT GCAAACTTTG GGCTTTCACA GCGCCGTGCT CGATATTCGC CAAAACAGCA AATTCCACGA TTTGGCGGTA GAACAATTGC TGCAAGCTGC TGGATTCAGC GATTATCAAT TTAGTAGTTG GCCCGAGGAA CAACGGCTCG AATTGCTCAA TCGCGAGTTG CAAAGTGCCC GCCCATTTGC CCACCCTAGC CTTGAGTTGG GCAACGAAGC GAGCGCTGTG CGCGATTGCT ATCGTGTGTT GGCCGATGAA ATTGCCCAGC ATGGCACGGC AGGCCTTGGC TCGTTGATTA TCAGCATGAC CCGCAGCCTC TCGGATTTGC TGGTGGTGTA TTTGTTGGCA CGTGAAGCTA GCTTGTTATA CGTTACTGAG GCTGGTTTGG CGTGTGTGCT GCCAGTTGTG CCATTATTCG AAACGATCGA AGATTTGGAA ATTAGCCCTG GTATTCTCGA TGCGTTTTTG GCGCATCCGG TCAGCCAAGC GAGCCGAGCG TTGCGCCAAA CTAGCGTGCA ACAGGTGATG GTTGGCTATA GCGATAGCAA CAAAGATGGC GGCATTTTGG CCAGTTTGTG GAGCTTGTAT CGGTCGCAAG GCACGCTAGC AGCGGTCGGA GCCAAGCATC ATGTGCGAGT ACGTTTCTTC CACGGTCGTG GCGGCACAAT CAGCCGTGGC GCTGGTCCAA CCCACCGTTT CTTGAACGCG CTACCAGCAG CGGCCTTGGC AGGCGATTTG CGCATGACCG AGCAAGGCGA AACCATCGCC CAAAAATATG CCAACCATAT CACGGCGGTC TATAATTTAG AATTAATGGT GGCGGGCGTA ACCGAAGCCA CTTTGCTTGG CTCGCAGCGT GATCAAACGC CCCACAGCCT TGCCCCAATC ATGGATGTAC TGACTGCCTA CAGCCGCCAA CGCTACGAAA CCTTGATTCA AACACCTGGT TTTATTCAGT TTTTTGGTCA AGCCACGCCG ATTGATGTGA TTGAGCAAGG CAAAATCGGC TCACGCCCTG CCCGCCGCAC TGGCCAACGC ACCCTTGGCG ATTTGCGGGC AATTCCATGG GTATTTAGCT GGAGCCAAGC CCGCTTTTTT CTTTCTGGTT GGTATGGGGT TGGCAGCAGC TTGGCATGGC TCGCCGAGCA ACATCCTGAA CAGTTTGACC AACTCAAACA AGCGGCCTTT GAATGGTATC CGTTGAAATA TTTATTAACC AATGTCAGCA CCAGCATGCT CTCGGCTGAT CTGGCAACCA TGCAAGCCTA TAGCCAGTTG GTCGAAGATC CAAGCGTGCG CCAGCCGATT ATGGCGGCCA TTGAGGCTGA GTTTAAGCAA ACCCAACAGC AACTAGAGCT GATTTTTGGC GGGAGTTTGG CCGAACGTCG CCCACGGATT TATCGTATGT TGCAAGGTCG TCAGTCACGA TTAAGCCAAT TGCATCAGCA ACAAATCAAT TTGCTGCATA CATGGCGCAG CCTGCCCAGC GAGCAAATTC CAGAGCGTGA GCGATTGCTA ACCCAGTTGT TGCTCACGGT CAATGCGATT GCTAGCGGCC TACGCACGAC TGGCTAA
|
Protein sequence | MANLPSYDFA DLIKADHDAE FVLTCFAEVL QELGEAELAA YLGLPTTASI NQTITPERAI QAASLSFQLL NMVEENAAAQ QRRIREAAEG FNAEVGLWGN TLQSLSQAGF SAEQIGAALG QIYVEPVLTA HPTEAKRATM LEHYRRLYLL LVKRENPIWT PLEQQALRDE IKVELERLWR TGEIFLEKPS VADELRNILH YLRHVFPASL VTLDLRLQQA WQLQGFDRRL LPTTEQLPHV QFGTWVGGDR DGHPLVTAAV TRFALQELRR NALEVLHEQL VQLVIKLSLS DRLQPPHAAL LEALDASAAA LGQRGQMALA RNPEEPWRQW INLIIARLPE SGQLRQPWQY RSSSETVADL QFLAEQLHAV GAQRLVLNDL QPVIRSVQTL GFHSAVLDIR QNSKFHDLAV EQLLQAAGFS DYQFSSWPEE QRLELLNREL QSARPFAHPS LELGNEASAV RDCYRVLADE IAQHGTAGLG SLIISMTRSL SDLLVVYLLA REASLLYVTE AGLACVLPVV PLFETIEDLE ISPGILDAFL AHPVSQASRA LRQTSVQQVM VGYSDSNKDG GILASLWSLY RSQGTLAAVG AKHHVRVRFF HGRGGTISRG AGPTHRFLNA LPAAALAGDL RMTEQGETIA QKYANHITAV YNLELMVAGV TEATLLGSQR DQTPHSLAPI MDVLTAYSRQ RYETLIQTPG FIQFFGQATP IDVIEQGKIG SRPARRTGQR TLGDLRAIPW VFSWSQARFF LSGWYGVGSS LAWLAEQHPE QFDQLKQAAF EWYPLKYLLT NVSTSMLSAD LATMQAYSQL VEDPSVRQPI MAAIEAEFKQ TQQQLELIFG GSLAERRPRI YRMLQGRQSR LSQLHQQQIN LLHTWRSLPS EQIPERERLL TQLLLTVNAI ASGLRTTG
|
| |