Gene Information |
Locus tag | Cphamn1_0577 |
Symbol | |
ID | 6374241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 604880 |
End bp | 607648 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642683090 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_001959017 |
Protein GI | 189499547 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
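The coordinate and length fields in the table above are mutually consistent, and a short sketch makes the arithmetic explicit. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 604880
END_BP = 607648
GENE_LENGTH_BP = 2769
PROTEIN_LENGTH_AA = 922


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 2769
    print(protein_length(GENE_LENGTH_BP))  # 922
```

Under these assumptions the span 604880..607648 covers 2769 bp, and 2769 / 3 = 923 codons, one of which is the stop, giving the listed 922 aa.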
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.222337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCACCT CTGCAACAAG TGTCGTCGAT TACGATAAAG CCTTAGTCGA TTTCCGGTTT TTACTGGACT GTTTTCGGAA AGTACTTCAG GAATTCGGAG AAACAGAACT TGCAGCGCAC CTTCCCTGGG ACGAAGCATC AGTTCCGATA AGTCTCATCC CAAACAGAGG CAGAGCTGTA CAGGCTTATA CCATAGTATT CCAGCTTCTC AACATGGCTG AAGAAAACTC AGGGGCCCAG TTCCGCAGAA CCCTTGAAGC GACGAAAGAG CCTTCATCTC TTTCCGGACT ATGGGGAGAC GCGTTCAGCC ACCTTAAAAA AATCGGGATT TCAGAGTCGC GCATCGTAGA AGAACTTTCA CACATCTCGG TAGAACCCGT ACTGACCGCG CATCCTACAG AAGCAAAACG ACGCACGGTA CTCGAACAGC ACCGTGAACT CTACCTGCTT CTCGTGAAAC TTGAGAACCA GATGTGGACT CCTGCAGAAC GGGAGGATAT TCGAACAGAG ATCACAGTCG TGCTTGAACG ACTGTGGCGA ACAGGAGAAA TATTTTTTGA GAAACCAACC GTTGACGCAG AACGGCAGAA CGTCATCCAT TACCTCTACA ACATCTTCCC TTCCGTTCTT TCCATGCTTG ACAAAAGGCT CATCCAGGCC TGGACTGACG CAGGATTCGA CAGAGCATCG ATCGAAGATC CGAAAAACTT TCCCAAGCTC AGTTTCGGGA GTTGGGTAGG AGGAGACAGG GACGGTCACC CTTTCGTCAC CCCTGAGGTA ACACAAGAGA CGTTGATGGA GATGCGACAG TATTCGCTCA GACTGGTGCA GAACCACTTG CGTGAACTCG CCTCCAAACT CAGCCTTTCC GAGCGGTTTC AGTCTCCAAC ACCCGCCATT TCAGAGCGAA TCACCCGACT CAGGGAGCTG ACCGACACAA GGGGAGATAA CGCGATATCA AGAAATCCCA AAGAACCCTG GCGTCAGTTT ATCAATCTGA TGATAGCGGC TCTGCCTCCC GAACCCGTCC TTCATGAGTT TCACGATGAA GAGCACTATC CGGATAGATA CCGCCACCAT GACGAACTGC TTTCAGATCT CGACATTCTG AGAGACTCTC TCCTGACGAT AGGAGCGGAA AACATTGCGA AAAACGATGT AAACCCTGTT TACCGGATTG TGCAGACCTT CGGATTCCAT CTCGGCAAAC TGGATATACG GCAGAACAGC CGTATTCACG ATCTTGCGAT GTCACAGCTG ATGCGTGCGG CCGGATTTAA AAACTGCGAT TTTCCGGACT GGAACGAAAA AGAGCGCATG GCTTTTCTGA ACAGTGAACT CCTCTCTCCC CGCCCGTTCA CCCATCCAGA CATGGAGGTC GGTCCGGAAG CGGAAACACT TCTCCGATGC TACCGTGCTT TGGGAAAACA TCTTAAACAA TTCGGTCCTC ACGGTATCGG CTCCCTTATC GTCAGCATGA CCCGAAACGT TTCCGATCTG CTTGCCATCT ATCTTTTTGC ACGGGAAGTC GGCCTGATGA CCACACATGA CGGCGGCGAC GCATGCATGA TCCCCGTGGT TCCCCTGTTC GAAACCATTG ACGATCTCAC AAAAAGCCCG GAAATTCTGC GGGAATTTCT CGATAATCCA CTCACCAGAA GGAGCATCGC CTATCATCGG CAGATCCGGA ACGGGACAAG ACCGTCCCAG CAGGTCATGG TTGGCTACAG CGACAGCAAC AAGGACGGCG GGATCACTTC CAGCCTCTGG AACCTGTACA GGGCGGAAGA AAAAATGCTT 
GAAGTCGGCA GAGAACAGGA TATTGATATT CTTTTCTTTC ACGGACGCGG AGGAAGCATC AGCCGGGGAG CAGGACCGAC TCACCGCTTC ATCAGGGCCC AACCCCACTG CTCGCTCGAA ACCGGCCTCA GGGTCACCGA ACAGGGAGAA ACCATCTCCC AGAAATACGC GAACAAGATC AACGCGACCT ATAACCTTGA GCTGTTCATG GCCGGCGTGA CCGAGGCCCG GCTCGCTCAT CGGGTCAATC CAAAAAGCAA AATGTCGATC GAGCCTGTCA TGGATCTCCT CGCCGTTGAA AGCAACAGAG CTTACCGGAG TTTGATCGAA TCGGAAGGAT TGCTTGAGTT TTTCCATCAG GCGACCCCGA TAGACATTAT TGAATCGAGC AGGATCGGTT CCCGTCCTTC ACGAAGAAGC GGAAAAGAAA GCCTCGATGA CCTGCGCGCA ATACCCTGGG TATTCAGCTG GAACCAGGCC CGGTTCGCCA TTTCCGGATG GTACGGATTC GGAAGCGCCC TCGAACACCT CAGGAAAACA CAGCCGGAAG CGTTCGATCT CATCAGGGAA AAGGATTTTT CATGGCCGCC GCTGCGCTAC ATTGCAAGCA ATGTCTCGAC AAGTATCGCG ACTGTTGACC CTGAAATCAT GCAGAAATAT GCCATGCTGG TCGAAAACCC TGATACCCGT CATAGAAATC TCACGATGAT TACCGCCGAG TACCGGAGAA CACGGGAGCT TCTGGACATA TTCTACGGTG GTTCACTTCA GGACAAATTC TTTAACGTGT TCCGCTTCAT CACTATGCGT CAGGAAGGAC TGAGCGAACT GCACCATTTG CAGATCGATC TCCTGAGAAC CTGGAGAAAA CATAAATTAT CTGGAAAAGA GGACACAGCC GATGCCATGC TCCCTGAGCT GCTGCTCACC GTCAACGCCA TAGCGAGCGG ACTGCGGTCC ACGGGATGA
Protein sequence | MITSATSVVD YDKALVDFRF LLDCFRKVLQ EFGETELAAH LPWDEASVPI SLIPNRGRAV QAYTIVFQLL NMAEENSGAQ FRRTLEATKE PSSLSGLWGD AFSHLKKIGI SESRIVEELS HISVEPVLTA HPTEAKRRTV LEQHRELYLL LVKLENQMWT PAEREDIRTE ITVVLERLWR TGEIFFEKPT VDAERQNVIH YLYNIFPSVL SMLDKRLIQA WTDAGFDRAS IEDPKNFPKL SFGSWVGGDR DGHPFVTPEV TQETLMEMRQ YSLRLVQNHL RELASKLSLS ERFQSPTPAI SERITRLREL TDTRGDNAIS RNPKEPWRQF INLMIAALPP EPVLHEFHDE EHYPDRYRHH DELLSDLDIL RDSLLTIGAE NIAKNDVNPV YRIVQTFGFH LGKLDIRQNS RIHDLAMSQL MRAAGFKNCD FPDWNEKERM AFLNSELLSP RPFTHPDMEV GPEAETLLRC YRALGKHLKQ FGPHGIGSLI VSMTRNVSDL LAIYLFAREV GLMTTHDGGD ACMIPVVPLF ETIDDLTKSP EILREFLDNP LTRRSIAYHR QIRNGTRPSQ QVMVGYSDSN KDGGITSSLW NLYRAEEKML EVGREQDIDI LFFHGRGGSI SRGAGPTHRF IRAQPHCSLE TGLRVTEQGE TISQKYANKI NATYNLELFM AGVTEARLAH RVNPKSKMSI EPVMDLLAVE SNRAYRSLIE SEGLLEFFHQ ATPIDIIESS RIGSRPSRRS GKESLDDLRA IPWVFSWNQA RFAISGWYGF GSALEHLRKT QPEAFDLIRE KDFSWPPLRY IASNVSTSIA TVDPEIMQKY AMLVENPDTR HRNLTMITAE YRRTRELLDI FYGGSLQDKF FNVFRFITMR QEGLSELHHL QIDLLRTWRK HKLSGKEDTA DAMLPELLLT VNAIASGLRS TG
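The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. What follows is a minimal sketch, not the method used to produce this record. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the strand is listed as "-") and uses the standard codon assignments, which translation table 11 shares with table 1 for non-initiating codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Alternative start codons and ambiguous bases such as N are not handled.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks and last nine bases of the gene sequence in this record.
    print(translate("ATGATCACCT CTGCAACAAG TGTCGTCGAT"))  # MITSATSVVD
    print(translate("ACGGGATGA"))                         # TG*
```

The two fragments reproduce the first ten residues (MITSATSVVD) and the final two residues (TG) of the listed protein, followed by the stop codon. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.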