Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0559 |
Symbol | |
ID | 4569426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 625602 |
End bp | 628370 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639765157 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_911039 |
Protein GI | 119356395 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACCA CGACAAGCAG CGTCATTAAT TTTGAAAAAG CGGAAAGGGA CTTCACCTTT CTGCTCCAGT GTTTTCAGGA AATGCTTCAT GATCTGGGCG AAAAAGAGCT TGCCGAACAT CTTCCGTGGA AACGGGCGTC CATGCGCCTC GATGATTATC CCGACATTGA GCGAGCCATA CAGGCATACT CGATTTTTTT CCAACTGCTC AACATGGCAG AAGAAAACTC AGGCGCGCAG TATCGACGCA CAATTGAAAC AGAAAAAGGA CTATCGGAGC TCAAGGGACT CTGGGGAAAA AACCTGCATC GGCTCAAAAA AAACGGCATA ACCGAAGAGG AAATTTTGAG CGAACTATCG CGGGTCACCG TCGATGTCGC CCTGACAGCC CACCCCACCG AGGCAAAACG AAAAACCGTT CTTGAACAGC ACCGCCAGCT CTACATTCTT CTGGTAAAAC GTGAAAACCA GATGTGGACA CCACTTGAAC AAGAGGATAT CCGTAAAGAG ATAAAGGTCG TCATGGAAAG ATTGTGGCGA ACCGGTGAGC TCCTGTACGA CAAACCGGCC GTATCGGCTG AACGCAGAAA TATTGTTCAC TACCTCTATA ACATTTTTCC CGAAGTGCTG CCCATGCTTG ACAAACGGCT TCTTCACGCA TGGCAGGAAT GCGGGTTCGC TCCCGAAAAA CTCGCAGACC CGCAAACGCT TCCCCTGCTT CGTTTTGGAA GCTGGGTTGG CGGCGACAGG GACGGCCACC CTCTTGTCAC CGCAAAAGTA ACCAGAGAGA CCCTTCTGGA AATGCGTCGC AAAGCGCTTG CCCTGATCGC CTTTCATCTT ACCAGGCTTG CCTCAAAGCT CAGCCTGTCT GAAAAACACC AGTTGCCCCC GGACGCACTG GAAGAAAGAA TCAGAACGCT CGCCAACCAT CTGGGAGAAG ATGGCGAGGT GGCACTCATA AGAAACCGGC ATGAGCCATG GCGCCAGTTC ATAAATCTCA TGATTGCATC CCTGCCGAAA AACCCCCGGA ACGAACATGA CACAGGAAAC CAGAACGACA TATTCTCGTA CGGTTCTTCG CGTGAGCTGC TCGACGACCT TGAGCTGTTG CGACAATCCC TGCTTAACGT AGGAGCCCGG CATATTGCCG ATTCCGATGT GGCTCCGGTC TATCGCATTG TTCAGACTTT CGGCTTCCAT CTGGCATCAC TCGATATCCG GCAAAACAGC GAGTTTCACG ATCTGGCCCT CTCTCAGCTT ATGAGCGCTG CCGGCTTGAA CGGAAGTGCA TTTCTGACCT GGAATGAAGA AGAACGAACA GCCTTTCTTG ACAGGGAGCT GCAATCACCC CGACCGTTCA CCCATCCCGA TATGAAAGCC GGCCCTGAAG CCGAGGCGGT TCTCGACTGC TATCGGGTTC TTCTGGACCA TTGCAAGGAG TATGGAAGCG AAGGAATAGG CTCCCTGATC GTCAGCATGA CAAGAAATGC CTCCGATCTT CTTGTGGTCT ATCTTTTTAT GAGAGAAGTC GGCCTGATGA CCCCTTTCAA AGAGCACGAT GCCTGTATGC TTCCCGTCGT CCCGCTCTTT GAAACCATTG AAGACCTTGA AAAAAGCCCT GATATCATGA GAACATTTCT CTCCCATCCG CTCACCCGGC GAACTCTTGG CATGCAGCAA GAGATGAACG GCTGGGAAAA ACCGGCACAA CAGGTGATGA TCGGATACAG CGACAGCAAC AAGGACGGAG GAATTTTTGC CAGTATCTGG AACCTGTACC GGGCACAGGA GTCGTTGCTT GAGGCCGGCA AAAAATCAGA AACAGAGATA CTTTTCTTTC ATGGCCGAGG AGGAAGCATC AGCAGGGGCG CTGGCCCGAC CCACCGGTTT TTGCGAGCCC AGCCTCACGG TTCATTCCAG GCGGGCATCC GCTTTACCGA ACAGGGTGAA ACAATTGCAC AAAAATATGC AAACAGGATC AGTGCACTCT ACAACCTCGA ACTCTTCATG GCGGGAGTAA CCGGAGAGCT GCTGAAACAC CGCAATAAAG AAAAAACAAG CCATGAACTT GAGCCGTTGA TGGATATGCT CTCACGGACA AGCAACCTTG CCTACCGCAG GCTGATCGAG GCAGAAGGGT TTATTGATTT TTTCAGGGAG GCAACGCCGA TAGATATTAT TGAGGCTACC CGGATAGGAT CCCGTCCCTC AAGGCGTTCA GGGAAAAAAA GCCTTGCGGA TTTACGCGCA ATCCCGTGGG TATTCAGCTG GAACCAGGCG AGGTTCTCCC TTTCAGGATG GTTCGGCATC GGCTCAGCAC TCGAACAGCT TCAGCATGAA CATCCCGAAG CGTGCGACAA CATTCGACGC CAGGATGCGT CATGGGCACC CCTCCGCTAC ATCATAAGCA ATGCGGATGC AAGCCTTGCA ACGGTCGATA CCGAGATTAT GAGAGAGTAT GCCGCCCTTG TCGAAAACAG CGAACTCCGC TCAAGAATCT TCGGCATCAT TGAAAACGAG TACCATAAAA CCCGAACAAT GATAGAGAGC CTCTATGGCG CGCATCTTGA AGAACAACGC CTCAATGTCT CCAGATTCAT CTCCCTGCGT CAGGAAGGGC TCAGGGAGCT GCACCGCCAG CAGATCCGGC TGCTGAAAAA ATGGCGGAAG CTGCACCAAA CAGGACAGAA CAAAAAAGCT GACAATCTGC TGGCAGAACT TTTTCTTACC GTCAATGCCA TATCAGGAGG ACTGAGAACC ACCGGATGA
|
Protein sequence | MITTTSSVIN FEKAERDFTF LLQCFQEMLH DLGEKELAEH LPWKRASMRL DDYPDIERAI QAYSIFFQLL NMAEENSGAQ YRRTIETEKG LSELKGLWGK NLHRLKKNGI TEEEILSELS RVTVDVALTA HPTEAKRKTV LEQHRQLYIL LVKRENQMWT PLEQEDIRKE IKVVMERLWR TGELLYDKPA VSAERRNIVH YLYNIFPEVL PMLDKRLLHA WQECGFAPEK LADPQTLPLL RFGSWVGGDR DGHPLVTAKV TRETLLEMRR KALALIAFHL TRLASKLSLS EKHQLPPDAL EERIRTLANH LGEDGEVALI RNRHEPWRQF INLMIASLPK NPRNEHDTGN QNDIFSYGSS RELLDDLELL RQSLLNVGAR HIADSDVAPV YRIVQTFGFH LASLDIRQNS EFHDLALSQL MSAAGLNGSA FLTWNEEERT AFLDRELQSP RPFTHPDMKA GPEAEAVLDC YRVLLDHCKE YGSEGIGSLI VSMTRNASDL LVVYLFMREV GLMTPFKEHD ACMLPVVPLF ETIEDLEKSP DIMRTFLSHP LTRRTLGMQQ EMNGWEKPAQ QVMIGYSDSN KDGGIFASIW NLYRAQESLL EAGKKSETEI LFFHGRGGSI SRGAGPTHRF LRAQPHGSFQ AGIRFTEQGE TIAQKYANRI SALYNLELFM AGVTGELLKH RNKEKTSHEL EPLMDMLSRT SNLAYRRLIE AEGFIDFFRE ATPIDIIEAT RIGSRPSRRS GKKSLADLRA IPWVFSWNQA RFSLSGWFGI GSALEQLQHE HPEACDNIRR QDASWAPLRY IISNADASLA TVDTEIMREY AALVENSELR SRIFGIIENE YHKTRTMIES LYGAHLEEQR LNVSRFISLR QEGLRELHRQ QIRLLKKWRK LHQTGQNKKA DNLLAELFLT VNAISGGLRT TG
|
| |