Gene Information |
Locus tag | Plut_1629 |
Symbol | |
ID | 3746135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1823431 |
End bp | 1826202 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637769662 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_375526 |
Protein GI | 78187483 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
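The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although the listed numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1823431
end_bp = 1826202
gene_length_bp = 2772
protein_length_aa = 923

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is taken to be the stop.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons, "remainder:", remainder)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions the computed span matches the Gene Length field and the codon count minus the stop codon matches the Protein Length field.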
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.194507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGCA CGACCAGCAC CATCATCGAC TTTGAAAAAG CCGAACGGGA TGCGGGCTTC CTCATCAGCT GTTTTCAGGA GATGCTCTGC TGCCTCGGTG AAGAGGCTCT GGCAGCCCGC CTCCCGCGCG GCGGTCGCGG AATCGACCTT GGACACTACC CCGACACGGA GCGGGCGCTG CAGGCCTATT CCATATTCTT CCAGCTGCTC AACATGGCAG AGGAAAACTC GGCGGCACAG AACCGCCGGA CGCTTGAAGC CGAACACGGT GCCGCTTCTC TGGAGGGGCT CTGGGGCCGG ACCCTCCTTC CGGTCCAGCA GCAGGGGCTC TCGGAGGAAG AAGCCCGGAT TGTGCTCTCA CGGGTGCAGG TTCATCCCGT ACTGACAGCC CATCCGACAG AAGCGAAACG AAAAACCGTG CTGGAACTCC ACCGCAACAT ATACCTGCTC CTGGTCAAAC GGGAGAACCA GATGTGGACC CCCGCCGAGC AGCTCGATAT CCGCAGGGAG ATTACGTCTG CAATGGAAAC GCTGTGGAGG ACCGGCGAAA TCCTGTTCCA GAAACCATCC ATCGCCGATG AAAGGGCAAA CGTCGCGCAC TATCTGGAGA ACATCTTTCC CGATGCCATC ACAATGATGG ACCGCCGGCT CATGAAAGCG TGGGAAGCTG CAGGGTTTTC CGCCCGCTCA TTCAAAGTCC CTGACGACCT CCCCTCTATC CGCTTCAGCA GCTGGGTGGG CGGCGACCGT GACGGCCACC CCCTCGTCAC CGCAGATGTC ACCGCCGAGA CGCTCGGCGA CATGCGCCTG AAATCCGTCG GACTCATTGA TGCCGCCCTC AGGAACCTTG GCTCGAAGCT CAGCCTGTCC GATCGTCTGC AGGATCCCGG AGAGGAGCTC ATTGGGCGCA TGGACACCCT CAGAAAAAAA CACGGAAAAC GCGGCATGGA TGCCGTCCGC CGCAATGAGA GGGAGCCATG GCGGCAGTTC ATCAACCTTA TGAGAGTATC TCTCCCGCCT GCCATCCGGG GATCGGGATC GGGATCGGGA TCGGCATTCC GCTACCACTC GGCCAGGGAG CTCCTCGGAG ATCTTGCCAT CCTCACCCGT TCACTTGAAG AAATAGGGGC GGAAAACATC GCCCGGGAGC TCGTGTTCCC TCTCTCACGG ACAGTGCAGT CATTCGGGTT CCACCTCGCA TCGCTCGATA TCCGCCAGAA CAGCCGGATT CACGATCTGG CTCTCACCGC CCTTATAAGA GCGGCAGGAA ACGGGGAGCA GGATGAAACC CCATACATCG ACATGAGCGA AGAGAAAAAG CGGGAGCTGC TCGACCGTGA ACTCCGCTCC CCTCGCCCGT TCACACGCCC CGACATGCAG GCAGGACCGG AAGCCGAATC GGTGCTCGGC TGCTTCCGGG TACTTTCAGG CCATATCCGA TCGTTCGGCA CCGAAGGAAT CGGGTCGCTC ATCGTCAGCA TGACCCGCAG CGTCTCCGAC CTCCTCTCCG TCTATCTCTT TGCGCGTGAA ACCGGGCTCA TGACTTCGAT AAAAGGAAAG AGCGCCTGCA TCGTTCCGGT CGTCCCTCTC TTTGAAACCA TCGGCGACCT GGAAAAAAGT CCCGGGATTC TCGATGCCTT CCTCAGCCAC CCTGTCACAG AAAGCAGCCT CGAGTACCAG CGGCGGCGCG ATGGCGCAAA ACGAAGGGTG ATGGAAGTAA TGATCGGCTA CAGCGACAGC AACAAGGACG GAGGACTCCT GACCAGCGCC TGGGCGCTCT ACAGGGCAGA GGAAGCAATG 
CTGGAAGTCG GCCGCCGTCA CAACACATGC ATCCGCTTCT TCCACGGCAG GGGTGGAAGC ATCAGCCGCG GAGCCGGACC GACGCACCGG TTCATAAGGG CGCAGCCCTA CGGAGCATGC AACGGCGGAA TGCGCTTCAC CGAACAGGGT GAAACCATCG CACAGAAATA CGCGAACCGC ATCAGTGCGG TCTACAATCT TGAGCTCTTC ATGGCAGGTG TTGCCGGCTC GCTCATCAGG GAGAGCCGGC ACGAAAAAGC GGCGCACCAC CTCGAACCGG TGATGGACCG CCTCTCCAGC GCATCCTACA GCGCCTATCG GACACTCATT GAAACAGAGG GGTTTGTGAC CTTCTTCCGT CAGGCGGCTC CTGTCGACAT CATTGAAGCC AACCGGATCG GCTCACGCCC ATCGAGGCGC ACCGGTGGAG ACAGCCTGGA CGACCTGCGG GCCATTCCCT GGGTGTTCAG CTGGAATCAG GCCCGGTTTT CGCTTTCAGG GTGGTACGGA GTTGGCTCTG CGCTCGAAGC ACTCAATGAA AGCGACCCTG AAGCATTCGA GGACATCCGC ATCCGCTCCC ACCAATGGCC TCCCCTGCGC TACATCATCA GCAATGCCGA CACCGGACTT GCCGCCGCTG ACCTTTCAGT CATGCGCCTG TATGCAGGGC TCGTTACTGA GGCCGGACTG AGAGACAAGA TCATGGGCAT GATTGAAGAT GAGTATCGGA AAACTAAACG CTTCATCGAT ATCCTCTACA GCAAGCCATT GGCTACCCAG CGCCTCAACG TCAACCGATT CATAGAACTC CGGCGCGAAG GACTCGGCCT GCTGCACCGC TGGCAGGTAC AGCGCATCGC CGAGTGGAGG AAGCTGCTGG ATGATGGCCG AAAGGAAGAG GCCGACGCCA TGCTGCCGGA ACTCTTCCTG ACCGTAAACG CCATCTCGGG AGGGCTCCGC ACGACCGGCT GA
|
Protein sequence | MISTTSTIID FEKAERDAGF LISCFQEMLC CLGEEALAAR LPRGGRGIDL GHYPDTERAL QAYSIFFQLL NMAEENSAAQ NRRTLEAEHG AASLEGLWGR TLLPVQQQGL SEEEARIVLS RVQVHPVLTA HPTEAKRKTV LELHRNIYLL LVKRENQMWT PAEQLDIRRE ITSAMETLWR TGEILFQKPS IADERANVAH YLENIFPDAI TMMDRRLMKA WEAAGFSARS FKVPDDLPSI RFSSWVGGDR DGHPLVTADV TAETLGDMRL KSVGLIDAAL RNLGSKLSLS DRLQDPGEEL IGRMDTLRKK HGKRGMDAVR RNEREPWRQF INLMRVSLPP AIRGSGSGSG SAFRYHSARE LLGDLAILTR SLEEIGAENI ARELVFPLSR TVQSFGFHLA SLDIRQNSRI HDLALTALIR AAGNGEQDET PYIDMSEEKK RELLDRELRS PRPFTRPDMQ AGPEAESVLG CFRVLSGHIR SFGTEGIGSL IVSMTRSVSD LLSVYLFARE TGLMTSIKGK SACIVPVVPL FETIGDLEKS PGILDAFLSH PVTESSLEYQ RRRDGAKRRV MEVMIGYSDS NKDGGLLTSA WALYRAEEAM LEVGRRHNTC IRFFHGRGGS ISRGAGPTHR FIRAQPYGAC NGGMRFTEQG ETIAQKYANR ISAVYNLELF MAGVAGSLIR ESRHEKAAHH LEPVMDRLSS ASYSAYRTLI ETEGFVTFFR QAAPVDIIEA NRIGSRPSRR TGGDSLDDLR AIPWVFSWNQ ARFSLSGWYG VGSALEALNE SDPEAFEDIR IRSHQWPPLR YIISNADTGL AAADLSVMRL YAGLVTEAGL RDKIMGMIED EYRKTKRFID ILYSKPLATQ RLNVNRFIEL RREGLGLLHR WQVQRIAEWR KLLDDGRKEE ADAMLPELFL TVNAISGGLR TTG
|
| |
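The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping and confirm that the start of the gene sequence translates to the start of the protein sequence. It uses only the first three blocks of the gene and a deliberately partial codon table covering just the codons in that prefix; both the helper names and the partial table are illustrative assumptions, and a full check would need the complete table for translation table 11.

```python
# First three 10-base blocks of the gene sequence, copied from the record above.
gene_prefix = "ATGATCAGCA CGACCAGCAC CATCATCGAC"
# First 10-residue block of the protein sequence, copied from the record above.
protein_prefix = "MISTTSTIID"

# Partial codon table: only the codons that occur in this prefix (not a complete table).
PARTIAL_CODON_TABLE = {
    "ATG": "M",
    "ATC": "I",
    "AGC": "S",
    "ACG": "T",
    "ACC": "T",
    "GAC": "D",
}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split())


def translate_prefix(dna: str) -> str:
    """Translate whole codons from the start of dna using the partial table."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing incomplete codon
    return "".join(PARTIAL_CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


dna = strip_blocks(gene_prefix)
translated = translate_prefix(dna)
print(translated == protein_prefix)
```

The same `strip_blocks` step would apply to the full gene and protein sequences before computing their lengths or comparing them.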