Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2967 |
Symbol | |
ID | 5540458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3847935 |
End bp | 3850667 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895086 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001433044 |
Protein GI | 156742915 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000103268 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCATCG GTTTGGGCGA CATACCGGCG CAGGCACGCA TCGAGCGTGA TCTGCGCTGG CTGCTTGATC GGTTCTACGA GGCGCTTGAT GCCGCCGGCG CCGCCGATAT TCATGCGCTG CTGCCCTGGA CGGGGAATCC GTCGTGTCCG GTGCGGTTGC CTGAGCGTGC GCCACAGGCA TATGCCATTG CGTTTCAGTT GCTCAACTTG GTTGAAGAGA ATGCTGAAGC GCAAACGCGC CGCTCTGCGG AGACCATTGA TGGACCGGCT GCACTGCGTG GTCTCTGGGG ACAGATCCTG GTTCAGTTGC GCGATGCAGG GTTGAGTGCA GCGCAGATTG CCGCTGCGTT GCCGCGCATA CAGGTCGAAC TCGTGTTGAC GGCGCATCCC ACCGAAGCCA AACGCGCGAC AGTGCTGGCG CACTACCGTG AGCTTTACCT GCAACTGGTC AAAGCCGAAA ACCAGATGTG GACGCCGCTG GAGCGGCAGG GCATTGCTGA GGATGTGCGC GCGATTCTGG AGCGGCTCTG GCGAACCGGC GACATCTTTC TCAAGCGACC CGATGTTTCC TCCGAGCTGC GCAACGTCAT TCACTACCTT CGCAATGTGT TCCCCGAAGC GCTGGCATAC CACGATCAAC GGTTACGTCA GATATGGGAG GCGATTGGCA ATGATCCGGC GTTGCTGGAT GATCCCGACT CGCTGCCGTT GATTACCTTT GGGACATGGG TCGGCGGCGA CCGGGATGGG CATCCGTTGG TGACCGCCGA TGTCACCCGC ATGGCGTTGA TGGAATTGCG CTCGCATGCG CTCGAACTGC TGCGTGATCA ATTAACAACG CTGGTGCGGC GTCTGAGTTT GTCTAATCTG CTGCAAACCC CGCCTGATTT CCTGACAAGC GCGTTGCACC ATTATGCTGA TCTGTTGGGA GATGCAGGGC AGCAGGCGCT GGATCGCAAT CCGCACGAAC CGTGGCGCCA GATGGTCAAC CTGATGCTGG CGCGGTTGCC GGAAACCGCT GGCGTGCCCG CGCCTGGACA TTACACGCGC GCTGCCGAAC TCATTGCCGA CCTGCGCCTG CTTGACGAGT CGCTTGTGGC AGTGGGTGCG ACGCGGTTGG CCCGATCCGA CGTGCGCCCG CTCATTCGCA GCGTCCGTGT CTTTGGCTTT CACCTCGCGG TGCTCGATAT TCGACAGAAC AGCGCATTTC ACGATCGGGC GCTGGCGCAA CTTCTCACGG CGGCCGGCAT CGACGGGAGC GATTACCCCT CCTGGGACGA AGCCCGCCGC ATGGCGCTGA TCGAAGCGGA ACTGGAGTCG CCGCGTCCCT TTACTCGAAT GGGCATGCCC GTCGGACCAG AAGCCGAGGC TGTACTGAGT TGTTATCGGG TGCTGGTCGA ACACATCCAC TCCTATGGCG CCGATGGATT GGGTGCGCTG ATCGTCAGTA TGACACGCAA CCTGTCCGAT CTCCTGGTTG TCTTCCTTTT CGCCCGTGAA ACGGGATTGC TCGTCAGCAC GCCGGAAGGT CCGGTCTGTC CTTTGCCGAT TGTGCCACTC TTCGAGACGA TCGACGATCT GGAACGGAGT CCGCTGGTTC TGCGCGACTA CCTGGCGCAT CCGATCGTAC AGCGCAGTCT GGAGTGGCAA CGGCAGACGC GAGGAGCGCC CGAACGGGTG CAGCAGGTGA TGATTGGGTA TAGCGACAGC AACAAGGATG GCGGCATCGG CGCAAGCCTG TGGGCATTGC AGAAGGCGCA GCAGGCGCTT GCCGCCGAAG GACGGATGGC AGGAGTGCGC ATTCGCTTCT TCCACGGTCG CGGGGGAACA ATGAGTCGCG GCGCTGGACC GACCGGACGC TTCATCAGGG CATTGCCGGT TGAAGCGCTG GCAGGCGACC TCCGCATGAC TGAACAGGGA GAGACAATTT CTCAGAAATA CGCCAATCGC ATCAGTGCTG TCTATAACTT CGAACTCTTG CTGGCGGGCG TCACGGGGTC CACGCTCCGA CCGTACACGC CCCGACCACA GCATCTAACG ATGGCGCTCG ACCTGCTTGC CCGGCACAGT CAGCGAGCAT ATCGCGCGCT GGTGGAGCAT GAGCGTTTTC TTCCGTTCTT TCGGGAAGCG ACGCCAATTG ACGCTATCGA GGCGAGTCGG ATCGGCTCGC GTCCGGCGCG GCGCACCGGC GCGCAGTCTC TCACCGACCT GCGCGCCATT CCGTGGGTGT TCAGTTGGAA CCAGGCGCGC TTTTACCTGT CGGGCTGGTA TGGCATCGGC AGCGGGCTGG CAATGGTGCA GGCGGAGGAA CCGGCGCTGT TCGATACCCT CTGTGCCGAA ATGCGCGAAT GGGCGCCACT GCACTATCTC CTGGGAAATG CGGCAACGAG CGTGATGAAT GCGGATGAGC ACATCATGCG CGCCTATGCG GAGTTGGTGC ATGATCCCGT CACCCGAACC GTCATTCTCG ATACCATTCT GGAAGAATTT GCGCGCACCC GCGCGATGCT CGAAAGGGTC TATGGCGGAT CTCTCGAGGA AAAACGTCCC TATATCGCCC GCCAGCTCGA ATTGCGGCGA GCGGGATTGT ATTCCCTCCA TCGCGAACAG ATCACGAGTC TACGCTCCTG GCGAGCGGTG CGCAACCTCG ATCCAGAAGC AGGCGAAGCG CAGCTGCGCC TGCTTCTGCT GCTGATCAAT GCGATTGCGG CTGGTTTGCG CGCCACAGGG TAA
|
Protein sequence | MSIGLGDIPA QARIERDLRW LLDRFYEALD AAGAADIHAL LPWTGNPSCP VRLPERAPQA YAIAFQLLNL VEENAEAQTR RSAETIDGPA ALRGLWGQIL VQLRDAGLSA AQIAAALPRI QVELVLTAHP TEAKRATVLA HYRELYLQLV KAENQMWTPL ERQGIAEDVR AILERLWRTG DIFLKRPDVS SELRNVIHYL RNVFPEALAY HDQRLRQIWE AIGNDPALLD DPDSLPLITF GTWVGGDRDG HPLVTADVTR MALMELRSHA LELLRDQLTT LVRRLSLSNL LQTPPDFLTS ALHHYADLLG DAGQQALDRN PHEPWRQMVN LMLARLPETA GVPAPGHYTR AAELIADLRL LDESLVAVGA TRLARSDVRP LIRSVRVFGF HLAVLDIRQN SAFHDRALAQ LLTAAGIDGS DYPSWDEARR MALIEAELES PRPFTRMGMP VGPEAEAVLS CYRVLVEHIH SYGADGLGAL IVSMTRNLSD LLVVFLFARE TGLLVSTPEG PVCPLPIVPL FETIDDLERS PLVLRDYLAH PIVQRSLEWQ RQTRGAPERV QQVMIGYSDS NKDGGIGASL WALQKAQQAL AAEGRMAGVR IRFFHGRGGT MSRGAGPTGR FIRALPVEAL AGDLRMTEQG ETISQKYANR ISAVYNFELL LAGVTGSTLR PYTPRPQHLT MALDLLARHS QRAYRALVEH ERFLPFFREA TPIDAIEASR IGSRPARRTG AQSLTDLRAI PWVFSWNQAR FYLSGWYGIG SGLAMVQAEE PALFDTLCAE MREWAPLHYL LGNAATSVMN ADEHIMRAYA ELVHDPVTRT VILDTILEEF ARTRAMLERV YGGSLEEKRP YIARQLELRR AGLYSLHREQ ITSLRSWRAV RNLDPEAGEA QLRLLLLLIN AIAAGLRATG
|
| |