Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0828 |
Symbol | |
ID | 5704132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 925785 |
End bp | 928571 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270346 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001535737 |
Protein GI | 159036484 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.891362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.27066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACC AGCACGACCA CGACGGCCCG GACGCCGCGC TGCGCGCGGA CATCCGCCGC CTCGGCACCC TGCTCGGCCA GACTCTCGCC CGCCAGGAGG GTCGGCCCCT GCTCGACCTC GTCGAGGACA TCCGCGCCCA GGTCCGCACC GACGCCCCGG CGGCCGCGCA GCGGCTCGCC GGGTTGGACG TCGCCACCGG TACCCGGCTC GCCCGCGCCT TCTCCACGTA CTTCCACCTG GCCAACATCA CCGAACAGGT CCATCGCGCC CGGGATCTGC GCCGCCGCCG GGCCATTCAG GGTGGCTGGC TGGACCAGGC CGCCAAGATG ATCGCCGAGC GCGGTGTCCC CGCCGAGGAG ATCGCGGCGG CGGCGCGGCG GCTGGCCATC CGTCCCGTCT TCACCGCCCA CCCCACCGAG GCCGCCCGCC GCTCGATCCT GAGCAAGCTG CGCGCGATCG CCGACGAGTT GGACACCGAG ACCACCAACG CGGTCCTCTA CGGCGCCAGC GACGAGGGCC CGGCCAACCG CCGCCTCGCC GAGCTGCTCG ACCTGATGTG GCAGACCGAC GAGCTCCGGC TCGACCGCCC GGACCCGACC GACGAGGCAC GAAACGCCAT CTACTACCTG CGCGACCTGT ACGCCGAGGC CGCTCCGCAG GTGCTCGACG ACCTCGCCGA CACGCTGCGT ACGCTCGGGG TGGAGACCTC GCCCACCGCC CGTCCGCTCA CCTTCGGCAC CTGGATCGGT GGCGACCGGG ACGGCAACCC GTTCGTGACG CCCACGGTGA CCCGGGAAGT CCTGGCCATC CAACACGAGC ACGGCCTCGC CGCCACCGAG CGGGCCATGG ACCAGCTGAT CAACGAAGTG TCCGTCTCCC GGCGGCTGCG CGGCGTCTCG CTGGACCTCT CCGCAAGCCT CGCCACCGAC CTGGACGCGC TGCCCGAGGT GGCGCCCCGG TTCCGGCGCG TCAATGCTGA GGAGCCGTAC CGGCTCAAGG CCCGGTGCGT GAAGGCGAAG CTGGCCAACA CCCGGCAGCG GCTGCGGCAG GGCACGGCGC ACGTGCCGGG ACGGGACTAC CGTGGGTCGG GCGAGCTGAT CGCCGACCTG GAGCTGCTAC GCGCCTCGCT GGCCCGCAAC TCCGGTCAGC TCACCGCGGT GGGCCGGCTC GCCTCGACCA TCCGCACGGT CTCCGCCTTC GGCCTGCACC TGGCCACCAT GGACGTACGC GAACATGCCG AGAAGCACCA CGAGGTGCTG ACGCAGCTGT TCGGTGCGGT GGGCGAGGTG TCGGACTACC CGGCACTGAG CCGGCTGGAG CGCACCAAGT TGCTCGCCGA CGAACTGACC GGACGTCGAC CCCTGTCCAC CGTCGACACC CCACTCACCG AGTCGGCCCG ACGGACGTTC GACGTCTTCG GCACCATCCG GGAGGCGCAG GACCGGTTCG GGACCGAGGT GATCGAGTCC TACATCATCT CGATGACCCT CGGCGTGGAC GACGTCCTGG CCGCCGTCGT GTTGGCCCGG GAGGCCGGGC TGGTGGACGT GCACAGCAGC CGGGCCCGGA TCGGCTTCGT GCCGCTGCTG GAGACCCCGG CCGAGTTGAA CGCCGGCGGT GACCTGCTGG ACGAGCTGCT GTCGCTGCCC GCCTACCGGG CCCTGGTCGC GGCCCGCGGC GACGTCCAGG AGGTGATGCT CGGCTACTCC GACTCCAACA AGGAGGCCGG CATCACCACC AGCCAGTGGT CCATCCACCG GGCCCAGCGC GCGCTGCGCG ACGTGGCCGC CCGGCACGGC GTGCACCTGC GGCTCTTCCA CGGCCGGGGC GGCACCGTCG GCCGGGGCGG TGGGCCGACG CACGACGCGA TCCTGGCCCA GCCGTACGGC ACCCTCGACG GCGAGATCAA GGTGACCGAG CAGGGTGAGG TGATCTCCGA CAAGTACACC CTGCCGGCGC TGGCCCGGGA GAACCTGGAG CTGACCGTTG CCGCGGTGCT CCAGTCGACA CTGCTGCACA CCGCGCCCCG GCAGCCGGCC GAGATGCTGG AACGCTGGGA CGCGACAATG GACGTGGTGA GCGCGGCGGC CTACCGCTCG TACCGGGACC TGGTCGAGGA TCCGGACCTG CCGGCGTACT TCTGGGCGTC CACACCGACC GAACTGCTGG GCGCGTTGAA CATCGGTTCC CGGCCGGCGA AGCGGCCGAA CACGGGCGCC GGGCTCGCCG GCCTGCGGGC CATCCCGTGG GTGTTCGGCT GGACCCAGAC CCGACAGATC GTCCCCGGCT GGTTCGGTGT CGGCTCCGGC CTGGCGGCGG CCCGTGCCGC CGGCCACGCG GACGTGCTCG CTGAGATGCA CCGCAGTTGG CACTTCTTCC GGACGTTCCT GTCGAACGTG GAGATGATGC TGACCAAGAC CGACCTGGCG ATCGCTCGCC GGTATGTGGA GACGCTGGTG CCGAAGAAGC TGCACCCGAT CTTCCACAAG ATCGAGCAGG AGTACGAGTT GACCCGGCGG GAGGTGCTCG CCGTGACCGT GACCCCGGAC CTGCTGGAGA ACGCGCCGGT GCTGCAGCGC ACGCTGGCCG TACGGGACAC CTACCTGGAG CCGCTGCACC ACCTCCAGGT GGCGTTGCTG CGGCAGTACC GTGAGTCCGG TGCGCCGGGC CGGGCGGTGG CGACGGCGCC GGGTGGCCGA CGGGCGCCGA GCGACGGCAC CGCTCTGGAA CGCGCCCTGC TCACCACGGT CAACGGCATC GCCGCCGGCA TGCGCAATAC CGGCTGA
|
Protein sequence | MTDQHDHDGP DAALRADIRR LGTLLGQTLA RQEGRPLLDL VEDIRAQVRT DAPAAAQRLA GLDVATGTRL ARAFSTYFHL ANITEQVHRA RDLRRRRAIQ GGWLDQAAKM IAERGVPAEE IAAAARRLAI RPVFTAHPTE AARRSILSKL RAIADELDTE TTNAVLYGAS DEGPANRRLA ELLDLMWQTD ELRLDRPDPT DEARNAIYYL RDLYAEAAPQ VLDDLADTLR TLGVETSPTA RPLTFGTWIG GDRDGNPFVT PTVTREVLAI QHEHGLAATE RAMDQLINEV SVSRRLRGVS LDLSASLATD LDALPEVAPR FRRVNAEEPY RLKARCVKAK LANTRQRLRQ GTAHVPGRDY RGSGELIADL ELLRASLARN SGQLTAVGRL ASTIRTVSAF GLHLATMDVR EHAEKHHEVL TQLFGAVGEV SDYPALSRLE RTKLLADELT GRRPLSTVDT PLTESARRTF DVFGTIREAQ DRFGTEVIES YIISMTLGVD DVLAAVVLAR EAGLVDVHSS RARIGFVPLL ETPAELNAGG DLLDELLSLP AYRALVAARG DVQEVMLGYS DSNKEAGITT SQWSIHRAQR ALRDVAARHG VHLRLFHGRG GTVGRGGGPT HDAILAQPYG TLDGEIKVTE QGEVISDKYT LPALARENLE LTVAAVLQST LLHTAPRQPA EMLERWDATM DVVSAAAYRS YRDLVEDPDL PAYFWASTPT ELLGALNIGS RPAKRPNTGA GLAGLRAIPW VFGWTQTRQI VPGWFGVGSG LAAARAAGHA DVLAEMHRSW HFFRTFLSNV EMMLTKTDLA IARRYVETLV PKKLHPIFHK IEQEYELTRR EVLAVTVTPD LLENAPVLQR TLAVRDTYLE PLHHLQVALL RQYRESGAPG RAVATAPGGR RAPSDGTALE RALLTTVNGI AAGMRNTG
|
| |