Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0875 |
Symbol | |
ID | 7315898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 943483 |
End bp | 946326 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643615754 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_002512953 |
Protein GI | 220934054 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.127499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAG ACCAGTTGAT GCCCGTCAAT CTGCCGCCCC GGGACAAGGA ACTCCGCGCC CGGGTCAAAC TGTTCGGCAA CCTGCTCGGC AAGGTGCTCA AGCGCCTGGA GGGGCGACGC GCCCTGGCCG CCGTGGAGAC CCTGCGCAAG GGCTTCATCG CCCTGCGCAA GCAGGACGAT CCCGCCAAGC GCGCCAAGCT TATGGCCTTC ATCGACAAGC TGGACGCGGA CAGCCTGGAG CTGATCATCC GTGCCTTCAG CACCTACTTC AGCCTGGCCA ACATCGCCGA GGAGGATTTC CTCTACCGGG AGCGCCGCCG TCAGGTGAGT GAGGGCGGTC CCCTGTGGGT GGGTTCCTTC GACCACACCC TGCGTGAGCT GCATGAGCAG GGCATCCCGG CGGACAGCCT GCAGCGGCTG CTGGATCAGT TGGCCTACGT CCCCGTGTTC ACCGCCCATC CCACCGAGGC CCGCCGGCGC ACGGTGATGG AGGCCCAGCG ACGCATCTTC CTGGCCGCCG ACCGGCTCAA CGACCGACGC CTGGGCCGCG AGGAACGCGA GGAGCTGACC CAGGAGCTGG AGACCCGCAT CCAGGTGCTC TGGCGCACCA ACGAGGTGCG CGTGAAAAAG CCCCAGGTGC AGGACGAGAT CAAGTACGGC CTGTTCTATT TCGAGGAGAG CCTGTTCAAC GCGGTGCCCA TCACTTATCG CTTCCTGGAG AAGGCCATCC GCCGCACCTA CGGCAACGAC GAGGCCGGCA ATCCCCGGGT GCGGGTGCCG AGCTTCCTGC GCTTCGGCTC CTGGATCGGC GGTGACCGGG ACGGCAACCC GAACGTCACC CCCGCGGTCA CCGAGCTGGC CTGTCGCCTG GCCATGGAAC AGGTGCTGTC CGAATACCTG CGCCGGGTCA GCGCCCTGCG CCACGAACTG ACCCATTCCC TGTACATGTG CCAGCCCTCC GAGGCCTTCC TGGAGAGCCT GGAGCGGGAC AGCAACATCG CCACCGCCGT GTTCCGGGGC AGTGCCGACC GCTTCGAGAC CGAACCCTAC CGGCGCAAGC TCTACATCAT GCGCTACCGG CTCACGGAGA CCCTGAACAC CGTGCGTCGC CGGCTCAACG GCGAGAATGC CGTGCTGCCG GCGAACTCCG CCTATGCCTC GGCGGCGGAA ATGCTCAACG ATCTCCGGCT GATGCACGAT TCCCTGGTGA GCCACGGCGA CGCCAACGTG GCCGCCGGCA AGCTCACCGA CCTGATCCGC CTGGCGGAGA CCTTCGGCTT TCACCTGTTC CACCTGGATA TCCGCCAGGA GTCCACGGTG CACGGCCAGA CCGTGGCCGA GATCCTCAAG GCCACCGGCC TCAACGAGGA CTACGACGCC CTCTCCGAGC CCGAGCGCCT GGCCCTGCTG GCACGGCTGG CCGAGGCGGA GGAACTGCCC AGGCTTGATG GGGAGGCGCT GTCCGAGACC GCCCGGGAGA CCCTGGAGGT GTTCCACGTG ATCGGCCGCA TGCGCGCCGA GGTGGGCCCC GAGGGCATCG GCACCTACGT GATCTCCATG ACCCACGCCG CTTCCCACGT GATGGAGGTG ATGTTCCTGG CGCGCCTGGC CGGCCTGGCG GGGCGCAACG AGGCCGGCGA GTGCTTCTGC CAGATCCGCG TCTCGCCCCT GTTCGAGACC ATCGAGGACC TGCGCCACAT CGAGGAGGTG CTGGAGGACC TGCTCACCCA GCCGGTGTAC GCGCGCATGC TCAAGGCCTC CGGCAACCTG CAGGAGGTGA TGCTGGGCTA CTCCGATTCC TGCAAGGACG GCGGCATCCT GGCCTCCAGC TGGAATCTCT ACGAGGCCCA GCAGAAGATC CTGCGCATCA CCGGCGCCCA CCACGTGGCC TGCCGTCTGT TCCACGGCCG GGGCGGTACC ATCGGCCGGG GCGGCGGTCC CACCCACGAA TCCATCCTGG CCCAGCCGCC GGGCACGGTG CACGGCCAGA TCAAGTTCAC CGAGCAGGGC GAGGTACTCT CCTACAAGTA TTCCAACGTG GAGACGGCGG TCTACGAGCT GAGCATGGGC GCCACCGGCC TGATCAAGGC CAGCCGCTGC CTGATCGACA ACCCGCCCAT GGACCGGCGC GATTATCTGG GCATCATGGA CGAGCTGGCG GCGCTGGGCG AGGAGGCCTA CCGGGACCTC ACCGACCGCA CCGAGGGCGT GCTGGACTAC TTCTACGAAA TCACCCCGGT GCAGGAGATC GGGCAGCTGA ACATCGGCTC GCGCCCCTCC CACCGGCGCA AGGCAGACCG TTCCAAGAGC TCCATCCGGG CCATCCCCTG GGTGTTCGGC TGGGCCCAGT CCCGCCACAC CCTGCCGGCC TGGTACGGCA TCGGCACGGC CCTGGAGCGT TGGCGTCAGA ACGACCCGAG CCGCCTGGCC AAGCTACAGA CTATGTACAA CGAGTGGCCG TTCTTCCGCT CGCTGCTGTC CAACTGCCAG ATGGCGCTGA CCAAGGCGGA CATGCGCACC GCCGAGGAAT ACGCCCGGCT GTGCCACGAC CCGGAACTGG CGAAACGCGT GTTCGGGCGC ATTCACGAGG AATTCGAGCG CACCGTCACC CAGGTGCTCA ACGTGGCCGA CACCCAGACC CTGCTGGACG AGAATCCGAC CCTGGCCCTG TCCCTGATGC GCCGCAACCC CTACATGGAT CCCCTGAACC ACATCCAGAT CACCCTGCTG CGCCGGCACC GTGAACTCCA CGAGCGCCAG CCGGAGGCGG AACAGGATCC GTGGATCAGT CCGCTGCTGC GCTCCATCAA CGCCATCGCG GCGGGGATGC GCAACACGGG GTGA
|
Protein sequence | MKEDQLMPVN LPPRDKELRA RVKLFGNLLG KVLKRLEGRR ALAAVETLRK GFIALRKQDD PAKRAKLMAF IDKLDADSLE LIIRAFSTYF SLANIAEEDF LYRERRRQVS EGGPLWVGSF DHTLRELHEQ GIPADSLQRL LDQLAYVPVF TAHPTEARRR TVMEAQRRIF LAADRLNDRR LGREEREELT QELETRIQVL WRTNEVRVKK PQVQDEIKYG LFYFEESLFN AVPITYRFLE KAIRRTYGND EAGNPRVRVP SFLRFGSWIG GDRDGNPNVT PAVTELACRL AMEQVLSEYL RRVSALRHEL THSLYMCQPS EAFLESLERD SNIATAVFRG SADRFETEPY RRKLYIMRYR LTETLNTVRR RLNGENAVLP ANSAYASAAE MLNDLRLMHD SLVSHGDANV AAGKLTDLIR LAETFGFHLF HLDIRQESTV HGQTVAEILK ATGLNEDYDA LSEPERLALL ARLAEAEELP RLDGEALSET ARETLEVFHV IGRMRAEVGP EGIGTYVISM THAASHVMEV MFLARLAGLA GRNEAGECFC QIRVSPLFET IEDLRHIEEV LEDLLTQPVY ARMLKASGNL QEVMLGYSDS CKDGGILASS WNLYEAQQKI LRITGAHHVA CRLFHGRGGT IGRGGGPTHE SILAQPPGTV HGQIKFTEQG EVLSYKYSNV ETAVYELSMG ATGLIKASRC LIDNPPMDRR DYLGIMDELA ALGEEAYRDL TDRTEGVLDY FYEITPVQEI GQLNIGSRPS HRRKADRSKS SIRAIPWVFG WAQSRHTLPA WYGIGTALER WRQNDPSRLA KLQTMYNEWP FFRSLLSNCQ MALTKADMRT AEEYARLCHD PELAKRVFGR IHEEFERTVT QVLNVADTQT LLDENPTLAL SLMRRNPYMD PLNHIQITLL RRHRELHERQ PEAEQDPWIS PLLRSINAIA AGMRNTG
|
| |