Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1819 |
Symbol | |
ID | 7094098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1980422 |
End bp | 1983193 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643465146 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_002362126 |
Protein GI | 217977979 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.218615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAG TTCTCGACCC TATTTCGGCG AATAGCGCCT CGACGATCGA GCGGCCGCTG ATCGACGACA TTCGCCTGCT CGGCCAGATC CTCGGAGACA CCATCCGCGA GCAGGAAGGC GAGGACACGT TCGGCCTTGT CGAGCAGGCG CGCCGCCTCA GCGTCGAGTT CGAACGCCAT GCCGACCCGG ACGCCGAGCT CAAGCTGAAC GAGCTTTTGC GCTCGCTGAC GCCGCAGCAG GCGATGGCCG TGATCCGCGC CTTCAGCTAT TTCGCCCATC TCGCCAATAT CGCCGAGGAT CGGCACAATA TCCGCCGGCG CGCGGCGCAG GCGGCCGAGC GCCCCGACGA GCACGACGGC GGCACGCTCG ACAAGACCTT TGAGCTTCTC GCCGCCGCTT CGATCACGCC TGAGGCGGTG GCCGAAGCGA TCGCGCAAGC CCGCGTCTCG CCGGTGCTGA CCGCCCATCC GACGGAGGTG CAGCGCCGCA GCCTGCTCGA CGCCGAGCGG GCGATCGCGC AGTTGCTGCT CGCCCGCGAA ACCCTCAGCG GGGAAAGGCC GCGCAAGCAA AACGAGATGA TGCTGCGCGC CCGCATCGCT CAGCTCTGGC AGACGCGGCT GCTGCGCTAT GCGCGGCTCT CCGTGCGCGA CGAAATCGAA AATTCGCTGG CCTTCTACCA GATGACTTTT CTCAAGCAGA TCCCGCAGCT TTACGCCCGG ATCGAGGAGC GCCTCGGCAG CCTTCCGGTC GCCAGCTTCT TTCGCATGGG GTCGTGGATC GGCGGCGATC GCGACGGCAA TCCGAACGTC AACGCGGGAA CGCTCGAACT GGCCCTCCGC CGCCAATGCG ACACAGCGCT GCGCCACTAT CTGCGCGAGA TCCATGAGCT TGGCGCCGAG CTGTCGATTT CCGGCCGGCT GTTCGGCTGC GCGGAGGCGC TACAGGAGCT CGCCGAGAAA TCCGGCGACG AAAATCCCTT CCGCGACGAC GAGCCCTACC GGCGCGCGCT GATTGGCGTC TATTCGCGCC TGTCGTCGAC CTTGACCCTG CTGACCGGCG GCGAGGCGTT GCGCCACGCC GTCGCGCCGG GAGAGCCCTA CGCCAACGCC GGCGAGTTGC AGGCGGATTT GGGCGTTATC GAGGATTCCC TGCGCGCCAA CCATGGCGCG GCCCTCGTCC CCGGCCGGCT CGGACCGCTG CGCCGCGCGG TCGATGTGTT CGGCTTCCAT CTCGCATCGG TCGATCTGCG CCAAAGCTCG GACAGGCACG AGGAAACGCT GAAGGAGCTG ATGTCGGTCG CCCGCATCGT TCCCGACTAT TCCGCCCTCG ACGAAGACGC CCGGCAAAAT CTTCTGATCG ACCTTCTGCG CGATCCGCGC CCGCTGCGCG TGCCGGGCGT GACCTACAGC CCGCGCACGG AGGAGGAGCT CGAAATTTTC GAGGCCGCCC GCGACGCGCG CCGGACCTTC GGCGAGCCCG CCGTCGTCCA CTACATCATC AGCCACACCG AAACGGTCAG CGATCTGCTC GAAGTGCTCG CCTTGCAGAA AGAGTGCGGC ATGGTTCATG GCGTGCTCGG CGAGCCGGGG ACGTCGGCCG AGCTGATGGT CGTGCCGCTG TTCGAGACAA TCGAGGATCT GCGCAACGCC GCGACAATTA TGCGCGGCTT TTATGCGCTG CCAGGGGTGG CCGAGCTGAC GCGGGCCTCG GGCGCCGAGC AGGAAATCAT GCTCGGCTAT TCCGACAGCA ACAAGGACGG CGGCTTCTTC ACCAGCAATT GGGAGCTCTA CCGCTCCTCG ACCGCGCTCG CCGCGCTGTT CAAGGAGCAC GACGGCATCG CGCTGCGCCT GTTCCACGGC CGCGGCGGCA CAGTCGGGCG CGGCGGCGGT CCAAGCTATC AGGCGATCCT CGCCCAGCCT CCCGGCACCG TCGCGGGCCA GATCAGGCTC ACCGAACAAG GCGAGGTGAT CTCCTCGAAA TACGCCAATC CGGAGATCGG CCTCCTCAAT CTCGAAGCAC TCGTCGCCGC CACCATAGAG GCGACGCTGC TACCGTCCGC AGGCGCCGCG CCCAAGGAAT TCCTCGACGC GGCCGAGGAG CTGTCGCAAG CCAGCATGAA GGCGTTTCGC GCCGTCATTT ACGACAATCC GCGTTTCGTC GATTACTTCT TCACCGCGAC CCCGATCGCT GAAATTGCCG AACTCAACAT CGGCTCGCGC CCAGCCTCGC GGAAATCGAC CCGGCGCATC GAAGATCTGC GCGCGATCCC GTGGAGCTTT TCCTGGGGTC AGGCGCGCAT CAATCTGCCG GGCTGGTTTG GCTTCGGCTC GGCGGTCGAG CAATTTGTCG CCAAGGCGCC CGCCGAGCGC ATGGCGCTGC TGCGGCGCAT GGGCGCCGAA TGGCCATTCT TCCGCGCGCT CTTGTCGAAC ATGGACATGG TGCTCGCCAA GGCCGACATG GGATTGGCCG GGCGCTATGC CGAGCTCGTG CCCGATCAGG AGCTCGCCAA ATCCGTGTTC GGCTCGATCG AGGCGGAATG GGCCCGCACG ACCAAGGCGC TGAACGAGAT CACCGGCGTC TCGACGCGGC TCTCCGACAA TCCAGCGCTG GCCGGCGCCA TAAAACACCG CTTCCCTTAT ATTTCGCCGC TGAACCATCT GCAGGTCGAA CTGCTGCGCC GCTGGAGATC CGGCGAGCAC GACGACAAGA CGCTGCGCTC GATCCTGATC ACCATCAATG GCGTCGCCGC AGGCTTGCGC AATACCGGCT GA
|
Protein sequence | MTKVLDPISA NSASTIERPL IDDIRLLGQI LGDTIREQEG EDTFGLVEQA RRLSVEFERH ADPDAELKLN ELLRSLTPQQ AMAVIRAFSY FAHLANIAED RHNIRRRAAQ AAERPDEHDG GTLDKTFELL AAASITPEAV AEAIAQARVS PVLTAHPTEV QRRSLLDAER AIAQLLLARE TLSGERPRKQ NEMMLRARIA QLWQTRLLRY ARLSVRDEIE NSLAFYQMTF LKQIPQLYAR IEERLGSLPV ASFFRMGSWI GGDRDGNPNV NAGTLELALR RQCDTALRHY LREIHELGAE LSISGRLFGC AEALQELAEK SGDENPFRDD EPYRRALIGV YSRLSSTLTL LTGGEALRHA VAPGEPYANA GELQADLGVI EDSLRANHGA ALVPGRLGPL RRAVDVFGFH LASVDLRQSS DRHEETLKEL MSVARIVPDY SALDEDARQN LLIDLLRDPR PLRVPGVTYS PRTEEELEIF EAARDARRTF GEPAVVHYII SHTETVSDLL EVLALQKECG MVHGVLGEPG TSAELMVVPL FETIEDLRNA ATIMRGFYAL PGVAELTRAS GAEQEIMLGY SDSNKDGGFF TSNWELYRSS TALAALFKEH DGIALRLFHG RGGTVGRGGG PSYQAILAQP PGTVAGQIRL TEQGEVISSK YANPEIGLLN LEALVAATIE ATLLPSAGAA PKEFLDAAEE LSQASMKAFR AVIYDNPRFV DYFFTATPIA EIAELNIGSR PASRKSTRRI EDLRAIPWSF SWGQARINLP GWFGFGSAVE QFVAKAPAER MALLRRMGAE WPFFRALLSN MDMVLAKADM GLAGRYAELV PDQELAKSVF GSIEAEWART TKALNEITGV STRLSDNPAL AGAIKHRFPY ISPLNHLQVE LLRRWRSGEH DDKTLRSILI TINGVAAGLR NTG
|
| |