Gene Information |
Locus tag | Namu_4447 |
Symbol | |
ID | 8450074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4932119 |
End bp | 4934956 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645043494 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_003203722 |
Protein GI | 258654566 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
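The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(4932119, 4934956)
print(length)                              # 2838
print(expected_protein_length_aa(length))  # 945
```

Both results match the Gene Length (2838 bp) and Protein Length (945 aa) fields.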
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGA TCTCGGACAC GCTCGGCCAG TCTGCTGCAC TCGCCGACGT CCCCCCGACC TCGCCCGGTG TCGATCGCGA TGCGGCCCTG CGCGCGGACG TGCGGCGGGT CGGCTCCCTG CTCGGCCAGA CCCTGGTCCG CCAGCAGGGA CCCGAACTGC TCGACCTGGT GGAGCAGGTG CGCGGGCTGA CCAAACAGTC CCGCGAGGCG GCGGCCGCCA CCGAGCGGCA GTCCGCCACC CTCAAGGTGC GAGAGATGCT GGCCGCCCTC CCGATTGCCG TGGCCACCGA ACTGGTGCGC GCGTTCGCCA CCTACTTCCA GCTGGCCAAC GGCGCCGAGC AGGTCCATCA GGTGCGCGAG CTGCGGGCGC GGGAGACCGG CGACGATCAT CTGGCCGCCG TCGTCCAGGA GATCGTGACG ACCCTGGGCA CCGACGCGCT GCGCGAGGCG GTCGAGGCAC TCGAGGTGCA GCCCGTCTTC ACCGCCCACC CGACGGAGGC CAGCCGCCGC TCGGTGTTGC TCAAGCTGCA GGCGTTGGCG GACATCCTCG TTGTGCCCAC CCCGGCCGGG TCGGCCGCGC GGACCCGTCA GGACCGCAAG CTGGCCGAGG TCATCGATCT GCTGTGGCAG ACCGACGAGA TCCGCCAGTT CCGGCCGACG CCGGTGGACG AGGCGCGCAA CGCGCTGTTC TACCTGGAGG CCATCGTCGG GGACACCATC CCGGCCCTGA CCGACGATCT GGCCGCCGCG ATGGCCCAGC ACGGGGTCGA ACTGTCGCCC GAGGCCACTC CCCTGCTGCT GGGCAGCTGG ATCGGGGGCG ACCGGGACGG CAACCCGAAT GTGACCGCCG CCGTCACTCG TGAGGTGCTG GCCCTGCAGC ATCAGTCGGC GATCGCCATC GCGATTGGCA AGATCGACGA GTTGTTGCTG GCGCTGTCCA GCTCGACCGT CATCGCCGGG GCCTCCGACG AGCTCGTCGC CTCCATCCGG ACCGATCTGG ACCACCTGCC GCTGCTGGAC CCTCGCGTCA AAGAACTCAA CAAGGACGAG CCGATCCGGC TCAAGCTGAC CTGTGTCAAG GCCAAGCTGA TCAACACCCG GGCCCGGGTG AACGCCGGCC GGGCGCACGA GCCGGGCCGC GACTACGCGA GCAAGGGCGA GTTGCTGGCC GACCTGGCCA TCGTTCATCG GTCACTGGAG CAGCACGGCG GCGCCCTGGC CGCCGCCGGC ATCCTGGCCA CCGCCCAGCG CGCCCTCGCC GTCAGCGGGC TGAACGTGGC CTACATGGAC ATCCGCGAGC ACTCCGAGGC GCACCACCAG GTGATCGCGC AGCTGGTCGA CCGGTTGGGT GAGCTGGACC GGCCGTACGA CTCGATGAGC CGGCCCCAGC GCATGCGGTG GCTGTCCAAA GAGTTGACCT CGCATCGACC GCTGACCACC CTGCCGGCCC CGCTGGATGC GGCCGGGACC AAGACGTTCT CGGTCTTCTC CGAGATCAAG GACGCCCAGG CGACCTACGG GCCGGAGGTG ATCGTCACCT ACATCACCTC GATGACCATG GGGGCCGACG ACATCCTGGC CGCCGCGGTG CTGGCCCGGG AGGCCGGCCT GATCGACGTC TACGGCACCC CCGGCGATCC GACCCGGGGC CCGTTCGCCG CCATCGGCTT CGCCCCGCTG CTGGAGACCG TCGAGGAACT GCGCAAGTCC GCCGAGGTGG TCGACGAGCT GCTCTCGGAC CCCACCTACC GGGAGATCGT CCGGCTGCGC GGCGACGTGC AGGAGATCAT GCTCGGCTAC 
TCGGACTCGA ACAAGCAGTC CGGGATCACC ACCAGCCAGT GGGAGATCCA CAAGACCGAA CGGCTGCTGC GCGACCTGGC CGCCCGGCAC GGGGTGTCGT TGACGCTGTT CCACGGCCGT GGCGGCACCG TCGGCCGCGG CGGGGGCCCG ACCTACGATT CGATCCTGGC CCAGCCGTAC GGAGTGATTA CCGGAGCCAT CAAGTTCACC GAGCAGGGCG AGGTGATCAG CGGCAAGTAC GGGATGCCGG ACCTGGCCAA GGAGAACCTG GCGCTGACCG TGGCCGCCAC CCTGCGGGCC ACCACCCTGC ACACCGAGTC CCGGCAGACC GCGCAGGAGC TGCGGGACTG GGACCAGTGG ATGGAGCAGG TCAGCGACGC GGCTTTCGCC GCCTACGTCG GGCTGATCGA CGACCCCGAC CTGCCGGCCT ACTTCCTGGC CTCCACCCCG ACCGAGCAAC TGGGTCAGCT CAACATCGGC TCGCGGCCGG CCCGCCGGCC CGACTCCGGC GGCGGCATCG GCGGTCTGCG GGCCATCCCC TGGGTGTTCG GCTGGACCCA GTCCCGGCAG ATCGTCCCCG GGTGGTTCGG CGTCGGCTCC GGCCTGCGTG CCGCCCGCGA AGCCGGCGGT GAGCAGATGC TGCGCACCAT GCACCGCAAG TGGCACTTCT TCCGCACCTT CATCTCCAAC GTCGAGATGA CCCTGGCCAA GACCGACATG GGGATCGCGG CCATGTACGT GGAGTCGCTG GTGCCCGCAC CGTTGCGACG CCTGTTCGAG GTGATCAAGG CCGAGCACGA TCTCACCGTC GCCGAGGTCC TGCGGGTCAC CGGCGAGGCC GAGCTCCTGG ACGATCAACC CGCCCTCAAG CGGACCCTGG GCGTACGCGA GCCCTATCTC GCCCCGATCT CCTACCTTCA GGTCGACCTG CTCAACCGCA TCCGATCGCA GGCCGACGAG CAGGTCGACC CCCAGTTGCG GCGGGCCATG CTGCTCACCA TCAACGGCGT GGCCGCCGGG ATGCGCAACA CCGGCTGA
|
Protein sequence | MTQISDTLGQ SAALADVPPT SPGVDRDAAL RADVRRVGSL LGQTLVRQQG PELLDLVEQV RGLTKQSREA AAATERQSAT LKVREMLAAL PIAVATELVR AFATYFQLAN GAEQVHQVRE LRARETGDDH LAAVVQEIVT TLGTDALREA VEALEVQPVF TAHPTEASRR SVLLKLQALA DILVVPTPAG SAARTRQDRK LAEVIDLLWQ TDEIRQFRPT PVDEARNALF YLEAIVGDTI PALTDDLAAA MAQHGVELSP EATPLLLGSW IGGDRDGNPN VTAAVTREVL ALQHQSAIAI AIGKIDELLL ALSSSTVIAG ASDELVASIR TDLDHLPLLD PRVKELNKDE PIRLKLTCVK AKLINTRARV NAGRAHEPGR DYASKGELLA DLAIVHRSLE QHGGALAAAG ILATAQRALA VSGLNVAYMD IREHSEAHHQ VIAQLVDRLG ELDRPYDSMS RPQRMRWLSK ELTSHRPLTT LPAPLDAAGT KTFSVFSEIK DAQATYGPEV IVTYITSMTM GADDILAAAV LAREAGLIDV YGTPGDPTRG PFAAIGFAPL LETVEELRKS AEVVDELLSD PTYREIVRLR GDVQEIMLGY SDSNKQSGIT TSQWEIHKTE RLLRDLAARH GVSLTLFHGR GGTVGRGGGP TYDSILAQPY GVITGAIKFT EQGEVISGKY GMPDLAKENL ALTVAATLRA TTLHTESRQT AQELRDWDQW MEQVSDAAFA AYVGLIDDPD LPAYFLASTP TEQLGQLNIG SRPARRPDSG GGIGGLRAIP WVFGWTQSRQ IVPGWFGVGS GLRAAREAGG EQMLRTMHRK WHFFRTFISN VEMTLAKTDM GIAAMYVESL VPAPLRRLFE VIKAEHDLTV AEVLRVTGEA ELLDDQPALK RTLGVREPYL APISYLQVDL LNRIRSQADE QVDPQLRRAM LLTINGVAAG MRNTG
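The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be stripped before any computation. The sketch below shows one way to recompute the GC content field and to check that a gene sequence ends in a stop codon. It assumes translation table 11 (stop codons TAA, TAG, TGA), and the helper names are hypothetical. The example calls use short made-up strings; paste the Gene sequence value in their place to check the record itself.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


# Short illustrative inputs, not taken from the record.
print(gc_fraction("ATGC GGCC"))       # 0.75
print(ends_with_stop("ATGAAA TGA"))   # True
```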