Gene Namu_4447 details


Gene Information

Locus tag: Namu_4447
Symbol:
ID: 8450074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4932119
End bp: 4934956
Gene Length: 2838 bp
Protein Length: 945 aa
Translation table: 11
GC content: 70%
IMG OID: 645043494
Product: Phosphoenolpyruvate carboxylase
Protein accession: YP_003203722
Protein GI: 258654566
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
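
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4932119, 4934956)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length (2838 bp) and Protein Length (945 aa).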


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGA TCTCGGACAC GCTCGGCCAG TCTGCTGCAC TCGCCGACGT CCCCCCGACC 
TCGCCCGGTG TCGATCGCGA TGCGGCCCTG CGCGCGGACG TGCGGCGGGT CGGCTCCCTG
CTCGGCCAGA CCCTGGTCCG CCAGCAGGGA CCCGAACTGC TCGACCTGGT GGAGCAGGTG
CGCGGGCTGA CCAAACAGTC CCGCGAGGCG GCGGCCGCCA CCGAGCGGCA GTCCGCCACC
CTCAAGGTGC GAGAGATGCT GGCCGCCCTC CCGATTGCCG TGGCCACCGA ACTGGTGCGC
GCGTTCGCCA CCTACTTCCA GCTGGCCAAC GGCGCCGAGC AGGTCCATCA GGTGCGCGAG
CTGCGGGCGC GGGAGACCGG CGACGATCAT CTGGCCGCCG TCGTCCAGGA GATCGTGACG
ACCCTGGGCA CCGACGCGCT GCGCGAGGCG GTCGAGGCAC TCGAGGTGCA GCCCGTCTTC
ACCGCCCACC CGACGGAGGC CAGCCGCCGC TCGGTGTTGC TCAAGCTGCA GGCGTTGGCG
GACATCCTCG TTGTGCCCAC CCCGGCCGGG TCGGCCGCGC GGACCCGTCA GGACCGCAAG
CTGGCCGAGG TCATCGATCT GCTGTGGCAG ACCGACGAGA TCCGCCAGTT CCGGCCGACG
CCGGTGGACG AGGCGCGCAA CGCGCTGTTC TACCTGGAGG CCATCGTCGG GGACACCATC
CCGGCCCTGA CCGACGATCT GGCCGCCGCG ATGGCCCAGC ACGGGGTCGA ACTGTCGCCC
GAGGCCACTC CCCTGCTGCT GGGCAGCTGG ATCGGGGGCG ACCGGGACGG CAACCCGAAT
GTGACCGCCG CCGTCACTCG TGAGGTGCTG GCCCTGCAGC ATCAGTCGGC GATCGCCATC
GCGATTGGCA AGATCGACGA GTTGTTGCTG GCGCTGTCCA GCTCGACCGT CATCGCCGGG
GCCTCCGACG AGCTCGTCGC CTCCATCCGG ACCGATCTGG ACCACCTGCC GCTGCTGGAC
CCTCGCGTCA AAGAACTCAA CAAGGACGAG CCGATCCGGC TCAAGCTGAC CTGTGTCAAG
GCCAAGCTGA TCAACACCCG GGCCCGGGTG AACGCCGGCC GGGCGCACGA GCCGGGCCGC
GACTACGCGA GCAAGGGCGA GTTGCTGGCC GACCTGGCCA TCGTTCATCG GTCACTGGAG
CAGCACGGCG GCGCCCTGGC CGCCGCCGGC ATCCTGGCCA CCGCCCAGCG CGCCCTCGCC
GTCAGCGGGC TGAACGTGGC CTACATGGAC ATCCGCGAGC ACTCCGAGGC GCACCACCAG
GTGATCGCGC AGCTGGTCGA CCGGTTGGGT GAGCTGGACC GGCCGTACGA CTCGATGAGC
CGGCCCCAGC GCATGCGGTG GCTGTCCAAA GAGTTGACCT CGCATCGACC GCTGACCACC
CTGCCGGCCC CGCTGGATGC GGCCGGGACC AAGACGTTCT CGGTCTTCTC CGAGATCAAG
GACGCCCAGG CGACCTACGG GCCGGAGGTG ATCGTCACCT ACATCACCTC GATGACCATG
GGGGCCGACG ACATCCTGGC CGCCGCGGTG CTGGCCCGGG AGGCCGGCCT GATCGACGTC
TACGGCACCC CCGGCGATCC GACCCGGGGC CCGTTCGCCG CCATCGGCTT CGCCCCGCTG
CTGGAGACCG TCGAGGAACT GCGCAAGTCC GCCGAGGTGG TCGACGAGCT GCTCTCGGAC
CCCACCTACC GGGAGATCGT CCGGCTGCGC GGCGACGTGC AGGAGATCAT GCTCGGCTAC
TCGGACTCGA ACAAGCAGTC CGGGATCACC ACCAGCCAGT GGGAGATCCA CAAGACCGAA
CGGCTGCTGC GCGACCTGGC CGCCCGGCAC GGGGTGTCGT TGACGCTGTT CCACGGCCGT
GGCGGCACCG TCGGCCGCGG CGGGGGCCCG ACCTACGATT CGATCCTGGC CCAGCCGTAC
GGAGTGATTA CCGGAGCCAT CAAGTTCACC GAGCAGGGCG AGGTGATCAG CGGCAAGTAC
GGGATGCCGG ACCTGGCCAA GGAGAACCTG GCGCTGACCG TGGCCGCCAC CCTGCGGGCC
ACCACCCTGC ACACCGAGTC CCGGCAGACC GCGCAGGAGC TGCGGGACTG GGACCAGTGG
ATGGAGCAGG TCAGCGACGC GGCTTTCGCC GCCTACGTCG GGCTGATCGA CGACCCCGAC
CTGCCGGCCT ACTTCCTGGC CTCCACCCCG ACCGAGCAAC TGGGTCAGCT CAACATCGGC
TCGCGGCCGG CCCGCCGGCC CGACTCCGGC GGCGGCATCG GCGGTCTGCG GGCCATCCCC
TGGGTGTTCG GCTGGACCCA GTCCCGGCAG ATCGTCCCCG GGTGGTTCGG CGTCGGCTCC
GGCCTGCGTG CCGCCCGCGA AGCCGGCGGT GAGCAGATGC TGCGCACCAT GCACCGCAAG
TGGCACTTCT TCCGCACCTT CATCTCCAAC GTCGAGATGA CCCTGGCCAA GACCGACATG
GGGATCGCGG CCATGTACGT GGAGTCGCTG GTGCCCGCAC CGTTGCGACG CCTGTTCGAG
GTGATCAAGG CCGAGCACGA TCTCACCGTC GCCGAGGTCC TGCGGGTCAC CGGCGAGGCC
GAGCTCCTGG ACGATCAACC CGCCCTCAAG CGGACCCTGG GCGTACGCGA GCCCTATCTC
GCCCCGATCT CCTACCTTCA GGTCGACCTG CTCAACCGCA TCCGATCGCA GGCCGACGAG
CAGGTCGACC CCCAGTTGCG GCGGGCCATG CTGCTCACCA TCAACGGCGT GGCCGCCGGG
ATGCGCAACA CCGGCTGA
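
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten, as above. This is a minimal sketch: `gc_fraction` is a hypothetical helper, and it assumes GC content is the fraction of G and C among unambiguous A/C/G/T bases, which may differ from how the database derives its rounded percentage.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Small illustrative input, not the full gene: 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))
```

Passing the full gene sequence as one string (line breaks included) should give a value comparable to the GC content reported in the Gene Information section.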
 
Protein sequence
MTQISDTLGQ SAALADVPPT SPGVDRDAAL RADVRRVGSL LGQTLVRQQG PELLDLVEQV 
RGLTKQSREA AAATERQSAT LKVREMLAAL PIAVATELVR AFATYFQLAN GAEQVHQVRE
LRARETGDDH LAAVVQEIVT TLGTDALREA VEALEVQPVF TAHPTEASRR SVLLKLQALA
DILVVPTPAG SAARTRQDRK LAEVIDLLWQ TDEIRQFRPT PVDEARNALF YLEAIVGDTI
PALTDDLAAA MAQHGVELSP EATPLLLGSW IGGDRDGNPN VTAAVTREVL ALQHQSAIAI
AIGKIDELLL ALSSSTVIAG ASDELVASIR TDLDHLPLLD PRVKELNKDE PIRLKLTCVK
AKLINTRARV NAGRAHEPGR DYASKGELLA DLAIVHRSLE QHGGALAAAG ILATAQRALA
VSGLNVAYMD IREHSEAHHQ VIAQLVDRLG ELDRPYDSMS RPQRMRWLSK ELTSHRPLTT
LPAPLDAAGT KTFSVFSEIK DAQATYGPEV IVTYITSMTM GADDILAAAV LAREAGLIDV
YGTPGDPTRG PFAAIGFAPL LETVEELRKS AEVVDELLSD PTYREIVRLR GDVQEIMLGY
SDSNKQSGIT TSQWEIHKTE RLLRDLAARH GVSLTLFHGR GGTVGRGGGP TYDSILAQPY
GVITGAIKFT EQGEVISGKY GMPDLAKENL ALTVAATLRA TTLHTESRQT AQELRDWDQW
MEQVSDAAFA AYVGLIDDPD LPAYFLASTP TEQLGQLNIG SRPARRPDSG GGIGGLRAIP
WVFGWTQSRQ IVPGWFGVGS GLRAAREAGG EQMLRTMHRK WHFFRTFISN VEMTLAKTDM
GIAAMYVESL VPAPLRRLFE VIKAEHDLTV AEVLRVTGEA ELLDDQPALK RTLGVREPYL
APISYLQVDL LNRIRSQADE QVDPQLRRAM LLTINGVAAG MRNTG
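
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is an illustration under stated assumptions, not the database's own pipeline: `translate` is a hypothetical helper, the input is assumed to be the coding strand in frame, and the codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # drop the spaces and line breaks of the layout
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from this record.
print(translate("ATGACACAGA TCTCGGACAC GCTCGGCCAG"))  # MTQISDTLGQ
print(translate("ATGCGCAACA CCGGCTGA"))                # MRNTG
```

The two outputs match the first ten and last five residues of the protein sequence listed above.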