Gene Ndas_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1152 
Symbol 
ID9245002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1405837 
End bp1409217 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content74% 
IMG OID 
Productpyruvate carboxylase 
Protein accessionYP_003679099 
Protein GI297560125 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCGCA AAGTGCTCGT GGCCAACCGG GGCGAGATCG CCATCCGCGC GCTGCGCGCC 
GGTTACGAGT TGGGCGCCCG CACCGTCGCC GTCTTCCCCC ACGAGGACCG CGGCTCCCTG
CACCGGCTCA AGGCCGACGA GGCCTACCAG ATCGGCGAGC CCGGCCACCC GGTGCGCGCC
TACCTGTCCG TGGACGAGAT CGTGGGCGCC GCCCGCCGGG CCGGGGCCGA CGCCGTCTAC
CCCGGCTACG GCTTCCTGTC GGAGAACCCC GAGCTGGCCC GCGCGTGCGA GCGCGCCGGG
ATCACCTTCG TGGGCCCGCC CGCCGACGTG CTGGAACTGA CCGGCAACAA GGCCAGTGCC
GTGGCCGCCG CCCGGGAGGC GGGCGTGCCC GTCCTGGAGT CCAGCGAGCC CTCCGACGAC
GTGGAGGCCC TGGTGGCCGC CGCGGAGCGG ATCGGCTTCC CGCTCTTCGT CAAGGCCGTG
GCCGGAGGGG GCGGGCGGGG CATGCGCCGC GTCCAGGAGC CCGAGCGGCT GCGCGAGGCG
GTGGAGGCGG CCATGCGCGA GGCCTCCGCC GCGTTCGGCG ACGCCACCGT GTTCCTGGAG
CGCGCCGTGG TCGACCCCCG CCACATCGAG GTGCAGATCC TCGCCGACGG CGAGGGCGGC
GTCGTGCACC TCTACGAGCG CGACTGCTCC CTCCAGCGGC GCCACCAGAA GGTCATCGAG
CTGGCCCCCG CGCCCAACCT CGACCCGGAC CTGCGCGACC GCATCTGCGC GGACGCGGTG
CGCTTCGCCC GCCGGATCGG CTACCGCAAC GCGGGCACGG TCGAGTTCCT GGTCGGCGCG
GACGGCAGCC ACGTCTTCAT CGAGATGAAC CCGCGCATCC AGGTCGAGCA CACGGTGACC
GAGGAGATCA CCGACGTGGA CCTGGTGCAG TCCCAGCTGC GGATCGCCTC CGGCGAGACC
CTGTCCGACC TGGGCATCTC CCAGGAGGGC GTCTACGTGC GCGGCGCGGC GCTCCAGTGC
CGCATCACCA CCGAGGACCC CGCCAACGGC TTCCGTCCCG ACACCGGGAC CATCAGCGCC
TACCGCTCCC CCGGCGGCTC GGGCATCCGC CTGGACGGCG GCACCTCCGC GGCGGGCACC
GAGATCAGCC CGCACTTCGA CTCGCTGCTG GTCAAGCTCA CCTGCCGGGG CCGCGACCTG
GCCACCGCGG TCAGCCGGGC GCGCCGGGCC GTGGCCGAGT TCCGCATCCG CAGCATCGCC
ACCAACATCC CCTTCCTCCA GGCGGTGCTG GACGACCCCG ACTTCCAGGC GGGGCGGATC
ACCACCTCCT TCATCGAGGA GCGCCCGCAC CTGCTGACCG CGCGGCCCTC CGCCGACCGC
GGCACGCGCC TGCTCACCTA CCTGGCCGAC GTCACGGTGA ACAAGCCGCA CGGGGAGCGC
CCCCAGCTGG TGGACCCCGC CACCAAGCTG CCGCCGCTGC CCGAGCCCGC CGGACCGCCG
CCCCCGGGTT CGCGCCAGCG CCTGGCCGAA CTGGGCCCTG AGGGGTTCGC GCGGTGGCTG
CGCGAGTCGC CGAACCTGGG CGTCACCGAC ACCACCTTCC GCGACGCGCA CCAGTCGCTG
CTGGCCACCC GGGTGCGCAC CCGCGACCTG CTGGCCGCGG CGCCCGCGGT CGCGCACACG
CTGCCCGAGC TGCTGTCCCT GGAGTGCTGG GGCGGCGCCA CCTACGACGT GGCGCTGCGC
TTCCTGGCCG AGGACCCCTG GGAGCGGCTG GCGGCGCTGC GCGAGGCGGT GCCCAACATC
TGCCTCCAGA TGCTGCTGCG CGGGCGCAAC ACGGTGGGGT ACACCCCCTA CCCGACCGAG
GTGACCGACG CGTTCGTGCG CGAGGCCGCC GAGACGGGCG TGGACGTCTT CCGGATCTTC
GACGCGCTCA ACGACGTCGA GCAGATGCGC CCGGCCATCG AGGCCGTGCG CGCGACCGGG
ACCTCGGTGG CCGAGGTGGC ACTGTGCTAC ACCTCGGACC TGTCCGACCC CGGCGAGAAG
CTCTACACGC TGGACTACTA CCTCAAACTG GCCGAGCGGA TCGTGGACGC GGGCGCGCAC
GTGCTCGCGA TCAAGGACAT GGCCGGTCTG CTGCGCGCTC CGGCCGCGGC GAAGCTGGTC
ACGGCGCTGC GCAGCGAGTT CGACCTGCCG GTGCACGTGC ACACCCACGA CACCCCGGGC
GGGCAGCTGG CCACCTACCT GGCGGCGGTC AACGCCGGGG CGGACGCCGT GGACGGCGCG
GTGGCGTCGA TGGCGGGCAC CACCTCGCAG CCGTCGCTGT CGGCGATCGT GGCCGCCTTC
GACCACTCCG AGCACTCCAC GGGCCTGAGC CTGGACGCGG TCAACGAACT GGAGCCGTAC
TGGGAGGCGG TGCGCCGGGT CTACGCGCCC TTCGAGGCCG GGCTGTCCTC GCCCACGGGC
CGGGTGTACC ACCACGAGAT CCCGGGCGGG CAGCTGTCCA ACCTGCGCAC GCAGGCGGTC
GCGCTGGGGC TGGGCGAGCA CTTCGAGGAG ATCGAGGCGA TGTACGGCGC CGCCGACCGG
ATGCTCGGGC ACCTGGTGAA GGTCACCCCC TCCTCCAAGG TCGTGGGCGA CCTGGCGCTG
CACCTGGTCG GCGCGGGGGT CTCCCCGGCC GACTTCGAGG CCGACCCCGG GCGCTTCGAC
GTCCCGGACT CGGTGGTGGG GTTCCTGCGC GGTGAGCTGG GCGTCCCGCC CGGCGGGTGG
CCCGAGCCCC TCCGCACGCG GGCGCTCCAG GGGCGCTCCG AGGCGCGGCC CGCTCAGGAG
CTGAGCGAGC AGGACCGCGC GGGACTGGCC GAGGACCGGC GGGCCACGCT CAACCGGCTG
CTCTTCCCCG GTCCGACGCG GGAGTTCGAG GAGCACCGGG CGGCCTACGG CGACACCAGC
GTGCTCTCCA GCGCGGACTT CTTCTACGGC CTGCGCGCGG GCGAGGAGTA CGCGGTGGAC
CTGTCGCCGG GCGTGCGGCT GCTCATCCAG CTGGAAGCGG TGGGCGAGGC CGACGAGCGC
GGCGTGCGCA CGGTGATGGC CACGCTCAAC GACCAGCTGC GCCCGCTCCA GATCCGCGAC
CGGGCCCTGG CCTCGCAGGT GCGGTCCGCC GAGAAGGCGG ACCGCAGCGA CCCCGGCCAG
GTCGCGGCGC CCTTCGCGGG CGCGGTGACC CTGACGGTCG CCGAGGGCGA GGCGGTGGAG
GCCGGTGCGA CGGTGGCCAC CATCGAGGCG ATGAAGATGG AGGCCGCGAT CACCGCGCCC
GTGTCGGGGA CGGTCACGCG GGTGGCGGTC GACCGGGTGC AGAAGGTGGA GGGCGGCGAC
CTGCTGGTCT GCCTCGGCTG A
 
Protein sequence
MFRKVLVANR GEIAIRALRA GYELGARTVA VFPHEDRGSL HRLKADEAYQ IGEPGHPVRA 
YLSVDEIVGA ARRAGADAVY PGYGFLSENP ELARACERAG ITFVGPPADV LELTGNKASA
VAAAREAGVP VLESSEPSDD VEALVAAAER IGFPLFVKAV AGGGGRGMRR VQEPERLREA
VEAAMREASA AFGDATVFLE RAVVDPRHIE VQILADGEGG VVHLYERDCS LQRRHQKVIE
LAPAPNLDPD LRDRICADAV RFARRIGYRN AGTVEFLVGA DGSHVFIEMN PRIQVEHTVT
EEITDVDLVQ SQLRIASGET LSDLGISQEG VYVRGAALQC RITTEDPANG FRPDTGTISA
YRSPGGSGIR LDGGTSAAGT EISPHFDSLL VKLTCRGRDL ATAVSRARRA VAEFRIRSIA
TNIPFLQAVL DDPDFQAGRI TTSFIEERPH LLTARPSADR GTRLLTYLAD VTVNKPHGER
PQLVDPATKL PPLPEPAGPP PPGSRQRLAE LGPEGFARWL RESPNLGVTD TTFRDAHQSL
LATRVRTRDL LAAAPAVAHT LPELLSLECW GGATYDVALR FLAEDPWERL AALREAVPNI
CLQMLLRGRN TVGYTPYPTE VTDAFVREAA ETGVDVFRIF DALNDVEQMR PAIEAVRATG
TSVAEVALCY TSDLSDPGEK LYTLDYYLKL AERIVDAGAH VLAIKDMAGL LRAPAAAKLV
TALRSEFDLP VHVHTHDTPG GQLATYLAAV NAGADAVDGA VASMAGTTSQ PSLSAIVAAF
DHSEHSTGLS LDAVNELEPY WEAVRRVYAP FEAGLSSPTG RVYHHEIPGG QLSNLRTQAV
ALGLGEHFEE IEAMYGAADR MLGHLVKVTP SSKVVGDLAL HLVGAGVSPA DFEADPGRFD
VPDSVVGFLR GELGVPPGGW PEPLRTRALQ GRSEARPAQE LSEQDRAGLA EDRRATLNRL
LFPGPTREFE EHRAAYGDTS VLSSADFFYG LRAGEEYAVD LSPGVRLLIQ LEAVGEADER
GVRTVMATLN DQLRPLQIRD RALASQVRSA EKADRSDPGQ VAAPFAGAVT LTVAEGEAVE
AGATVATIEA MKMEAAITAP VSGTVTRVAV DRVQKVEGGD LLVCLG