Gene Ndas_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3940 
Symbol 
ID9247811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4709678 
End bp4712404 
Gene Length2727 bp 
Protein Length908 aa 
Translation table11 
GC content73% 
IMG OID 
ProductPhosphoenolpyruvate carboxylase 
Protein accessionYP_003681843 
Protein GI297562869 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.622429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAACG CCCGGGCAAA ACATCGAAAA TACGCCCCCG CGCACCTGGC GGCGGGTGCG 
ATCTCTGTAG GGTCGGTGAC CATGACAGCG GTAAGCGCGG AACGCGACAG TGCACGGCAG
GAAGTTCCCG AGCAGCTCAG GAATGACGTC AAACTGCTCG GCGAGATGCT CGGAACGGTC
CTCGCCGAAA GCGGCGGCGA GGATCTGCTC GCCGATGTCG AGAAGCTCAG GAGGGCGGTC
ATCGGCGCCC GCGACGGGTC GGTCACGGGG GAGGAGATCA CCGCGATCGT CGCCGCCTGG
CCGCTGGAGC GCGCCAAGCA GGTGGCCCGC GCCTTCACCG TCTACTTCCA CCTGGCCAAC
CTGGCCGAGG AACACCAGCG GATGCGGGCC CTGCGCGAGC GCGACGACGC CGCCAACCCG
CCCCGCGAGT CCCTCGCGGC GGCCGTCCGC GCCATCCGCG AGGGCGAGGG CGGTGAGGAA
CGCCTCGACG AGCTGATCGC GGGCATGGAG TTCCACCCCG TGGTCACCGC CCACCCCACC
GAGGCGCGCC GCCGCGCCGT GTCCACCTCC ATCCTGCGCG TCAGCGCCCA GCTGGAGGCC
TGGCACTCCT CCCACGAGGG CAGCAGCGCC GCGGCCGAGG CGCACCGGCG CCTGCTGGAG
GAGATCGACC TGCTCTGGCG CACCTCCCAG CTGCGCTACA CCAGGCTCAA CCCGCTCGAC
GAGGTGCGCA CCGCCCTCGC CGCGTTCGAC GAGACCATCT TCTCCGTCAT CCCGCAGGTC
TACCGGAGCC TGGACGCCGC CATCGACCCC GAGGGCACCG GCGTGCGCCC GCCCCGCGCC
ACGCCCTTCG TGCGCTACGG CAGCTGGATC GGCGGCGACC GCGACGGCAA CCCCTACGTC
ACCTCCGACA TCACCCGGGA GGCCGTGCTC ATCCAGTCCG AGCACGTGCT GCGCGGCCTG
GAGGCCTCCT GCACCAAGGT GGCCCGCACC CTCACCGCCT ACTCCAACCT CACCCCCGCC
AGCCCGGCGC TGCTCGACGC GCTCGCCTCG GCCAAGGCGG GACAGCCCGA GCTGACGGCG
GAGATCGGGG CGCGCTCGCC CAACGAACCG CACCGCCAGC TCCTGCTGCT GGCCGCCGCC
CGGCTGCGCG CCACGCGCGA GCGCGACGCC GACCTGGCCT ACCCCGACGC CGACGCCTTC
CTGGCCGACC TGCGCACCGT GCAGGAGTCC CTGGCCGAGG CGGGCGCGGT CCGCCAGGCC
TACGGCGAGC TCCAGCACCT GGTCTGGCAG GCGCAGACCT TCGGCTTCCA CCTCGCCGAA
CTGGAGATCC GCCAGCACAG CGAGGTGCAC GCCGCCGCGC TCGCCGAACT GCGCGAGGGC
GGGGAGCTCT CCGAGCGCAC CGAGGAGGTC CTCGCCACGA TCCGGGTGAT CGCCTGGATC
CAGGAGCGCT TCGGCGTGGA GGCCTGCCGC CGCTACGTCG TCAGCTTCAC CCGCTCGGCC
GAGGACATCG CGGCCGTGTA CGAGCTGGCC GCCCACGCGC TGCCCGCCGG ACGCGTGCCC
GTCCTGGACG TCGTCCCGCT GTTCGAGACC GGCGCCGACC TGGACGCCTC GCCGCACGTG
CTCGACGGCA TGCTCAAGCT GCCCCAGGTC AACAAGCGCC TGGACGAGAC CGGCCGCAGG
ATCGAGGTCA TGCTCGGCTA CAGCGACTCC GCCAAGGACG TCGGGCCGGT CAGCGCCACC
CTGCGCCTCT ACGACGCCCA GGCCCGCCTG GCCGCGTGGG CGGAAGAGCA CGACGTGCGC
CTGACCCTGT TCCACGGCCG CGGCGGCTCG CTCGGCCGCG GCGGGGGCCC GGCCAGCCGC
GCCCTGCTCG CCCAGGCGCC CGGCTCGGTC GGCGGACGGT TCAAGGTCAC CGAGCAGGGC
GAGGTCATCT TCGCCCGCTA CGGCCAGCCC GCGATCGCCC GCCGCCACAT CGAGCAGGTC
GGCCACGCGG TGCTGATGGC CTCCACCGAC GCCGTCCAGG AGCGCGAGCG CTCCGCCGAG
CGCCGGTACC GCGCCCACGC CGACACCATC GCCCGCGCCG CCCAGGAGGC CTACCTGGAG
CTGATCAACA CCGAGGACTT CGCGGTGTGG TTCTCCCGGG TCAGCCCCCT GGAGGAGCTG
GGCGAGCTGC GGCTGGGCTC GCGCCCCTCG CGGCGCGGCG CCGCGCGCGG GCTCGGCGAC
CTGCGGGCCA TCCCGTGGGT GTTCGCCTGG ACCCAGACCC GGGTCAACCT GCCGGGCTGG
TTCGGCCTGG GCACGGGCCT GGCCGCGGTG GAGGACCTGG GCGTGCTCCA GGCGGCCTAC
CGCGAGTGGC CCATGTTCTC GTCCCTGCTG GACAACGCGG AGATGAGCCT GGCCAAGACC
GACCGCGACA TCGCCCAGCG CTACCTGGCG CTGGGCGGGC GGCCGGAGCT GACCGAGCGG
GTGCTCGCCG AGTACGACCG CACCCGCGAC CTGGTGCTCA AGGTGACCGG GCACAGCCGC
CTGCTGGAGA ACCGGGCGGT GCTCTCCCGC GCGGTGGACC TGCGCAACCC GTACGTGGAC
GCGCTCTCGC ACCTCCAGCT GCGCGCCCTG GAGGCGCTGC GGGGCGAGGA GGCCGACTCC
CTGTCTGAGG AGGACCAGCA GCACCTGGAG CGGCTGCTGC TGCTCTCGGT CAACGGCGTG
GCGGCCGGAC TCCAGAACAC CGGCTGA
 
Protein sequence
MVNARAKHRK YAPAHLAAGA ISVGSVTMTA VSAERDSARQ EVPEQLRNDV KLLGEMLGTV 
LAESGGEDLL ADVEKLRRAV IGARDGSVTG EEITAIVAAW PLERAKQVAR AFTVYFHLAN
LAEEHQRMRA LRERDDAANP PRESLAAAVR AIREGEGGEE RLDELIAGME FHPVVTAHPT
EARRRAVSTS ILRVSAQLEA WHSSHEGSSA AAEAHRRLLE EIDLLWRTSQ LRYTRLNPLD
EVRTALAAFD ETIFSVIPQV YRSLDAAIDP EGTGVRPPRA TPFVRYGSWI GGDRDGNPYV
TSDITREAVL IQSEHVLRGL EASCTKVART LTAYSNLTPA SPALLDALAS AKAGQPELTA
EIGARSPNEP HRQLLLLAAA RLRATRERDA DLAYPDADAF LADLRTVQES LAEAGAVRQA
YGELQHLVWQ AQTFGFHLAE LEIRQHSEVH AAALAELREG GELSERTEEV LATIRVIAWI
QERFGVEACR RYVVSFTRSA EDIAAVYELA AHALPAGRVP VLDVVPLFET GADLDASPHV
LDGMLKLPQV NKRLDETGRR IEVMLGYSDS AKDVGPVSAT LRLYDAQARL AAWAEEHDVR
LTLFHGRGGS LGRGGGPASR ALLAQAPGSV GGRFKVTEQG EVIFARYGQP AIARRHIEQV
GHAVLMASTD AVQERERSAE RRYRAHADTI ARAAQEAYLE LINTEDFAVW FSRVSPLEEL
GELRLGSRPS RRGAARGLGD LRAIPWVFAW TQTRVNLPGW FGLGTGLAAV EDLGVLQAAY
REWPMFSSLL DNAEMSLAKT DRDIAQRYLA LGGRPELTER VLAEYDRTRD LVLKVTGHSR
LLENRAVLSR AVDLRNPYVD ALSHLQLRAL EALRGEEADS LSEEDQQHLE RLLLLSVNGV
AAGLQNTG