Gene Ndas_4742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4742 
Symbol 
ID9248624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5624812 
End bp5626290 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content72% 
IMG OID 
Productglucose-6-phosphate 1-dehydrogenase 
Protein accessionYP_003682634 
Protein GI297563660 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAGC AGTCCGCGTC CCGTGCCACC GACCTGGTGG TCTTCGGGGG TACCGGCGAC 
CTGTCCATGC GCAAACTCCT CCCCTCCCTG TACCTGCTGG ACCGCGACGG CCACCTGGAC
GGGGGGACCC GCGTCGTCGC CGTCTCGCGC GACGGCCTCA CCGACGCCGA CCTCAGGGAC
AAGGCCGCCT CGGCGGTCCG CGGCCACCAC GCCGTCCAGG TCGCCGAGCC CGACGTGCTC
CACCGGTTCC TGGAGCGCCT CTCCCACGTC ACCGTGGACG TCGGCGGCGA CCCCTCGGGC
TGGGACGACC TCACCGCCGC CCTGGCCCCC GGCCGCGACC GCGTCTTCTA CCTCGCCGTC
CCGCCGATGA TCTCCGGCGC GATCTGCCGC GGCCTGGACG ACGCCGGGCT CGTCACCCCC
GACTCCCGCG TGGTCATGGA GAAGCCCCTG GGCCGCGACC TGGCCTCCTC CCGCGCCGTC
AACGACGCGG TGGGGGCGGT CTTCGACGAG TCGCGCATCT ACCGCATCGA CCACTACCTG
GGCAAGGAGA CGGTGCAGAA CCTGCTGGTG CTGCGCTTCG CCAACGTCTT CCTCGAACCC
CTGTGGAACT CCCGCTGGAT CGACCACGTC CAGATCACCG CCGCCGAGAC CGTCGGCGTC
GGCGGGCGCC GGGGCTACTA CGACACCTCC GGCGCCATGC GCGACATGGT CCAGAACCAC
CTGCTCCAGC TGCTGTGCCT GACCGCCATG GAACCCCCGG CCAGCTACGA CCGCGACTCC
GTCCGCGACG AGAAGCTCAA GGTCCTCCAA TCGCTGCGCC CCCTCACCGG GGACCGCGTC
GCCGAGGACA CCGTGCGCGG CCAGTACGGA CGCGGCTCCG TCCACGGCTC GGAGGTGCTC
GGCTACCTCG ACGAGCCCGG CGGACCGGCC CTCAGCGACA CCGAGACCTT CGTGGCCCTG
CGCGCCGAGG TGGCCAACTG GCGCTGGGCG GGAGTGCCCT TCTACCTGCG CACCGGCAAG
CGCATGTCGC ACGCCCGCTC GGAGATCGTC ATCCGCTTCC GCGAGGTGCC GCACACGATC
TTCCCCGGCA CCAACCTGCC CGGCGCGGGC AGCCTCGTGA TCCGGCTCCA GCCGGACGAG
GGCATACACC TGACCATGCT GGCCAAGACG CCCGGCGCCG GAGCGCTGCG GCTGCGGCCC
GCCCCGCTGG AGCTCAGCTT CGCCGACACC TTCGCCACCC GCTCGCCCGA GGCCTACGAG
CGGCTGCTGA CGGACGTCCT GGCCGGTGAC TCCACCCTCT TCATGCGCCG CGACGAGGTC
GAGGCCGCCT GGCGCTGGGT GGACCCCGTC ATCGAGGCGT GGAACGGCCT GAGCACCACC
CCCGAGACCT ATCCAGCCGG AAGCGCCGGC CCCGCGGGCG CGCACCGGCT CATCGGCCGA
ACAGGCCGTA CGTGGTACGA AGAGGAGCGA CCCCGATGA
 
Protein sequence
MPQQSASRAT DLVVFGGTGD LSMRKLLPSL YLLDRDGHLD GGTRVVAVSR DGLTDADLRD 
KAASAVRGHH AVQVAEPDVL HRFLERLSHV TVDVGGDPSG WDDLTAALAP GRDRVFYLAV
PPMISGAICR GLDDAGLVTP DSRVVMEKPL GRDLASSRAV NDAVGAVFDE SRIYRIDHYL
GKETVQNLLV LRFANVFLEP LWNSRWIDHV QITAAETVGV GGRRGYYDTS GAMRDMVQNH
LLQLLCLTAM EPPASYDRDS VRDEKLKVLQ SLRPLTGDRV AEDTVRGQYG RGSVHGSEVL
GYLDEPGGPA LSDTETFVAL RAEVANWRWA GVPFYLRTGK RMSHARSEIV IRFREVPHTI
FPGTNLPGAG SLVIRLQPDE GIHLTMLAKT PGAGALRLRP APLELSFADT FATRSPEAYE
RLLTDVLAGD STLFMRRDEV EAAWRWVDPV IEAWNGLSTT PETYPAGSAG PAGAHRLIGR
TGRTWYEEER PR