Gene Ndas_4871 details


Gene Information

Locus tag: Ndas_4871
Symbol:
ID: 9248758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 1876
End bp: 3363
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: putative short chain dehydrogenase
Protein accession: YP_003682760
Protein GI: 297563787
COG category:
COG ID:
TIGRFAM ID:
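
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1876
END_BP = 3363
GENE_LENGTH_BP = 1488
PROTEIN_LENGTH_AA = 495


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: all codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3363 - 1876 + 1 gives 1488 bp, and 1488 / 3 - 1 gives 495 residues, matching the Gene Length and Protein Length fields.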


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00534901
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.736841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACA CGGGTACCCG GACGCTGCCG GAGGAGTCGG AGCACCCGGC GGAGCGGATC 
GACCCCGACC GGTTGGCCGC GTGCCTGGAC GTGATCGCCC GCGCCGGTGA GCTGCCCAGC
GACCACCCGG ACTCGGTCGC CCTCCAGCGC GCGACCGCCC GCCTGTTCAA GAACGTCAAG
GAGCGGCGCC GCAAGGAGCG CCAGGCCGCC CGCCAGGCCC ACGACAGGGC CGTGTTCGCG
GCCACCGCCA CCGCGGCGCC GGACCGGATC GACGACGAGA CCAACGGCCG CGCCCTCACC
AGCGGCAGCG GCGGCGCCCT CGCGGGCGTG CTGAGCAGGC CCCGGCCCTG CTACATCTGC
AAGGAGAGGT ACCGGGAGGT CGACTCCTTC TACCACCAGC TGTGCCCCGC GTGCGCCGCG
TTCAACAGGG AGCGCCGCAA CGCCCGCACC GACCTGACCG GGCGCCGGGC CCTGCTCACC
GGCGGCCGGG CCAAGATCGG CATGTACATC GCACTCCGGC TGCTCAGGGA CGGCGCGCAC
ACGACCGTCA CCACACGTTT TCCCAACGAC GCGGTGCGCC GGTTCGCCGC GATGCCCGAC
AGCGGCGAGT GGCTGCACCG GCTGCGGGTG GTGGGCGTCG ACCTGCGCGA CCCCGCCCAG
GTGCTGCGGC TGGCCGACGC GGTGGCCGAA CAGGGGCCGC TGGACATCCT CATCAACAAC
GCGGCCCAGA CGGTGCGCCG TTCGCCCGGC TCCTACGGGC CGCTGGTCGA GGCCGAGGCC
GAACCGCTCA CGGGCGAGGG CCTCCCCGAG CCGCTGGTGC TGGGCGGCGT GCGCCCGCGC
GCCCTGGAGG ACCGCGCCGA CCCTGGGCGG GACGCCCCGG CGACCCACGC GCTCACTCCG
GCCATGCTCA CCTCCCTGGC GCTGACCACG GGCTCCGCGT CGATGGAGAG GGTGCGCACC
GGGACGGCGA TCGACGCCGG AGGGCTGGTG CCCGACCTGG CCCCGGTGAA CAGCTGGACG
CGGCGGATCG GCGAGGTCGA CCCCGTCGAG ATGCTGGAGG TGCAGCTGTG CAACGTGAGC
GCGCCGTTCC TGCTGGTGGA CCGGTTGCGC CCGGCCCTGG CCGCGTCCCC GGCGCGCCGC
ACCTACATCG TCAACGTGTC GGCGATGGAG GGGGTGTTCG GCCGCGGCTA CAAGGGGCCG
GGGCACCCCC ACACCAACAT GGCCAAGGCC GCGCTCAACA TGCTCACCCG CACCAGCGCC
CAGGAGATGC TGGAGTCCGA CGGAATCCTG ATGACCAGCG TGGACACCGG GTGGATCACG
GACGAGCGCC CGCACCCGGA GAAGGAGCGG CTGGTCGAGG CCGGGTTCCA CGCGCCCCTG
GACCTGGAGG ACGGGGCCGC GCGCGTGTAC GACCCCATCG TGCGGGGCGA GCTGGGAGAG
GACCTGCACG GGTGCTTCCT GAAGGACTAC GCCCCCGCCA ACTGGTAG
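
A GC content figure for a sequence laid out in space-separated blocks of ten, as above, can be computed with a few lines of code. This is a minimal sketch rather than the method the database uses; `gc_content` is a hypothetical helper, and only the first row of the gene sequence is used as sample input.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGACGGACA CGGGTACCCG GACGCTGCCG GAGGAGTCGG AGCACCCGGC GGAGCGGATC"
print(f"{gc_content(first_row):.1f}%")
```

Passing the complete gene sequence instead of the first row gives a value that can be compared with the GC content field of the record.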
 
Protein sequence
MTDTGTRTLP EESEHPAERI DPDRLAACLD VIARAGELPS DHPDSVALQR ATARLFKNVK 
ERRRKERQAA RQAHDRAVFA ATATAAPDRI DDETNGRALT SGSGGALAGV LSRPRPCYIC
KERYREVDSF YHQLCPACAA FNRERRNART DLTGRRALLT GGRAKIGMYI ALRLLRDGAH
TTVTTRFPND AVRRFAAMPD SGEWLHRLRV VGVDLRDPAQ VLRLADAVAE QGPLDILINN
AAQTVRRSPG SYGPLVEAEA EPLTGEGLPE PLVLGGVRPR ALEDRADPGR DAPATHALTP
AMLTSLALTT GSASMERVRT GTAIDAGGLV PDLAPVNSWT RRIGEVDPVE MLEVQLCNVS
APFLLVDRLR PALAASPARR TYIVNVSAME GVFGRGYKGP GHPHTNMAKA ALNMLTRTSA
QEMLESDGIL MTSVDTGWIT DERPHPEKER LVEAGFHAPL DLEDGAARVY DPIVRGELGE
DLHGCFLKDY APANW
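
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; table 11 differs in its permitted start codons, and that needs no special handling here because this gene starts with ATG. `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence; both begin on a codon boundary.
first_row = "ATGACGGACA CGGGTACCCG GACGCTGCCG GAGGAGTCGG AGCACCCGGC GGAGCGGATC"
last_row = "GACCTGCACG GGTGCTTCCT GAAGGACTAC GCCCCCGCCA ACTGGTAG"
print(translate(first_row))
print(translate(last_row))
```

The two rows translate to the first 20 and the last 15 residues of the protein sequence, with the final TAG codon acting as the stop.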