Gene Ndas_1365 details


Gene Information

Locus tag: Ndas_1365
Symbol:
ID: 9245215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1674248
End bp: 1675330
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_003679303
Protein GI: 297560329
COG category:
COG ID:
TIGRFAM ID:
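
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Both are assumptions, although they agree with the numbers in this record. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1674248, 1675330)
print(length, protein_length_aa(length))  # 1083 360
```

Under these assumptions the result matches the Gene Length and Protein Length fields of the record.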


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0340631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.835777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGCGG GTGCCGCCCT GGGCCTGGGA GCCGCGGCGC TGGCCGGTCC GCGACGGCCC 
GCCCTGCTGG ACGGCAGGGT CGCGCTCATC ACCGGCGGCT CCCGCGGGCT CGGACTCCAG
CTGGCCCGCG AGTTCGGCCG GCGCGGCGCG TCCGTCGTGA TCTGCGCCCG CGACCAGGAC
GAACTCGACC GTGCGGTGGC CGACCTGGAG CAGCGGGGCG TGCGGGCGCG CGGCGTCCGG
TGCGACGTCC GCGACCCCGA GTGCGCCCAG GAGCTGGTCG AGGAGGCCAC CGCGGCCTTC
GGCCACCTGG ACTTCGTCGT CAACAACGCG GGCATCATCC AGGTGGGGCC GCAGGAGACG
CTCTCGGAGA GCCACTTCCG CGACGCCATG GAGACCATGT TCTGGGCGCC GCTGCGCATC
TCCCGGGCCG CCATCGGCCC GCTGGAGCGC ACCCGCGGCT CCCTGGTCAC CATCACCTCC
ATCGGCGGCC ACCTGAGCGT GCCGCACCTG CTGCCCTACT CCTGCGCCAA GTTCGCCGAG
GTCGGCCTCT CCGAGGGACT GGGAGCCGAG GTCGCCCGCC GTGGCGTGCG GACCACCACG
GTGGTCCCGG GGCTGATGCG CACCGGTTCC CACAAGGCCG CCGTCTTCTC CGGAGACCCC
GAACACGAGT ACACCTGGTT CTCCCTGGGC GCCGGGCTGC CCCTGGTCTC GGTGAGCGGG
GAGCGCGCGG CGCGCCGGAT CGTGGAGGCG GCCGCGCGGG GGCGCAGCTA CCTGGTGCTC
ACGCCACTGG CCCGGGCGGC CATCGTCGCG CGGGGGCTGT GCCCCTCCTT CGTCCAGGGC
GCCATGCGCG TCATGGACCG GGTCCTGCCG GATCCGGGCG GATCCGTGGA GCGCAGGCCC
GGGCGCGAGG CGGACGACAC CGTGGTCAAC CGGGTGCTGA AGCTGCCCAC GGCGCTCAAC
GAGCGCGCGA GCAGGCGCCT CAACCAGAGG GACGACGAGG GCGGCGGCAC CTCCGGGGAC
CGGGGGCCGG ACGCCGACGG ACCGGAAGGG TCCGGGCCGG GACCGGGCGG GCCCGGCCGC
TGA
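
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. As a minimal sketch, the following shows one way to strip the layout whitespace and compute a GC percentage. The function names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here for brevity, so the value it prints will differ from the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "ATGGTGGCGG GTGCCGCCCT GGGCCTGGGA GCCGCGGCGC TGGCCGGTCC GCGACGGCCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full 1083 bp sequence through the same two functions would give a figure comparable to the GC content field.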
 
Protein sequence
MVAGAALGLG AAALAGPRRP ALLDGRVALI TGGSRGLGLQ LAREFGRRGA SVVICARDQD 
ELDRAVADLE QRGVRARGVR CDVRDPECAQ ELVEEATAAF GHLDFVVNNA GIIQVGPQET
LSESHFRDAM ETMFWAPLRI SRAAIGPLER TRGSLVTITS IGGHLSVPHL LPYSCAKFAE
VGLSEGLGAE VARRGVRTTT VVPGLMRTGS HKAAVFSGDP EHEYTWFSLG AGLPLVSVSG
ERAARRIVEA AARGRSYLVL TPLARAAIVA RGLCPSFVQG AMRVMDRVLP DPGGSVERRP
GREADDTVVN RVLKLPTALN ERASRRLNQR DDEGGGTSGD RGPDADGPEG SGPGPGGPGR
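
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons (table 11 differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed in the record.
first_row = "ATGGTGGCGG GTGCCGCCCT GGGCCTGGGA GCCGCGGCGC TGGCCGGTCC GCGACGGCCC"
print(translate(first_row))  # MVAGAALGLGAAALAGPRRP
```

The output matches the first 20 residues of the protein sequence listed above.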