Gene Ndas_5052 details

Gene Information

Locus tag: Ndas_5052
Symbol:
ID: 9248941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 192444
End bp: 193748
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: geranylgeranyl reductase
Protein accession: YP_003682939
Protein GI: 297563966
COG category:
COG ID:
TIGRFAM ID:
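
The coordinates and lengths listed above are mutually consistent, which the following minimal Python sketch checks. It assumes that the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 192444
END_BP = 193748


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Both results agree with the Gene Length and Protein Length fields.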


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGAGA CAGCCACCTC CTCTCCCCGA GGCCGGGAGC GGACCGAGTA CGACGCCGAC 
GTGATCGTGG TCGGCGCAGG TCCCTCCGGC TCCACCACCG CCTACTACCT TGCCCAGGCA
GGACTGGACG TCCTGCTCCT GGAGAAGACC TCCTTCCCGA GGGAGAAGGT CTGCGGAGAC
GGACTCACCC CCCGCGCGGT GAAGCAGCTC ACCGCTATGG GTGTCACCTT CGACGACCCG
GGGTGGATGA AGAACCACGG CCTGCGCATC ATCGGCGCGG GCGTCCGCCT GGAGCTGCCC
TGGCCCGACC TGGCCGCCTA CCCGGGTTTC GGCCTCGTGC GCACCCGTTA CGACTTCGAC
CAGATCGTGG TCAACCGCGC GGTGGCCGCC GGGGCCAAGC TCCTGGAGCG CACCACCGTC
ACCGGCCCCC TCATGGACGA GCGCAGCAAC CGCATCGTCG GAGTCCGGGC CAAGAACGCC
GACCGCGAGC CCGTCACCTT CCGGGCGCCG CTGGTCGTGG CCGCCGACGG CAACTCCTCC
CGGCTGTCCG TGGCCATGGG CATCCGCAAG CGCGACGACC GGCCCATGGG CGTGGCCGTG
CGCACCTACT TCGAGAGCCC CCGCCACGAG GACGACTACC TGGAGTCCTG GCTGGAGCTG
TGGGACCGCA GCGGCGACAA GGACGTCCTC CTGCCCGGCT ACGGCTGGGT CTTCGGCGTC
GGCGACGGCA CCAGCAACGT CGGCCTGGGC ATCCTCAACT CCACCGCGTC CTTCCAGGAC
ATGGACTACC GCAAGCTCCT GCGCCGCTGG ACCGAGTCCA TGCCCGAGGA GTGGGGCTTC
ACCGAGGACA ACCAGAAGGG CGCCATCCGT GGCGCCGCAC TGCCCATGGG CTTCAACCGG
GTGCCGCACT ACTCCCGCGG CCTCATGCTC GTCGGCGACG CGGGCGGCAT GGTCAACCCC
TTCAACGGCG AGGGCATCGC CTACGCCATG GAGGCCGGGA ACATCGCCGC CGACGTGATC
GTGCAGGCGC ACGGCAGACC CACCCAGCAG ACCCGCGAGC GGGCCCTGCT GCGCTACCCG
GACGTGCTCG CCGACACCTA CGGCGGCTAC TACACGCTCG GCCGCTACTT CGTGAAGGTC
ATCGGCCAGC CCGAGTTCAT GAAGTACGCG ACCAGGTACG GCCTGCGCCA GCGCACCCTC
ATGAAGTTCG TGCTGAAGAT GCTGGCCAAC CTCACCGAGC CCACGCAGGG AGACGCCATG
GACAGGGTCA TCAACGGCCT GTCCCGCATC GCCCCCGCGG CCTGA
 
Protein sequence
MSETATSSPR GRERTEYDAD VIVVGAGPSG STTAYYLAQA GLDVLLLEKT SFPREKVCGD 
GLTPRAVKQL TAMGVTFDDP GWMKNHGLRI IGAGVRLELP WPDLAAYPGF GLVRTRYDFD
QIVVNRAVAA GAKLLERTTV TGPLMDERSN RIVGVRAKNA DREPVTFRAP LVVAADGNSS
RLSVAMGIRK RDDRPMGVAV RTYFESPRHE DDYLESWLEL WDRSGDKDVL LPGYGWVFGV
GDGTSNVGLG ILNSTASFQD MDYRKLLRRW TESMPEEWGF TEDNQKGAIR GAALPMGFNR
VPHYSRGLML VGDAGGMVNP FNGEGIAYAM EAGNIAADVI VQAHGRPTQQ TRERALLRYP
DVLADTYGGY YTLGRYFVKV IGQPEFMKYA TRYGLRQRTL MKFVLKMLAN LTEPTQGDAM
DRVINGLSRI APAA
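
The protein sequence can be derived from the gene sequence under translation table 11. The sketch below is a minimal illustration, assuming that the gene sequence is listed as the coding strand in 5' to 3' direction and that an alternative start codon such as GTG is read as methionine at the first position. `COMMON_STARTS` holds only the three most common bacterial start codons, not the full table 11 list, and `translate_cds` and `gc_fraction` are hypothetical helper names.

```python
# Codon assignments in the usual NCBI ordering (bases T, C, A, G);
# table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the start codons permitted by table 11.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an in-frame sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in COMMON_STARTS:
            residues.append("M")  # initiator codon is read as Met
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bp of the gene sequence listed above.
print(translate_cds("GTGAGCGAGA CAGCCACCTC CTCTCCCCGA"))  # MSETATSSPR
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.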