Gene Ndas_4806 details

Gene Information

Locus tag: Ndas_4806
Symbol:
ID: 9248689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5696069
End bp: 5697379
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: catalytic domain of components of various dehydrogenase complexes
Protein accession: YP_003682696
Protein GI: 297563722
COG category:
COG ID:
TIGRFAM ID:
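
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5696069
end_bp = 5697379

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count, leftover_bases = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, protein_length, leftover_bases)  # 1311 436 0
```

Under these assumptions the result agrees with the reported Gene Length (1311 bp) and Protein Length (436 aa).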


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGA TCCAGATGCC GCGCCTCTCC GACACCATGG AGGAAGGCGT CATCAGCACG 
TGGGTCAAGA ACGTGGGCGA CAAGGTCGCC TCCGGTGACG TCCTGGTCGA GATCGAGACC
GACAAGGCCG TCATGGAGTA CGAGGCCTAC GAGGACGGCT ACCTGGTCAA GCAGTCCGTC
TCCGAGGGCG AGACGGTGCC GATCGGCGCG GTCATCGGCG TGATCGCCGA CTCCCCGGAC
GCGGTACCCG AGGACTCCGG CGACGGCGGC TCGGAGCCCG AGGCCGCGCC CGCCGAGGAG
GAGCAGGGCG AGAAGGCGGA GGAGATCCAG GAGGCCGCCG AGGGCACCGA GGCCGAGAGC
GCCGGGGAGT CCGCCGCCTC CTCCGGCGAT GGGGCCGCGC GCCCGCGCAC CTCCCCGCTG
GCCCGGCGTC TGGCCAAGGA GTACGGCCTG GACATCAACA GGATCCAGGG GTCGGGCCCC
AAGGGCCGGA TCGTGCGCGC CGACATCGAG GCCGCCCGGG AAGGCGGTGC CGCCGAGCAG
GCCGCACCCG CCGCGCAGCC CAAGGAGGAG GCCAAGCCCG CCGCGGAGAA GGCGGCGACC
GCTCCCGCCT TCGACGACGG CCGCGCCTCC GAGGAGCTCA AGGTCAGCAA CGTGCGCAAG
GTGATCGCGC GCCGCCTGAC CGAGAGCAAG CAGACGGTGC CGCACTTCTA CCTGCGCCGC
ACGATCGACG CCGAGGCGCT CAAGGCCTTC CGCGCGCAGA TCAACGAGCA GCTGTCCAGC
ACGGGCGTGA AGGTCAGCTT CAACGACCTG ATCGTCAAGG CCAGCGCGAC GGCGCTGAAG
CTGCACCCGG CGGTGAACAC CTCGTGGGTG GACGACAAGC TGCTCCAGCA CCACCGGGTC
AACGTCGGCG TGGCCGTGGC CGTGGACGCC GGGCTCGTGG TGCCGGTGCT GCACGACACC
GACAAGGCGA CGCTGTCGGA GATCTCCACG CGCACGCGCG AGCTGGCGGG CAAGGCCCGT
GACGGCAAGC TCAAGCCGCA GGAGATGAGC GGCGGCACGT TCAGCGTGTC CAACCTGGGC
ATGTTCGGCG TGGACAGCTT CTCCGCGGTG ATCAACCCGC CGGAGGCGGC CATCCTCGCG
GTCGGCGCGA TGCGCCAGGA GCCGGTGGTC GTGGACGGCG AGGTCGTCGT GCGCAACCGG
ATCTCCCTGG AGCTGTCGGT GGACCACCGC GCGGTGGACG GCGCCGTGGG CGCCGCGTTC
CTCAAGGACC TCGCGGAGAT CCTGGAAGAG CCGATGCGGA TCATCCTGTA G
 
Protein sequence
MSEIQMPRLS DTMEEGVIST WVKNVGDKVA SGDVLVEIET DKAVMEYEAY EDGYLVKQSV 
SEGETVPIGA VIGVIADSPD AVPEDSGDGG SEPEAAPAEE EQGEKAEEIQ EAAEGTEAES
AGESAASSGD GAARPRTSPL ARRLAKEYGL DINRIQGSGP KGRIVRADIE AAREGGAAEQ
AAPAAQPKEE AKPAAEKAAT APAFDDGRAS EELKVSNVRK VIARRLTESK QTVPHFYLRR
TIDAEALKAF RAQINEQLSS TGVKVSFNDL IVKASATALK LHPAVNTSWV DDKLLQHHRV
NVGVAVAVDA GLVVPVLHDT DKATLSEIST RTRELAGKAR DGKLKPQEMS GGTFSVSNLG
MFGVDSFSAV INPPEAAILA VGAMRQEPVV VDGEVVVRNR ISLELSVDHR AVDGAVGAAF
LKDLAEILEE PMRIIL
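
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a hedged sketch rather than the method used to produce this record: `clean`, `gc_percent`, and `translate` are hypothetical helper names, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 additionally permits alternative start codons, which this sketch does not handle; this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")  # 'X' for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
print(translate("ATGAGCGAGA TCCAGATGCC G"))  # MSEIQMP
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the GC content field.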