Gene Ndas_5056 details


Gene Information

Locus tag: Ndas_5056
Symbol:
ID: 9248945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 195982
End bp: 197277
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: NADH dehydrogenase I, D subunit
Protein accession: YP_003682943
Protein GI: 297563970
COG category:
COG ID:
TIGRFAM ID:
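
The start, end, and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(195982, 197277)
print(length, protein_length(length))  # 1296 431
```

Under these assumptions the computed values agree with the listed gene length (1296 bp) and protein length (431 aa).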


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCA CCGAGGACTA CATCGACGCC TCCGGCGGGG ACTGGGACGA CGTCATCGAG 
AAGGCGCAGG CCACCAGCGC CGAACGCCTC GTCGTCAACA TGGGACCCCA GCACCCCTCC
ACGCACGGCG TCCTGCGGCT CATCCTCACC CTCGACGGTG AGACCTGCAC CGAGGCGCGC
GTCGGTATCG GCTACCTGCA CACCGGTATC GAGAAGAACA TGGAGTACCG GACGTGGACG
CAGGGCACCA CGTTCGTGAC CCGCATGGAC TACCTGACGC CGCTGTTCAA CGAGGCGGCG
TACTGCCTGG CGGTCGAGAA GCTGCTCGGC ATCGAGGACC GCGTCCCCGA GCGGGCCAGC
GTCATCCGCG TGATGATGAT GGAGCTCAAC CGGATGGCCT CGCACTTCGT GGCGATGGCG
ACCTTCGGCA TGGAGCTGGG CGCGACCACG GTCATGACCA ACGGCTTCCG TGAGCGCGAG
ATGATCCTGG ACATCTTCGA GCTGGTCACC GGCCTGAGGA TGAACCACGC CTACATCCGC
CCCGGCGGCG TGGCCCAGGA CCTGCCGCCC GGCGCGGCCG GCAAGGTCCG CGAGCTGCTC
AAGGAGATGC CCAAGCGCAT CGCCGTCATG CGCAAGCTCC TGGACGAGAA CCCCGTCTAC
CTCGCCCGCA CCAAGGACGT GGCCCACCTC AACCTGCCCG GCTGCATGGC GCTCGGCGTC
ACCGGCCCGC TGCTGCGGGC CTCCGGCCTG GCCTGGGACC TGCGCAAGGC CAAGCCCTAC
TGCGGCTACG AGGGCTACGA GTTCGACGTC CCCGTCTCCG ACGGCGGCGA CGTCTACGCC
CGCTACCGGG TGCGCATGGC CGAGATGGAG GAGAGCCTGA AGATCATCGA GCAGTGCCTG
GACAAGCTCC AGCCCGGCCC GGTGATGATC CAGGACGCCA AGATCGGATG GCCCGCCAAG
CTGGCGCTGG GGCCCGACGG CCTGGGCAAC TCGCCCGACC ACATCGCGCA CATCATGAGC
GGCTCCATGG AGGCGCTCAT CCACCACTTC AAGCTGGTCA CCGAGGGCTT CCGGGTCCCC
GCGGGCCAGG CCTACGCGGC CGTCGAGAGC GCCAAGGGCG AACTCGGCTG CTACGCGGTC
AGCGACGGGA GCACCCGCCC CCACCGCGTG CACTTCCGCG ACCCCTCGTT CACCCACCTG
CAGGCCGTCG CGGCCATGTG CGAGGGGGGA ACGGTGGCGG ACGTCATCGC CGCCGTGGCC
AGTATCGACC CGGTGATGGG AGGCGTGGAC CGGTGA
 
Protein sequence
MTITEDYIDA SGGDWDDVIE KAQATSAERL VVNMGPQHPS THGVLRLILT LDGETCTEAR 
VGIGYLHTGI EKNMEYRTWT QGTTFVTRMD YLTPLFNEAA YCLAVEKLLG IEDRVPERAS
VIRVMMMELN RMASHFVAMA TFGMELGATT VMTNGFRERE MILDIFELVT GLRMNHAYIR
PGGVAQDLPP GAAGKVRELL KEMPKRIAVM RKLLDENPVY LARTKDVAHL NLPGCMALGV
TGPLLRASGL AWDLRKAKPY CGYEGYEFDV PVSDGGDVYA RYRVRMAEME ESLKIIEQCL
DKLQPGPVMI QDAKIGWPAK LALGPDGLGN SPDHIAHIMS GSMEALIHHF KLVTEGFRVP
AGQAYAAVES AKGELGCYAV SDGSTRPHRV HFRDPSFTHL QAVAAMCEGG TVADVIAAVA
SIDPVMGGVD R
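
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that translates DNA using the amino acid assignments of translation table 11 and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the whitespace handling assumes the 10-base grouping shown above, and the sketch does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that is not needed here).

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above.
first_row = "ATGACCATCA CCGAGGACTA CATCGACGCC TCCGGCGGGG ACTGGGACGA CGTCATCGAG"
print(translate(first_row))  # MTITEDYIDASGGDWDDVIE
```

The 20 residues produced from the first row match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.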