Gene Ndas_0944 details


Gene Information

Locus tag: Ndas_0944
Symbol:
ID: 9244789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1159923
End bp: 1161299
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: dihydrolipoamide dehydrogenase
Protein accession: YP_003678894
Protein GI: 297559920
COG category:
COG ID:
TIGRFAM ID:
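
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1159923
end_bp = 1161299
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match the gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match the protein
```

Under these assumptions all three checks hold for this record: the span is 1377 bp, which is 459 codons, or 458 amino acids plus one stop codon.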


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0941075
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.125714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGAGA GCGGCGGCAC CTTCGACCTC GTCGTCCTTG GCGGCGGCAG CGGCGGCTAC 
GCGGCGGCAC TGCGCGCCGC GGAGCTGGAC ATGAGCGTCG TCCTGATCGA GAAAGACAAG
CTCGGCGGCA CCTGCCTGCA CCGCGGCTGC ATCCCCACCA AGGCCCTGCT GCACTCGGCC
GAGGTCGCCG ACTCCGCCAA GGAGAGCGAG AACTTCGGGG TCAAGGCCAC GTTCGAGGGC
ATCGACATCC AGGCCGTGCA CACCTACAAG GACAAGGTGA TCGGCGGCCT GTTCAAGGGC
CTGACCGGCC TGGTCAAGTC GCGCAAGATC ACCGTCGTCG AGGGCGAGGG CAAGCTCACC
GGCAAGGACG AGGTCACCGT CGACGGCGCC GTCTACAAGG GCCGCAACAT CCTGCTGGCC
ACCGGTTCCA AGCCCAAGAC CCTCGGGCTG GACATCGACG GCGAGAAGGT CATGACCAGC
GACCAGGCCC TCGACCTGGA CCGCGTGCCC GAGTCGGTCA TCGTCCTGGG CGGCGGCGTG
ATCGGCGTGG AGTTCGCCAG CGTGTGGCGC TCCTACGGCG CCGACGTCAC CATCGTCGAG
GCCCTGCCGC ACCTGGTGCC GGTGGAGGAG GAGTCCAGCT CCAAGCTGCT GGAGCGCGCC
TTCCGCAAGC GCAAGATCAA GTACGAGCTG GGCACCCCGT TCGAGTCGGT CAAGACCACC
GACTCCGGCG TCACCGTCAC CCTCAAGGGG GGCAAGACCC TGGAGGCCGA GGTCCTGCTG
GTGGCCATCG GCCGCGGGCC GGTCTCGGAG GGCCTGGGCT ACGAGGAGCA GGGCATCACC
CTGGACCGCG GCTTCGTCCA GGTGGACGAG AACCTGCACA CCGGCGTGGG CAACGTCTAC
GCCGTGGGCG ACCTCATCCC CACCCTCCAG CTCGCCCACG TCGGCTTCGC CGAGGGCATC
TTCGTCGCCG AGCACATCGC CGGGCAGAAC CCGCCCGCGA TCGACTACGA CGGCGTCCCC
CGCGTCACCT ACTGCGAGCC CGAGGTCGCG TCCGTGGGCC TGACCACCAA GGTCGCCAAG
GAGCGCGGAC ACGACGTCGT GGAGATGAAC TACAGCCTCG CGGGCAACGG CAAGAGCCAG
ATCCTGCAGA CACAGGGCGC GGTGAAGGTC ATCGCCGAGA AGGACGGCCC CGTACTGGGC
GTCCACATGG TCGGCAGCCG CGTCGGCGAG CTGATCGCCG AGGGGCAGCT GATCTACAAC
TGGGAGGCCC TGCCCTCCGA GGTGGCCCAG CTGATCCACC CGCACCCGAG CCAGTCCGAG
GCGCTGGGCG AGGCGCACCT CGCGCTGGCG GGCAAGCCGC TGCACGTCCA CGACTGA
 
Protein sequence
MSESGGTFDL VVLGGGSGGY AAALRAAELD MSVVLIEKDK LGGTCLHRGC IPTKALLHSA 
EVADSAKESE NFGVKATFEG IDIQAVHTYK DKVIGGLFKG LTGLVKSRKI TVVEGEGKLT
GKDEVTVDGA VYKGRNILLA TGSKPKTLGL DIDGEKVMTS DQALDLDRVP ESVIVLGGGV
IGVEFASVWR SYGADVTIVE ALPHLVPVEE ESSSKLLERA FRKRKIKYEL GTPFESVKTT
DSGVTVTLKG GKTLEAEVLL VAIGRGPVSE GLGYEEQGIT LDRGFVQVDE NLHTGVGNVY
AVGDLIPTLQ LAHVGFAEGI FVAEHIAGQN PPAIDYDGVP RVTYCEPEVA SVGLTTKVAK
ERGHDVVEMN YSLAGNGKSQ ILQTQGAVKV IAEKDGPVLG VHMVGSRVGE LIAEGQLIYN
WEALPSEVAQ LIHPHPSQSE ALGEAHLALA GKPLHVHD
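
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that in plain Python: it strips the block spacing, computes a GC fraction, and translates a coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of the record. It assumes the sense-strand sequence is given 5' to 3', that translation table 11 shares its codon assignments with the standard genetic code, and that the first codon is a valid start codon read as methionine (here GTG).

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(blocked):
    """Remove the spaces and line breaks used to print sequence blocks."""
    return "".join(blocked.split()).upper()

def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

def translate(dna):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is read as M,
    even when it is an alternative start such as GTG.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)

# First line of the gene sequence, copied from the record above.
first_line = "GTGAGTGAGA GCGGCGGCAC CTTCGACCTC GTCGTCCTTG GCGGCGGCAG CGGCGGCTAC"
print(translate(clean(first_line)))  # MSESGGTFDLVVLGGGSGGY
```

The translated prefix agrees with the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value that can be compared with the GC content field.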