Gene Ndas_5417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5417 
Symbol 
ID9249320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp596980 
End bp598095 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content71% 
IMG OID 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_003683302 
Protein GI297564329 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGACA CGACCGTGCA CGAGGAACCG GAGCTGATCC AGCTCCTGAC ACCGGAAGGG 
GAGTTGACGG GGCACCCCGA CTACCCCCTG GACATCAGCG CCGAGGAGAT CCGCGCCCTG
TACCGCGACC TGGTCCTGGT GCGCAGGTTC GACAGCGAGG CGGTCTCCCT CCAGCGCCAG
GGCGAGCTGG GCCTGTGGGC CTCGCTGCTG GGCCAGGAGG CCGCGCAGAT CGGCTCCGCA
CGCGCGCTGG GCGCGAAGGA CATGGCCTTC CCCTCCTACC GCGAGCACGG CGTCGCGTGG
TGCCGGGGCA TCGAGCCCCG TGAACTGCTC GGCATGTTCC GCGGCGTCAC CAACGGGGGC
TGGGACCCCC ACGAGCACGG CTTCCACCTG TACACGATCG TCATCGGCAG CCAGACCCTG
CACGCCACCG GCTACGCCAT GGGCGTCCAG CGCGACGGCG CCGTCGGCGA GGACGGCACC
GCCGTCATCT CCTACTTCGG CGACGGGGCC ACCAGCCAGG GCGACACCAA CGAGGCGTTC
AACTTCGCCT CGGTCAACAA CGCCCCGGTG GTCTTCTTCT GCCAGAACAA CCAGTGGGCG
ATCTCCGAAC CGCTGGAGCG CCAGGCCCGC GTGCCCATCT ACCGGCGCGC CGCCGGGTTC
GGCTTCCCCG GCCTGCGCGT GGACGGCAAC GACGTCCTGG CCTGCCTGGC CGTGACCCGG
GTCGCGCTGT CCAACGCCCG CGAGGGCAAC GGCCCCACGC TCGTGGAGGC GTTCACCTAC
CGGATGGGCG CCCACACCAC CAACGACGAC CCCACCCGCT ACCGCGCGTC GGCCGAGCTC
GACGAGTGGA AGGCCAAGGA CCCGATCCTG CGGGTCCGCC GCTACCTGGA GCGGGGCGGC
CACGCCGACG AGGAGTTCTT CGCGTCCGTG GACGCCGAGG CGGACCGGCT GGGCGAGCAG
GTGCGCACCG AGTGCCGTTC CCTGCCCGAC CCCGAGCCCC TCGACATCTT CCACGAGGTC
TACGCCGAGC CCAACGTCCA CATCGACCAG CAGCGGTCCG AGTTCGCCGA CTACCTGGCC
TCCTTCGAGG GCGCGGGCGC GGAAGGGGGC CGTTAG
 
Protein sequence
MSDTTVHEEP ELIQLLTPEG ELTGHPDYPL DISAEEIRAL YRDLVLVRRF DSEAVSLQRQ 
GELGLWASLL GQEAAQIGSA RALGAKDMAF PSYREHGVAW CRGIEPRELL GMFRGVTNGG
WDPHEHGFHL YTIVIGSQTL HATGYAMGVQ RDGAVGEDGT AVISYFGDGA TSQGDTNEAF
NFASVNNAPV VFFCQNNQWA ISEPLERQAR VPIYRRAAGF GFPGLRVDGN DVLACLAVTR
VALSNAREGN GPTLVEAFTY RMGAHTTNDD PTRYRASAEL DEWKAKDPIL RVRRYLERGG
HADEEFFASV DAEADRLGEQ VRTECRSLPD PEPLDIFHEV YAEPNVHIDQ QRSEFADYLA
SFEGAGAEGG R