Gene Ndas_1118 details


Gene Information

Locus tag: Ndas_1118
Symbol:
ID: 9244968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1372006
End bp: 1373181
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: acyl-CoA dehydrogenase domain protein
Protein accession: YP_003679065
Protein GI: 297560091
COG category:
COG ID:
TIGRFAM ID:
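
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed 1176 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1372006
END_BP = 1373181


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1176
print(protein_length(length_bp))  # 391
```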


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.549204
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGATT TCTCACTGAG CGATGAGCAG CGCGCCGTCC GCGACTGGGT GCGGACCTTC 
GTGGAGCGGG AGCTGATGCC GCTGGAGAAC GACGTGCTGC GCCGCGAGCG CCAGGGGCAG
CCGCCCGGCC TCAACGCCGA GGAGCGCACC CGGCTGCGCG AGCTGGCCCG CAAGTCCGAC
TTCTGGGGCG TGGAGACCCC GGTGGAGTAC GGCGGCATGG GCCTGGACCC GGTCACCGCC
GCGATCATCG AGATCGAGCT GGGCAGGACC TTCCTGACCT TCAAGTTCGG CGGCGAGGCC
GACAACATCC TCTACCACGC CAACGAGGAG CAGAAGGAGC GCTACCTGCT GCCCACCATC
GCCGGTGAGC GCCGCTCCTG CTTCGCCATC ACCGAGCCCG GCGCGGGCTC CGACGCCAAG
GCCATCCGCA CGCGCGCGGT CCGCGACGGC GACACGTGGG TGGTCAACGG CGAGAAGACC
TTCATCACCG GCGGCAACGA GGCCGACTTC GCCATGGTCT TCGCGGTCAC CGACCCGGAC
AGGGGCGCCG ACGGGGGCGT GACCTGCTTC CTGGTCGACC GCGAGGCGGG CTGGACCTCC
GAGCCCATCG ACACCATGGG CGAGCGCCGC CCGGCCTCCC TGGTCTTCCA GGACGTGCGC
GTGCCGCACG AGAACATCCT GGGCGAGGAG GGCCGCGGCT TCGAGCTGGC CATGAAGTGG
ATCGGCAAGG GCCGCTACCT GCTGCCCGCC CGCGCGCTCG GCGGCTGCGA GCGCCTGCTG
GACATGGCGA TCGAGTACTC GCGCACCCGC GAGACCTTCG GGGCTCCCAT CGCCGACCGG
CAGGCCATCC AGTGGATGAT CGCCGACTCC CAGACCGAGA TCGAGGCGCT GCGCCTGCTG
GTGCTGCACG CCGCGTGGCA GGTGTCGGTG GGCCGGGACT CGCGGCACGC CCAGTCCATC
GCCAAGCTCT TCGGCGGTGT GAAGGCCAAC GAGATCGTCG ACCGCGTGAT GCAGATCCAC
GGCGGCATGG GCTACACCCG TGAGCTGCCC ATCGAGCGCT TCTACCGCGA CGTGCGGCTG
CTGCGCATCT TCGAGGGCAC GGACGAGATC CAGCGCCGCA CGATCGCCCG CGACCTGCTC
AAGGGCCACG TCAAGGTGGG CGCAACCCTG GGCTAG
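
The gene sequence above is displayed in space-separated blocks of ten bases. As a rough sketch of how a figure such as the GC content field could be recomputed from it, the helper below strips the display whitespace and reports the percentage of G and C bases. The function names are hypothetical, and only the first displayed line is used here as sample input; pasting the full sequence in its place would give the gene-wide value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "GTGATCGATT TCTCACTGAG CGATGAGCAG CGCGCCGTCC GCGACTGGGT GCGGACCTTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_content(seq), 1))  # GC percentage of this 60-base window only
```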
 
Protein sequence
MIDFSLSDEQ RAVRDWVRTF VERELMPLEN DVLRRERQGQ PPGLNAEERT RLRELARKSD 
FWGVETPVEY GGMGLDPVTA AIIEIELGRT FLTFKFGGEA DNILYHANEE QKERYLLPTI
AGERRSCFAI TEPGAGSDAK AIRTRAVRDG DTWVVNGEKT FITGGNEADF AMVFAVTDPD
RGADGGVTCF LVDREAGWTS EPIDTMGERR PASLVFQDVR VPHENILGEE GRGFELAMKW
IGKGRYLLPA RALGGCERLL DMAIEYSRTR ETFGAPIADR QAIQWMIADS QTEIEALRLL
VLHAAWQVSV GRDSRHAQSI AKLFGGVKAN EIVDRVMQIH GGMGYTRELP IERFYRDVRL
LRIFEGTDEI QRRTIARDLL KGHVKVGATL G
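
The protein sequence begins with M even though the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several permitted start codons, and an initiating codon is translated as methionine. The sketch below is a minimal hand-rolled translator written to illustrate this; it is not the tool used to generate the record, and the function name is hypothetical. The codon-to-residue assignments are the standard ones, which table 11 shares.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate_cds("GTGATCGATT TCTCACTGAG C"))  # MIDFSLS
```

The output matches the first seven residues of the protein sequence listed above.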