Gene Ndas_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0844 
Symbol 
ID9244689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1036312 
End bp1037502 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content73% 
IMG OID 
Productacyl-CoA dehydrogenase domain protein 
Protein accessionYP_003678794 
Protein GI297559820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.114761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT CCCGCAAGCC CGCCGCCCCG GACCCGCACG ACTTCCTGGC CGTGGACGCC 
GACCTGTCCG ACACCGAGCG CGACGTCCGC GACGCCGTGC GCGGCTTCGC CTCGCGCGAG
CTGGCGCCGA ACGTGGCCGA GTGGTTCGAG GCGGGCACCC TGCCCGACCC GCGCGGGCTG
GCCAGGGCCT TCGGCTCCCT CGGGGTGCTG GGCATGCACC TGGAGGGGTA CGGCTGCGGC
GGCTCCAGCG CGGTCGCCTA CGGTCTGGCC TGCCGCGAAC TGGAGGCCGT GGACTCGGGC
CTGCGCAGCT TCGTGTCGGT GCAGGGCTCG CTGGCGATGG CGGCCGTCCA CAAGTTCGGC
TCCGAGGAGC AGAAGCAGGA GTGGCTGCCG CGCATGGCCG CCGGTGAGGC GATCGGCTGC
TTCGGGCTGA CCGAGCCCGA CTCCGGTTCC GACCCGGGCT CTATGCGCAC CCGCGCCCGC
AGGGACGGGT CGGACTGGGT GCTCGACGGC ACCAAGATGT GGATCACCAA CGGATCGGTC
GCCGACGTGG CCGTGGTGTG GGCCGCCACC GACGAGGGCG TGCGCGGGTT CGTGGTGCCC
GCCGACACGC CCGGGTTCTC GGCCAACGTC ATCCACCGCA AGCTGTCGCT GCGCGCGTCC
ATCACCTCCG AACTGGTGCT GGAGGGCGTG CGCCTGCCCG CCGACGCGGT GCTGCCGGGA
TCGCGCGGCC TGGGCTCGCC GCTGTCGTGC CTGAACGAGG CCCGCTACGG CATCGTGTGG
GGCGCGGCCG GTGCCGCGCG CGCCTGCTAC GAGGCGGCCC TGGAGTACGC GCTGAGCCGC
GAGCAGTTCG GCCGGCCGAT CGCCGGGTTC CAGCTGACCC AGCGCAAGCT CGCCGACATG
GTGGTGGACG TCAACCAGGC GGCCATGACC GCGCTGCGGA TCGGGCGTCT GAAGGACGAG
GGCCGGTGCC ACCACAACCA CGTGAGCTTC GGCAAGCTCT CGTGCGTGGC CGCCGCCCAG
CGGGTGGCCG CGGCGGCGCG CTCGGTGCAC GGCGCCAACG GCATCACCCT GGAGTACCCG
GTGATGCGGC ACATGCTGAA CCTGGAGACG GTCGCCACCT ACGAGGGCAC CGAGGAGATC
CACGCGCTCA GCATCGGCCA GTCGGTGACG GGGATCTCGG CCTTCCGCTG A
 
Protein sequence
MSDSRKPAAP DPHDFLAVDA DLSDTERDVR DAVRGFASRE LAPNVAEWFE AGTLPDPRGL 
ARAFGSLGVL GMHLEGYGCG GSSAVAYGLA CRELEAVDSG LRSFVSVQGS LAMAAVHKFG
SEEQKQEWLP RMAAGEAIGC FGLTEPDSGS DPGSMRTRAR RDGSDWVLDG TKMWITNGSV
ADVAVVWAAT DEGVRGFVVP ADTPGFSANV IHRKLSLRAS ITSELVLEGV RLPADAVLPG
SRGLGSPLSC LNEARYGIVW GAAGAARACY EAALEYALSR EQFGRPIAGF QLTQRKLADM
VVDVNQAAMT ALRIGRLKDE GRCHHNHVSF GKLSCVAAAQ RVAAAARSVH GANGITLEYP
VMRHMLNLET VATYEGTEEI HALSIGQSVT GISAFR