Gene Ndas_3418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3418 
Symbol 
ID9247285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4088329 
End bp4090191 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content71% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003681329 
Protein GI297562355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCCC TACGCTCACG CACCGTCACC CACGGCAGGA ACATGGCCGG CGCGCGCGCC 
CTCATGCGCG CCACCGGTGT GGAGCGCGAG GACTTCGGCA AGCCCATCGT GGCCGTGGCC
AACAGCTTCA CCGAGTTCGT GCCGGGCCAC GTCCACCTGC GCGAGGTGGC CGAGGTCGTC
GCCAACGCCG TCCGCGAGGC GGGCGGCGTC CCCCGCGAGT TCAACTCCAT CGCCGTGGAC
GACGGCATCG CCATGGGCCA CGGCGGCATG CTCTACTCCC TGCCCAGCCG CGAGCTGATC
GCCGACTCGG TCGAGTACAT GGTCAACGCG CACTGCGCCG ACGCCCTGGT GTGCGTGTCC
AACTGCGACA AGATCACCCC GGGCATGCTG CTGGCCGCGC TGCGCCTGAA CATCCCCACG
GTGTTCGTCT CCGGCGGCCC CATGGAGGCG GGCAAGGTCA CGGTGGTCGA CGGCACCGCC
ACCACCGTGC GCAAGCTGGA CCTGATCAAC CCGATGATCG CCGCGGCCGA CGAGAGCGTC
TCCCAGGCCG AGCTGGACGA GATGGAGGAG GCCGCCTGCC CGACCTGCGG CTCCTGCTCG
GGCATGTTCA CCGCCAACTC GATGAACTGC CTCACCGAGG CGATCGGCCT GGCCCTGCCC
GGCAACGGCA CCGTGCTGGC CACCCACACC GCCCGCCGCG CCCTGTACGA GGACGCCGGA
CGCCTGGTCG TGGAGGCCGC CAAGCGCTAC TACGAGGACG ACGACTCCTC CGTCCTGCCG
CTGTCCATCG CCACCCCCGA GGCCTTCGGC AACGCCATGG CCCTGGACGT GGCCATGGGC
GGCTCCACCA ACACGATCCT GCACCTGCTG GCCGCGGCCA CCGAGGCGGG CGTCGGCTTC
GGCCTGCCCG AGATCGACGC GGTCTCGCGC CGGGTGCCGT GCCTGTGCAA GGTCGCGCCG
AACACCGAGA AGTACCACAT CGAGGACGTG CACCGGGCGG GCGGCATCCC CTCCATCCTG
GGCGAGCTGG CCCGCGGCGG CCTGCTGGAC ACCTCCCTGC CCACGGTGCA CGGCAAGACG
GTCGGCGAGT TCATCGCCGA GTGGGACATC GTCTCCGACA CCGTCTCACC CGAGGCCGTG
GAGCTGTTCC ACGCCGCCCC CGGCGGCAAG CGCACCACGA AGGCCTACTC ACAGGACACC
CGCTGGGACA CCCTGGACAC CGACCGGGAG AAGGGCTGCA TCCGCTCAGT CGAGCACGCC
TACACCAAGG ACGGCGGCCT GGCGGTGCTG TTCGGCAACC TCGCCCCGGA CGGCGCGATC
GTCAAGACCG CGGGCGTGGA GGAGGAGCTG TGGACCTTCT CCGGACCGGC CAAGGTGTTC
GAGTCCCAGG AGGACGCCGT GGACGGCATC CTCAACAAGC GGATCGAGCC CGGTGACGTG
GTGGTCATCC GTTACGAGGG CCCCAAGGGC GGTCCGGGCA TGCAGGAGAT GCTGTACCCG
ACGAGCTTCC TCAAGGGCCG CGGCCTGGGC AAGGCGTGCG CCCTCATCAC CGACGGCCGC
TTCTCCGGCG GCACGTCGGG GCTGTCCATC GGCCACGCCT CCCCCGAGGC CGCCGCGGGC
GGTGACATCG CGCTGGTGGA GGACGGCGAC GTCATCAGCA TCGACATCCC CAACCGGGGC
ATCGTGCTGG AGGTCTCCGC GGAGGAGCTC GACGCGCGCC GCGAGCGCCT GCTCAAGGAG
CTGGGCCGGT TCAGGCCGCG CGACCGACAG CGGCCGGTGA CCGCGGCTCT GCGCGCCTAC
GCGGCCATGG CGACCTCGGC CTCGACCGGC GCCGCGCGCG ACGTGTCCCA GGTCGAGAAG
TAG
 
Protein sequence
MPALRSRTVT HGRNMAGARA LMRATGVERE DFGKPIVAVA NSFTEFVPGH VHLREVAEVV 
ANAVREAGGV PREFNSIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADALVCVS
NCDKITPGML LAALRLNIPT VFVSGGPMEA GKVTVVDGTA TTVRKLDLIN PMIAAADESV
SQAELDEMEE AACPTCGSCS GMFTANSMNC LTEAIGLALP GNGTVLATHT ARRALYEDAG
RLVVEAAKRY YEDDDSSVLP LSIATPEAFG NAMALDVAMG GSTNTILHLL AAATEAGVGF
GLPEIDAVSR RVPCLCKVAP NTEKYHIEDV HRAGGIPSIL GELARGGLLD TSLPTVHGKT
VGEFIAEWDI VSDTVSPEAV ELFHAAPGGK RTTKAYSQDT RWDTLDTDRE KGCIRSVEHA
YTKDGGLAVL FGNLAPDGAI VKTAGVEEEL WTFSGPAKVF ESQEDAVDGI LNKRIEPGDV
VVIRYEGPKG GPGMQEMLYP TSFLKGRGLG KACALITDGR FSGGTSGLSI GHASPEAAAG
GDIALVEDGD VISIDIPNRG IVLEVSAEEL DARRERLLKE LGRFRPRDRQ RPVTAALRAY
AAMATSASTG AARDVSQVEK