Gene Ndas_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0107 
Symbol 
ID9243938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp136693 
End bp138081 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content70% 
IMG OID 
Productfumarate lyase 
Protein accessionYP_003678064 
Protein GI297559090 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGT TCCGTATCGA GCACGACTCG ATGGGTGAGG TCAGGGTTCC GGCCGAGGCC 
AAGTGGAGGG CCCAGACGCA GCGTGCCGTC GAGAACTTCC CGATCTCCGG CCAGGGCCTG
GAGGGCGCGC ACATCGCCGC CCTCGGCCAG ATCAAGGCCG CCGCGGCCAA GGTCAACGCC
GAGCTCGGCG TCATCAGCGA CGACCTGGGC AAGGCGATCC GGGAGGCCGC CCTGGAGGTC
GCCGAGGGCA GGTGGAACGA CGAGTTCCCG ATCGACGTCT TCCAGACCGG CTCGGGCACC
TCCAGCAACA TGAACACCAA CGAGGTCGTG GCGACGCTGG CCACCGAGCG CCTCGGGGCC
CCGGTGCACC CCAACGACCA CGTCAACGCG TCGCAGTCCT CCAACGACGT GTTCCCCTCC
TCCATCCACA TCGCCGCCAC CTCCGCCGTG CAGAACGACC TGGTCCCGGC GCTGCGGCAC
CTGGAGGAGG CGCTCGGCGC CAAGGCGACC GAGTTCGCCT CCGTGGTCAA GAGCGGCCGC
ACCCACCTCA TGGACGCCAC CCCGGTCACC CTGGGCCAGG AGTTCGCCGG GTACGCCGCC
CAGGTGCGCT ACGGCGTGGA GCGCCTGGAG GCCGTCCTGC CGCGCGTGGC CGAGCTGCCG
CTGGGCGGCA CCGCCGTGGG CACCGGCATC AACACCCCCG AGGGCTTCTC CGCCCGGGTC
ATCGCCGAGA TCGCCGAGCA CACCGGCCTG CCGCTGACCG AGGCCCGCGA CCACTTCGAG
GCGCAGGGCG CCCGCGACGG CCTGGTCGAG CTGTCCGGCC AGCTGCGGAC CATCGCGGTC
GGCTTCGCCA AGATCGCCAA CGACATCCGC TGGATGGGCT CGGGCCCGAC CACGGGCCTG
GGCGAGATCC TCCTGCCCGA CCTCCAGCCC GGCTCCTCGA TCATGCCGGG CAAGGTCAAC
CCGGTCCTGT GCGAGGCCGT GCTCCAGGTG ACCTCGCAGG TCGTCGGCAA CGACGCCGCG
GTGGCCTTCG GCGGCGCGAG CGGCAACTTC GAGCTGAACG TGCAGCTGCC GATGATCGCC
CGCAACGTGC TGGAGTCGAT CCGCCTGCTC TCCAACGTCT CGCGCGTGTT CGCGGACCGC
TGCGTGTCCG GTATCGAGGC CAACGTCGAG CAGTGCCGCG TCTACGCCGA GTCCTCGCCG
TCGATCGTGA CCCCGCTCAA CCGCTACATC GGCTACGAGG AGGCCTCCAA GGTCGCCAAG
CAGTCGCTGA AGGAGAAGAA GACCATCCGC GAGGTGGTCA TCGAGCGCGG ATACGTCGAG
GACGGCAAGC TCACCGAGGC GCAGCTGGAC GAGGCCCTCG ACGTGCTGCG GATGACCAAC
TCCCAGTAG
 
Protein sequence
MSEFRIEHDS MGEVRVPAEA KWRAQTQRAV ENFPISGQGL EGAHIAALGQ IKAAAAKVNA 
ELGVISDDLG KAIREAALEV AEGRWNDEFP IDVFQTGSGT SSNMNTNEVV ATLATERLGA
PVHPNDHVNA SQSSNDVFPS SIHIAATSAV QNDLVPALRH LEEALGAKAT EFASVVKSGR
THLMDATPVT LGQEFAGYAA QVRYGVERLE AVLPRVAELP LGGTAVGTGI NTPEGFSARV
IAEIAEHTGL PLTEARDHFE AQGARDGLVE LSGQLRTIAV GFAKIANDIR WMGSGPTTGL
GEILLPDLQP GSSIMPGKVN PVLCEAVLQV TSQVVGNDAA VAFGGASGNF ELNVQLPMIA
RNVLESIRLL SNVSRVFADR CVSGIEANVE QCRVYAESSP SIVTPLNRYI GYEEASKVAK
QSLKEKKTIR EVVIERGYVE DGKLTEAQLD EALDVLRMTN SQ