Gene Ndas_3000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3000 
Symbol 
ID9246853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3584914 
End bp3586590 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content74% 
IMG OID 
Producturocanate hydratase 
Protein accessionYP_003680916 
Protein GI297561942 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.136861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.10397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGTC CACGTACCGT CCGCGCCGCG CGCGGCACCA CCCTGTCCGC GAAGGGGTGG 
CAGCAGGAGG CCGCCCTGCG CATGTTCCAC AACAACCTCG ACCCCGAGGT CGCCGAACGC
CCCGAGGAGC TGGTGGTCTA CGGCGGCACG GGCAGGGCGG CGCGCGACTG GCACAGCTTC
GACCGCATCA CCGCATCCCT GCGCGACCTG GAGGGCGACG AGACCCTCCT CGTGCAGTCC
GGCCGCCCGG TCGGGATCAT GCGCACCCAC GAGTGGGCTC CCCGCGTGCT CATCGCCAAC
AGCAACCTGG TGGGGGACTG GGCGAACTGG CCCGAGTTCC GCCGCCTGGA GGCCCAGGGC
CTGACCATGT ACGGGCAGAT GACCGCCGGT TCGTGGATCT ACATCGGCAC CCAGGGCATC
CTCCAGGGCA CCTACGAGAC CTTCGCCGCC GTGGCCGCCA AGCTGAGCGG TTCGGGCAGG
CACAGCGGCG GGACCCTGGC CGGAACCATC ACCCTCACCG CCGGACTCGG CGGCATGGGC
GGCGCCCAGC CGCTGGCCGT GACCATGAAC GACGGCGTCG CGATCGTGGT CGAGTGCGAC
CCCAGCCGCA TCGAACGCCG CATCGAGCAC CGCTACCTGG ACATGCGCGC CGACAGCCTC
GACCACGCCC TGGAACTGGC CGTGCGGGCG CGCGACGAGC GCCGCCCGCT GTCGGTCGGA
GTGCTCGGCA ACGCCGCCGA GGTCCTGCCC GAACTGCTCT CGCGCGGCGC ACCGATCGAC
GTGGTCACCG ACCAGACCTC CGCCCACGAC CCCCTCGCCT ACCTGCCCGC CGGGGTGGCC
TTCGAGGACT GGCGGGACCT GGCCGCCGCC AGGCCCGAGG AGTTCACCGA CCGGGCCCGC
GAGTCCATGG CCGCGCACGT GGAGGCCATG GTGGGCTTCC AGGACGCGGG GGCCGAGGTC
TTCGACTACG GCAACTCCAT CCGCGACGAG GCCCGCAGCG GCGGCTTCGC CCGCGCCTTC
GACTTCCCCG GGTTCGTCCC CGCCTACATC CGGCCCCTGT TCTGCGAGGG GAAGGGGCCG
TTCCGGTGGG CGGCGCTGTC GGGCGACCCC GCCGACATCG CCCGCACCGA CCGGGCCGTC
CTGGACCTGT TCCCCGACAA CGAGCACCTG GCCCGGTGGA TCCGCATGGC GGGGGAGCGG
GTCGCCTTCC AGGGGCTGCC CGCCCGCATC TGCTGGCTCG GCCAGGGCGA GCGGGCCGCC
GCCGGGGCGC GCTTCAACGA ACTGGTGGCC TCCGGCGAGG TCCGCGCGCC GCTGGCCATC
GGCCGCGACC ACCTGGACAC CGGCTCGGTG GCCTCCCCCT ACCGCGAGAC CGAGGGGATG
CGGGACGGCT CCGACGCCAT CGCGGACTGG CCGCTGCTCA ACGCCCTGGT CAACACCTCC
TCCGGCGCCT CGTGGGTGTC CATCCACCAC GGCGGGGGAG TGGGCATGGG CCGCTCGCTG
CACGCCGGGC AGGTCAGCGT CGCGGACGGC ACCGACCTGG CCGCCGCCAA GCTGGAGCGG
GTGCTCACCA ACGACCCCGC CACGGGCGTC ATCCGCCACG TGGACGCCGG GTACGACGAG
GCCGGGCGCG TCGCGCGCGA GCACGGCATC CGCGTCCCGA TGCGCGAGGG GGACTGA
 
Protein sequence
MSSPRTVRAA RGTTLSAKGW QQEAALRMFH NNLDPEVAER PEELVVYGGT GRAARDWHSF 
DRITASLRDL EGDETLLVQS GRPVGIMRTH EWAPRVLIAN SNLVGDWANW PEFRRLEAQG
LTMYGQMTAG SWIYIGTQGI LQGTYETFAA VAAKLSGSGR HSGGTLAGTI TLTAGLGGMG
GAQPLAVTMN DGVAIVVECD PSRIERRIEH RYLDMRADSL DHALELAVRA RDERRPLSVG
VLGNAAEVLP ELLSRGAPID VVTDQTSAHD PLAYLPAGVA FEDWRDLAAA RPEEFTDRAR
ESMAAHVEAM VGFQDAGAEV FDYGNSIRDE ARSGGFARAF DFPGFVPAYI RPLFCEGKGP
FRWAALSGDP ADIARTDRAV LDLFPDNEHL ARWIRMAGER VAFQGLPARI CWLGQGERAA
AGARFNELVA SGEVRAPLAI GRDHLDTGSV ASPYRETEGM RDGSDAIADW PLLNALVNTS
SGASWVSIHH GGGVGMGRSL HAGQVSVADG TDLAAAKLER VLTNDPATGV IRHVDAGYDE
AGRVAREHGI RVPMREGD