Gene Ndas_4932 details

Gene Information

Locus tag: Ndas_4932
Symbol:
ID: 9248819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 68240
End bp: 70198
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: aconitate hydratase
Protein accession: YP_003682821
Protein GI: 297563848
COG category:
COG ID:
TIGRFAM ID:
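
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 68240
end_bp = 70198

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1959
print(protein_length)  # 652
```

Under these assumptions the results match the Gene Length (1959 bp) and Protein Length (652 aa) fields.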


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.450299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACGCG GAGTCGCGCG CAAGCTGATC GACGCGCACC TGGTCGAGGG GGAGATGGCG 
CCCGGGGAGG TCATCGGGCT CTCCGTCGAC CAGACCCTCA CCCAGGACGC CACCGGCACC
CTGGTCATGC AGGAGCTGGA GGCCCTCGGA CTGGACCGCA CCCGGGCCCG GGTCAGCGTG
CAGTACGTGG ACCACAACCT GCTCCAGACC GACGAGAAGA ACGCGGAGGA CCACGCGTTC
CTGCACTCGG CGGCGCGCCG GTACGGTCTG TGGTACTCCA AGCCCGGCAA CGGCGTCTCC
CACCCCACGC ACATGCAGCG CTTCGGGGTG CCCGGCGCGA CGATGGTCGG CTCGGACTCC
CACACCTGCG CCGCCGGATC GCTGGGCATG CTGGCGGTCG GCGTCGGCGG CCTGGAGGTG
GCCATGGCCA TCGCCGGGCG CCCGCTGCGC GTCCGGGCGC CGCTGATCTG GGGCGTGCGC
CTGACCGGGG AGCTGCCCCC GTGGACCTCG GCCAAGGACG TCATCCTGGA GATGCTGCGG
CGGCACGGGG TCAAGGGCGG GCTCAACCGG ATCATCGAGT ACCACGGTCC CGGAGTCGCC
GCGCTCACCG CCATGGACCG CCACGTCATC GCGAACATGG GCGCCGAGCT GGGCGCCACC
ACCACGGTCT TCCCCTCCGA CGGCGCGGTG CGCGACTTCC TGCGCGCCGA GGGCCGTGAG
GACGACTTCA CCGAGCTGCT GCCCGACGAC GACGCCGACT ACGACGTGAC CGACGAGATC
GACCTGTCCG GGGTGGAGCC GCTCATCGCC CGGCCCTCCT CGCCCGGCGA CGTCGTCCCG
GTCCGCGAGG TGGCGGGCAC CGACGTCAGC CAGGTCGTCA TCGGCTCCTC CGCCAACCCG
GGACTGCGCG ACTACGCGGT CGCCGCCGCC ATGGTCAGGG GCCGCCAGAC CGACAGCGCG
GTCAGCTTCG ACGTCAACCC GTCCTCACGC CAGATCCTCT CCGACCTGAC CCGCACGGGA
GCGACCCTCG ACCTCATCCA GGCCGGGGCC CGCATCCACC AGGCCGGATG CCTGGGCTGC
ATCGGCATGG GCCAGGCGCC CGCCGTCGGC CGCAACTCGC TGCGCACCTT CCCGCGCAAC
TTCCCCGGCC GCTCGGGCAC GGTGGAGGAC GCGGTGTGGC TGTGCTCCCC GGAGACGGCC
GCCGCCTCCG CGCTCACCGG AGTCATCACC GACCCGCGCG ACCTGGCGAG GGAGCTGGGC
CTGGACCATC CCGACCTGCG TCCGCCGGAG CGGGCGGCGG TCAACACCGC GATGCTGGTC
CCGCCGCTGC CGCCGGAGGA GGCCGCGCGG GTCGAACTCG TCAAGGGGCC CAACATCTCC
GCGCTGCCGG ACTTCCCGCC CCTGCCCGAC CGGATCGAGG CGCCGGTGCT GCTCAAGGTC
GGCGACGACG TCTCCACCGA CGAGATCTCC CCGGCGGGCG CCCGCGCGCT GCCCTTCCGC
TCGAACGTGC CCCGGCTCGC CGAGTTCACC TTCACCCGTA TCGACGAGGA CTACCCCCGC
CGGGCGAGGG AGGCGGGCCG CGAGAGGGGC CACATCGTGG TCGGAGGGGA GAACTACGGG
CAGGGCTCCT CCCGCGAGCA CGCGGCGATC ACCCCCCGCT ACCTGGGCCT GCGGGCGGTG
ATCGCCAAGT CCTTCGCCCG CATCCACTGG CAGAACCTCG CCAACTTCGG TGTGCTGGCC
CTGCGGTTCA CCGAACCCGG CGACTACGAC CGCGTCGGCT CCGGCGACGT CCTCGTCCTG
GACGGCCTGG AGGAGGCCCT GACCTCCGGC TCGGAGCTGA CCGCGCGCAA CACGACGAGG
GACGAGGAGT ACCGGCTGGA CCACCAGCTC TCCCCGGCGC AGGCCGAGGC GGTGCTGGCG
GGCGGCCGGA TCCCGCTGCT GGCCCGGGAG CTGGACTGA
 
Protein sequence
MGRGVARKLI DAHLVEGEMA PGEVIGLSVD QTLTQDATGT LVMQELEALG LDRTRARVSV 
QYVDHNLLQT DEKNAEDHAF LHSAARRYGL WYSKPGNGVS HPTHMQRFGV PGATMVGSDS
HTCAAGSLGM LAVGVGGLEV AMAIAGRPLR VRAPLIWGVR LTGELPPWTS AKDVILEMLR
RHGVKGGLNR IIEYHGPGVA ALTAMDRHVI ANMGAELGAT TTVFPSDGAV RDFLRAEGRE
DDFTELLPDD DADYDVTDEI DLSGVEPLIA RPSSPGDVVP VREVAGTDVS QVVIGSSANP
GLRDYAVAAA MVRGRQTDSA VSFDVNPSSR QILSDLTRTG ATLDLIQAGA RIHQAGCLGC
IGMGQAPAVG RNSLRTFPRN FPGRSGTVED AVWLCSPETA AASALTGVIT DPRDLARELG
LDHPDLRPPE RAAVNTAMLV PPLPPEEAAR VELVKGPNIS ALPDFPPLPD RIEAPVLLKV
GDDVSTDEIS PAGARALPFR SNVPRLAEFT FTRIDEDYPR RAREAGRERG HIVVGGENYG
QGSSREHAAI TPRYLGLRAV IAKSFARIHW QNLANFGVLA LRFTEPGDYD RVGSGDVLVL
DGLEEALTSG SELTARNTTR DEEYRLDHQL SPAQAEAVLA GGRIPLLARE LD
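
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and its GC percentage computed. The helper names `clean_sequence` and `gc_content` are hypothetical, and the demo uses only the first line of the gene sequence rather than the full listing.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used only as a short demo.
first_line = "ATGGGACGCG GAGTCGCGCG CAAGCTGATC GACGCGCACC TGGTCGAGGG GGAGATGGCG"
bases = clean_sequence(first_line)
print(len(bases))                    # 60
print(f"{gc_content(bases):.1f}%")   # GC percentage of this 60-base fragment
```

Passing the full gene sequence listing through the same two functions would give a value that can be compared with the GC content field in the Gene Information section.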