Gene Ndas_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3034 
Symbol 
ID9246887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3623015 
End bp3624022 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content76% 
IMG OID 
ProductHAD-superfamily hydrolase, subfamily IIA 
Protein accessionYP_003680950 
Protein GI297561976 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.224813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.86283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTGC TCGCCGCAGA CCGCCCCCTG AACGAGATCC ACGACGCGAT GCTCCTGGAC 
CTGGACGGGG TCGTCTACAT CGGCCCGAGG GCCGTACCGG CGGCACCGGA GGCGGTGGGC
AAGGCGCGGG CCGCCGGAGC GCGGGTGGCG TTCGTGACCA ACAACGCCGG GCGCACCCCG
GCGCGCATCG CCGAGCACCT GACCGAACTG GGCGTGGGCG CCGCCCCCGG GGACGTGGTG
ACCTCCGCGG AGGCCGCCGC CCGCCTGGTC GGCGAACACC ACCCCGCGGG TTCGGACGTA
CTGGTGGTGG GCGACACCGC GCTGCGCCAG GCGGTGCGCA GGATGGGGCT GCGCCCGGTG
TCGGTCGACA GCCCCTCGGT GGTGGCCGTG GTGCAGGGCT ACTCCCGGCA CATGACCCGC
GACCTGCTCG ACCAGGGCAC GGTCGCGGTC CGGCGCGGCG CGTTCTACGT GGCCAGCAAC
AACGACGCCA CCGCACCCAG CGAGTGGGGC CTGACCCCCG GCAACGGGTC CTTCGTCCGG
GTCATCGCCA ACGCCACCGG GGTCGAACCC GTCGTCGCCG GAAAGCCCAT GCGCCCCCTG
CACGAGGAGG GCATCCTGCG CACCGGAGCG CGCAACCCGC TGATCGTGGG CGACCGGCTG
GACACCGACA TCGAGGGCGC GACCGCCCAC GGCGCGGCGG GGATGCTGGT GCTGACGGGG
GTGGCCACCC CGATGGACGC CCTCGCCGCG CCCGAGCACC AGCGCCCCAG CTACCTGGCG
TGGGACCTGT CGGGCATGAA CCACACGCAC CCGGCCGTCG TCCGCGAGGG CGACCGCACC
CGCTGCGCGG GGTGGACGGT GACCGTCACC GGCGGCGCAC CACGCGTCGA GGGGGACGGG
GACCGGCTGG ACGGGCTGCG CGCGCTGTGC GTCGCGGTCT GGGCGGACCG GGCGGCGGAC
CCGTCCGGCC CGGCCGCACG CGAGGCGCTG TCCCGCCTGG GCTGGTGA
 
Protein sequence
MSLLAADRPL NEIHDAMLLD LDGVVYIGPR AVPAAPEAVG KARAAGARVA FVTNNAGRTP 
ARIAEHLTEL GVGAAPGDVV TSAEAAARLV GEHHPAGSDV LVVGDTALRQ AVRRMGLRPV
SVDSPSVVAV VQGYSRHMTR DLLDQGTVAV RRGAFYVASN NDATAPSEWG LTPGNGSFVR
VIANATGVEP VVAGKPMRPL HEEGILRTGA RNPLIVGDRL DTDIEGATAH GAAGMLVLTG
VATPMDALAA PEHQRPSYLA WDLSGMNHTH PAVVREGDRT RCAGWTVTVT GGAPRVEGDG
DRLDGLRALC VAVWADRAAD PSGPAAREAL SRLGW