Gene Ndas_3225 details


Gene Information

Locus tag: Ndas_3225
Symbol:
ID: 9247082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3855384
End bp: 3856718
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: beta-galactosidase
Protein accession: YP_003681137
Protein GI: 297562163
COG category:
COG ID:
TIGRFAM ID:
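
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3855384
end_bp = 3856718

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1335
print(protein_length)  # 444
```

Both results agree with the Gene Length and Protein Length fields of the record.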


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCAC CGTTCCTGTG GGGAGTGGCC TCCTCCGCCT TCCAGATCGA GGGCGCCCTC 
GACGCGGACG GGCGCGGCCC GTCGGTGTGG GACGTCTTCG CCGAGCGCCC GGGCGCGGTC
CGCGACGGGC ACAGCCCCGC CCCGGCCTGC GACCACTACC ACCGGTGGGC CGAGGACGTG
GAACTGCTCG ACCGGCTCGG GGTGAACGCC TACCGGTTCT CGCTCGCCTG GCCCCGGGTC
GTACCGACCG GGCGCGGCGC GGTGAACGGG GCCGGCCTGG ACTTCTACGA CCGCCTGGTG
GACGCGCTGC TGGCCCGGGG GATCACCCCG GTGCCGACGC TGTTCCACTG GGACCTGCCC
CAGGCGCTGG AGGATGCGGG CGGTTGGAGC GAACGCGACA CGGCGTACGC GTTCGCCGAG
TACGCGGCGG CGGCCTCCGA CCGCCTCGGC GACCGGGTGG ACCGGTGGAT CACCCTGAAC
GAGCCCCTGG TGCACACCAC CTACGGGCAC GCGCTCGGCG TCCACGCCCC GGGGCGGACC
CTGGCGGTGC CCGAGGTGAT GCGGGTGGCC CACCACATGC TGCTCGCGCA CGGGCTGGCC
GCCGGGGAGC TGCGCTCGCG CGGCCTGGAG GCACTGCTCA CCAACAACTA CTCCCCCGTC
AGCCCCGCCA CCGGCTCCGA GGCCGACGCC GCCGCCGCGC ACGCCTACGA CACCCTGCAC
AACCGGCTGT TCACCGACCC CGTGCTGACC GGCGCCTACC CGGACCTGTC GGCCTTCGGC
GTCGCGGAGG TCCCCGGCGT GCGCGAGGGC GACCTGAAGG CGGTCGCGGG CAGCGCGGAC
GGCCTCGGGG TCAACTACTA CAACCCCACC GTGGCCACCG CGCCCGACGA GGGGTCGGGA
CTGCCGTTCG GGTTCGGTGA GGTCGCCGGG GCGCCGGTGA CCGCGTTCGG GTGGCCGGTG
GTGCCCGAGG GGCTGGGCCG GATGATCGAC CTGCTGCGCG AGCGCCACGG TGAGGCGCTG
CCGCCGCTGT ACGTCACCGA GAACGGCTGT TCCCACGAGG ACCGGGTCTC CCCCGGAGGG
CGGATCGCCG ACCCCGAGCG GATCGCCTAC CTGGAGGGGC ACGTGGCCGC CGTGGAGGCC
GCGCGGGAGC GGGGCGCGGA CGTGCGCGGG TACTTCGTGT GGACGCTCAC CGACAACTTC
GAGTGGGCCG AGGGCTACCA CCAGCGCTTC GGGCTGGTGC ACGTGGACCA CGCCACCCAG
GCGCGCACCC CCAAGGACTC CTTCGCCTGG TACCGGGACC TCGTCGCCGC CAGGACCGCG
TCAGCTGGTG CGTGA
 
Protein sequence
MSSPFLWGVA SSAFQIEGAL DADGRGPSVW DVFAERPGAV RDGHSPAPAC DHYHRWAEDV 
ELLDRLGVNA YRFSLAWPRV VPTGRGAVNG AGLDFYDRLV DALLARGITP VPTLFHWDLP
QALEDAGGWS ERDTAYAFAE YAAAASDRLG DRVDRWITLN EPLVHTTYGH ALGVHAPGRT
LAVPEVMRVA HHMLLAHGLA AGELRSRGLE ALLTNNYSPV SPATGSEADA AAAHAYDTLH
NRLFTDPVLT GAYPDLSAFG VAEVPGVREG DLKAVAGSAD GLGVNYYNPT VATAPDEGSG
LPFGFGEVAG APVTAFGWPV VPEGLGRMID LLRERHGEAL PPLYVTENGC SHEDRVSPGG
RIADPERIAY LEGHVAAVEA ARERGADVRG YFVWTLTDNF EWAEGYHQRF GLVHVDHATQ
ARTPKDSFAW YRDLVAARTA SAGA
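
The sequences above are printed in space-separated groups of ten with line breaks, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, as listed above.
first_line = "ATGTCGTCAC CGTTCCTGTG GGGAGTGGCC TCCTCCGCCT TCCAGATCGA GGGCGCCCTC"
last_line = "TCAGCTGGTG CGTGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first), first[:3])  # 60 ATG
print(last[-3:])              # TGA
print(round(gc_fraction(first), 2))
```

Passing the full gene sequence block through `clean_sequence` and then `gc_fraction` should give a value comparable to the GC content field, and the cleaned length can be compared against the Gene Length field.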