Gene Ndas_4219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4219 
Symbol 
ID9248093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5036900 
End bp5038360 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content72% 
IMG OID 
Productalpha/beta hydrolase fold protein 
Protein accessionYP_003682117 
Protein GI297563143 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCTCG CCCGAAGCGC CGCGCTGGCC GCGGCCTCCG GACTGTTGCT CACCGGCGTT 
GCCGCCCCCG CCTCGGCAAC CCCGGGCCCG GAACCGGAGT GGCGGCTGTG CTCCGACGTC
GCCCGGGGGT GGGACGGGAA CGACGACCGC ACCCTGTGCG CCACCGTCCC GGTGCCCTTG
GACCACGAGG ACCCCGACGG GCGCACGATC GGCATAGCGG TGACCCGTGT GCCCGCCACC
GGGGAGAACA CCTATCCCAT CCTGTTTAAC CCCGGCGGCC CGGGGCATCC GGGCGTGACC
ATGCCCGGGC GCATCCTCGA CAGCGAGGCC GCGGACCTGG CGCTGGACCA CGACCTCGTG
GGCTTCGATC CGCGCGGCGT GGGTTACAGC GACGCCGTGG AGTGCGGTCT GGAGGGGACC
GCCCCCGACC CCGGCCTGAG CGACGAGGAG AGCGCACGGC ACGTCGCCGA GGAGCAGAGC
CGGATCAATC GCGAATGCCA CGCCCGGGAT CCCGAGTTCG TGGACTCGCT GACCGCGGAG
AACGTGGCCC GGGACATGGA TCTGATCCGC GAGGCGCTGG GAGCGGAGAC GATCGGTTTC
TACGGAGTGT CCTGGGGAAC CCTGCTCGGC GCCGCCTACA GGTCGATGCA CGACGACCGG
GTCGAGGCCA TGCTCCTGGA CTCGGTGATG TCGCCCGAGG CCAGTGTCAC CATGTTGGAC
GAGGGGCAGG CCATGGCCGC CCAGGCCGCG TTCCACCGCT TCACCGACTG GCTGGCCGAG
CACGACGACC ACTACGGGCT CGGCACGGAG TCCGACCGCA TCCGGGACGA GGTCTACGGG
CTGCGGGAGG AACTGGCCGA TGAGCCCCGC ACCGGCCCCG ACGGGACGGT CGTCGACGGC
GGCGCCGTGA CCGCGCTGCT GGCCACCCCC GAACGCGAAT GGCCCGCCAA CGCCCGCTCC
CTCGTCACGC TCCTCGACGG AGGCGTGCCC GGGACAGGGG TCGCCCGCGG ACCGGTCTCC
GGCGCCGGTT GGGACTCCGA ACCCGTCTTC GACGCCTTCG CGCAGGTGTC ACTGCTCTGC
AACGACTCCG ACAGCCCGCG CGACTTCGAC CAGGTGTGGC AGCACCGGTT GGAGCGGGCC
GAACGGTACC CCGTCATGGG CACCCTGGGC TTCTACGAGC ACTCCTGTGT CGGCTGGCCC
GAGGAGGGCG CGGCTCCCGA CCTGACCCAC GGGGACAGCC CTCTGCAACT GGTGGGCCAC
GTCAACGAGA TGGTCACCCC GCACGACTGG GCGCTGGACA TGAGGCGGGT CGTCGGCGGA
GAGGTCATGA GTGTGGAGGA CGACGGGCAC GGGACCCTGT CGGGCCTGGA CTGCGCCGCG
GCGGCCGTGG ACTTCTTCAA CACCGGGCGG ACCACCACGC GAACGTGCCC GGGACCGCCC
GCGCCGACTC CCGAGGGCTG A
 
Protein sequence
MLLARSAALA AASGLLLTGV AAPASATPGP EPEWRLCSDV ARGWDGNDDR TLCATVPVPL 
DHEDPDGRTI GIAVTRVPAT GENTYPILFN PGGPGHPGVT MPGRILDSEA ADLALDHDLV
GFDPRGVGYS DAVECGLEGT APDPGLSDEE SARHVAEEQS RINRECHARD PEFVDSLTAE
NVARDMDLIR EALGAETIGF YGVSWGTLLG AAYRSMHDDR VEAMLLDSVM SPEASVTMLD
EGQAMAAQAA FHRFTDWLAE HDDHYGLGTE SDRIRDEVYG LREELADEPR TGPDGTVVDG
GAVTALLATP EREWPANARS LVTLLDGGVP GTGVARGPVS GAGWDSEPVF DAFAQVSLLC
NDSDSPRDFD QVWQHRLERA ERYPVMGTLG FYEHSCVGWP EEGAAPDLTH GDSPLQLVGH
VNEMVTPHDW ALDMRRVVGG EVMSVEDDGH GTLSGLDCAA AAVDFFNTGR TTTRTCPGPP
APTPEG