Gene Ndas_0319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0319 
Symbol 
ID9244154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp395244 
End bp396302 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content72% 
IMG OID 
Productthreonine synthase 
Protein accessionYP_003678273 
Protein GI297559299 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.147741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGGG CGTGGCGAGG CATCGTCGAG GAGTACCGCG ACCGCCTCCC CGTCAACGAG 
AGCACCCCCG TTGTCACCCT CCAGGAGGGC GGCACGCCCC TGCTGCCCGC CACGCGCGTG
TCCGAGCTCA CGGGCTGCGA GGTCTTCCTC AAGGTCGAGG GGCTCAACCC CACCGGGTCC
TTCAAGGACC GCGGCATGAC CATGGCCATC ACCAAGGCCG CCGAGGACGG CGCCAAGGCC
GTCATCTGCG CCTCCACCGG CAACACCAGC GCCAGCGCCG CCGCCTACGC CATCCGCGCG
GGCATGACCT GCGCCGTGCT GGTGCCCCAG GGCAAGATCG CCATGGGCAA GCTGGCCCAG
GCCCTCGTCC ACGGCGCCCG CCTGCTCCAG GTCGACGGCA ACTTCGACGA CTGCCTCGAA
CTGGCCCGCA AGCTCAGCGT GGACTACCCG GTCGCCCTGG TGAACTCGGT CAACCCCTAC
CGCCTCCAGG GGCAGAAGAC CGCCGCCTTC GAGATCGTCG ACGCCCTCGG CGACGCCCCC
GACGTCCACT GCATCCCCGT GGGCAACGCG GGCAACATCA CCGCCTACTG GATGGGCTAC
ACCGAGTACT CCAGGGACGG GATCTCCACC CGCAACCCGC GCATGCTCGG CTTCCAGGCC
AGCGGCTCCG CGCCCATCGT CAACGGCGCG CCCGTCACCA GCCCGAGCAC CATCGCCACC
GCCATCCGCA TCGGCAACCC GGCCTCCTGG AAGCTGGCCG AGCAGGCCCG CGACGAGTCC
GGCGGCCTCA TCGACAAGGT CACCGACCGC CAGATCATGG CCGCCTACAA GCTCCTCGCC
GCCGAGGAGG GCGTGTTCGT GGAGCTGGCC TCCGCCGCCA GCGTGGCCGG TCTGCTCCAG
TCCGTCCAGG CGGGGCTGGT CGAGCCCGGC AGCCGCGTGG TGTGCACCGT GACCGGCAAC
GGCCTCAAGG ACCCCGACTG GGCGCTGGCC GGAGCCTCCT CCGCCACCAC CGTCCCGGTC
GACGCCCTCG CCGCGGCCCA GGCCCTCGAC CTGGCCTGA
 
Protein sequence
MARAWRGIVE EYRDRLPVNE STPVVTLQEG GTPLLPATRV SELTGCEVFL KVEGLNPTGS 
FKDRGMTMAI TKAAEDGAKA VICASTGNTS ASAAAYAIRA GMTCAVLVPQ GKIAMGKLAQ
ALVHGARLLQ VDGNFDDCLE LARKLSVDYP VALVNSVNPY RLQGQKTAAF EIVDALGDAP
DVHCIPVGNA GNITAYWMGY TEYSRDGIST RNPRMLGFQA SGSAPIVNGA PVTSPSTIAT
AIRIGNPASW KLAEQARDES GGLIDKVTDR QIMAAYKLLA AEEGVFVELA SAASVAGLLQ
SVQAGLVEPG SRVVCTVTGN GLKDPDWALA GASSATTVPV DALAAAQALD LA