Gene Ndas_4205 details


Gene Information

Locus tag: Ndas_4205
Symbol:
ID: 9248079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5021142
End bp: 5022401
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: threonine synthase
Protein accession: YP_003682103
Protein GI: 297563129
COG category:
COG ID:
TIGRFAM ID:
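
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5021142
end_bp = 5022401

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1260 419"
```

Under these assumptions the result agrees with the listed Gene Length (1260 bp) and Protein Length (419 aa).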


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATTG CCGCCACCGA ACCCACCGCC GCCCGCTCCT TCGGCCCCGG CACCGCGCTC 
TCCTGTCGCG AGTGCGGCGA GCGCTACGAA CTCACCCCCC GGTTCGCCTG CGAGTTCTGC
TTCGGCCCCC TTGAGGTCGC CTACGACTTC GGGACCGTCA CCCGCGCCGA CATCGAGAGC
GGCCCCAAGA GCATCTGGCG CTACCGCTCC CTCCTGCCCG TCCCGGCCAA CGTCGCCGAG
CTGCCCAACA TGGCCCCCGG CCTGACCCCG CTGGTGCGGG CCGACCGCCT CGCGGCCGAG
CTGGGCCTGG ACTCCCTCCA CGTCAAGGAC GACTCCGGCA ACCCCACGCA CTCCTTCAAG
GACCGCGTGG TCGCCATCGC CGTCGAGGCC GCCCGCACCT TCGGGTTCAC CACCCTGTCC
TGCTCCTCCA CCGGCAACCT GGCCGGAGCC GTCGGCGCCG CCGCCGCGCG CGCCGGGTTC
GAGTCCTGCG TGTTCATCCC CGCCGGGCTG GAGGAGGCCA AGGTCGTCAT GGCCTCCGTC
TACGGCGGCA AGGTCGTGGC CATCGACGGC AACTACGACG ACGTCAACCG CTTCTGCTCC
GAGCTCATCG GCGACCCGGT GGGCGAGGGC TGGGGCTTCG TCAACGTCAA CCTGCGCCCC
TACTACGGCG AGGGCTCCAA GACGCTGGCC TACGAGATCG CCGAGCAGCT CGGCTGGCGC
CTGCCCGAGC AGATCGTCGT CCCGATCGCG TCCGGCTCCC AGCTCACCAA GATCGACAAG
GGCTTCCAGG AACTGGTCAA GCTCGGCCTG GTCGAGGACC GCCCGTACCG GATCTTCGGC
GCCCAGGCCA CGGGCTGCTC CCCGGTCGCG CAGGCCTGGG ACAAGGGCAT CGACGTCATC
CAGCCGGTCA AGCCCGACAC CATCGCCAAG TCGCTGGCCA TCGGCAACCC GGCCGACGGG
CCCTACGTGC TGGACATCGC CAAGCGCACG GGCGGATCGG TGGAGCACGT GGGCGACGAC
GAGATCGTCG ACTCCATCAA GCTCCTCGCC CGCACCGAGG GCATCTTCGC CGAGACCGCG
GGCGGCGTCA CCACCGGCGT GCTGCGCAAG CTCGTCCGCG AGGGCAGGCT CGACCCGAAG
GCCGAGACGG TCGTGCTCAA CACCGGTGAC GGGCTCAAGA CCCTGAACGC CGTCGACGCC
GGGGTGAGCG CCACGATCAA GCCGTCGCTG AGCGCCTTCA CCGACGCCGG TCTGGCCTAG
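
The GC content field can be recomputed from a sequence formatted like the one above (blocks of ten bases separated by spaces). This is a rough sketch, assuming GC content is simply the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and the sample is only the first line of the gene sequence, so its value is not expected to equal the whole-gene figure.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGGCGATTG CCGCCACCGA ACCCACCGCC GCCCGCTCCT TCGGCCCCGG CACCGCGCTC"
print(f"{gc_fraction(sample):.0%}")
```

Passing the full 1260 bp sequence instead of `sample` gives the value to compare against the GC content field.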
 
Protein sequence
MAIAATEPTA ARSFGPGTAL SCRECGERYE LTPRFACEFC FGPLEVAYDF GTVTRADIES 
GPKSIWRYRS LLPVPANVAE LPNMAPGLTP LVRADRLAAE LGLDSLHVKD DSGNPTHSFK
DRVVAIAVEA ARTFGFTTLS CSSTGNLAGA VGAAAARAGF ESCVFIPAGL EEAKVVMASV
YGGKVVAIDG NYDDVNRFCS ELIGDPVGEG WGFVNVNLRP YYGEGSKTLA YEIAEQLGWR
LPEQIVVPIA SGSQLTKIDK GFQELVKLGL VEDRPYRIFG AQATGCSPVA QAWDKGIDVI
QPVKPDTIAK SLAIGNPADG PYVLDIAKRT GGSVEHVGDD EIVDSIKLLA RTEGIFAETA
GGVTTGVLRK LVREGRLDPK AETVVLNTGD GLKTLNAVDA GVSATIKPSL SAFTDAGLA
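
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this, assuming the sequence is read in frame from its first base. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in which codons may act as start codons), so alternative start codons are not handled here. `translate` and `CODON_TABLE` are hypothetical names introduced for illustration.

```python
BASES = "TCAG"
# Amino acids in NCBI codon-table order: first base varies slowest,
# third base fastest, each cycling through T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGATTG CCGCCACCGA ACCCACCGCC"))  # prints "MAIAATEPTA"
```

The output matches the first ten residues of the protein sequence above. The same function can be applied to the full gene sequence to compare against the complete protein.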