Gene Ndas_0083 details

Gene Information

Locus tag: Ndas_0083
Symbol:
ID: 9243914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 106572
End bp: 107948
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: cystathionine beta-synthase
Protein accession: YP_003678041
Protein GI: 297559067
COG category:
COG ID:
TIGRFAM ID:
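
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the CDS ends in a single untranslated stop codon; both are assumptions about how the record is encoded, not statements taken from it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 106572
end_bp = 107948
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
expected_protein_length = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions the span is a whole number of codons, and both comparisons agree with the listed Gene Length and Protein Length.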


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGGTAT ACGACTCACT GATCGACCTG GTCGGGGACA CTCCGCTCGT CCGTTTGAAC 
AAGGTCACCC AGGTCCTCGG CCCGAACGCC CCGACGGTCC TCGCCAAGGT CGAGTACTTC
AACCCGGGCG GCTCGGTCAA GGACCGCATC GCGCTGCGCA TGGTCGAGGC GGCCGAGAAG
AGCGGCGAGC TCAAGCCGGG CGGCACCATC GTCGAGCCGA CCTCGGGCAA CACCGGCATC
GGCCTGGCCA TCGTGGCCCA GGAGAAGGGC TACCGCTGCG TCTTCGTCTG CCCGGACAAG
GTCGGCCCGG ACAAGCTCTC GGTCCTGCGC GCCTACGGTG CCGAGGTCGT GGTCTGCCCG
ACCACCGTCG CCCCCGACCA CCCGGAGTCC TACTACTCCG TCTCCGACCG GCTGGCCACC
GAGATCCCCG GCGCCTGGAA GCCCAACCAG TACGAGAACA CGAACAACCC GGAGTCGCAC
TACCACAGCA CCGGCCCCGA GATCTGGGAG CAGACCGAGG GCCGCATCAC CCACTTCGTG
GCGGGCATCG GCACCGGCGG CACCATCAGC GGGACCGGCC GCTACCTCAA GGAGGTCTCC
GGCGGCGCGG TGCGTATCGT CGGCGCCGAC CCCGAGGGCT CGGTCTACTC CGGCGGCTCG
GGCCGTCCGT ACCTGGTCGA GGGCGTGGGC GAGGACATCT GGCCCGGCAC CTACGACACC
TCGGTGTGCG ACGACATCGT GGCGGTCAGC GACAAGGACT CCTTCCTCAT GACCCGCCGC
CTGGCCCGGG AGGAGGCCCT GCTGGTGGGC GGCTCCTGCG GCCTGGCCGT CGAGGCGGCC
CTGCGCGTGG CCCGGGACGC CGGTCCCGAC GACGTCGTCG TCGTGCTGCT GCCCGACGGC
GGCCGCGGCT ACCTCGGCAA GATCTTCAAC GACGAGTGGA TGGCCGACTA CGGCTTCCTG
TCGGCCGAGA CCGAGGAGGC CACGGCGGGC CAGGTGCTGG CCGCCAAGGG CGGGGAGATG
CCCGACTTCG TGCACGCCCA CCCGGACGAG ACCGTCGGCA CGGCCGTGGC CATCATGCGC
GAGTACGGCG TCTCCCAGGT GCCGGTGATG AAGGAGGAGC CCCCGGTCAT GGCCGCCGAG
GTGGTCGGCT CCATCGCCGA GCGCGACCTC CTCGACGCCC TCTTCGACGG CCGCGCGCAG
CTGGACGACC TGGTGGAGAC CCACATGGGC AGGCCGCTGG CCACCGTGGG CACCGGCACC
CCCGTGAGCG ACTGCGTCCG CCTGCTGCGC GGGTCCGGCG CCCTGGTGGT GCTGCGCGAC
GGCAAGCCGG TGGGGATCCT CACCCGCCAG GACCTGCTGG CCCACCTGTC GGGCTGA
 
Protein sequence
MRVYDSLIDL VGDTPLVRLN KVTQVLGPNA PTVLAKVEYF NPGGSVKDRI ALRMVEAAEK 
SGELKPGGTI VEPTSGNTGI GLAIVAQEKG YRCVFVCPDK VGPDKLSVLR AYGAEVVVCP
TTVAPDHPES YYSVSDRLAT EIPGAWKPNQ YENTNNPESH YHSTGPEIWE QTEGRITHFV
AGIGTGGTIS GTGRYLKEVS GGAVRIVGAD PEGSVYSGGS GRPYLVEGVG EDIWPGTYDT
SVCDDIVAVS DKDSFLMTRR LAREEALLVG GSCGLAVEAA LRVARDAGPD DVVVVLLPDG
GRGYLGKIFN DEWMADYGFL SAETEEATAG QVLAAKGGEM PDFVHAHPDE TVGTAVAIMR
EYGVSQVPVM KEEPPVMAAE VVGSIAERDL LDALFDGRAQ LDDLVETHMG RPLATVGTGT
PVSDCVRLLR GSGALVVLRD GKPVGILTRQ DLLAHLSG
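
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and that an initiator codon such as GTG is read as methionine. The names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first and last rows of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, starts_at_initiator=True):
    """Translate an in-frame nucleotide string; whitespace is ignored."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3)]
    # Assumption: the initiator codon (here GTG) is read as methionine.
    if starts_at_initiator and residues:
        residues[0] = "M"
    return "".join(residues)

# First row of the gene sequence (60 nt, starts at the GTG initiator).
first_row = "GTGCGGGTAT ACGACTCACT GATCGACCTG GTCGGGGACA CTCCGCTCGT CCGTTTGAAC"
# Last row of the gene sequence (57 nt, in frame, ends with the stop codon).
last_row = "GGCAAGCCGG TGGGGATCCT CACCCGCCAG GACCTGCTGG CCCACCTGTC GGGCTGA"

n_terminus = translate(first_row)
c_terminus = translate(last_row, starts_at_initiator=False)
print(n_terminus)  # MRVYDSLIDLVGDTPLVRLN
print(c_terminus)  # GKPVGILTRQDLLAHLSG*
```

The two printed fragments correspond to the first 20 and last 18 residues of the protein sequence above, with the trailing `*` marking the TGA stop codon.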