Gene Ndas_0085 details


Gene Information

Locus tag: Ndas_0085
Symbol:
ID: 9243916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 109027
End bp: 110172
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Cystathionine gamma-synthase
Protein accession: YP_003678043
Protein GI: 297559069
COG category:
COG ID:
TIGRFAM ID:
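
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 109027
end_bp = 110172

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final codon being a stop
# that is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1146 381
```

Under these assumptions the computed values match the listed Gene Length (1146 bp) and Protein Length (381 aa).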


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.383004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCG ACGGGTTTGA AACGCTGGCC ATCCACGCGG GGCAGGAGCC GGACGCCGGA 
ACCGGGTCCG TGGTGGTGCC GATCTACCAG ACGAGCACCT ACGCCCAGGA CGGCGTGGGC
GGTCTGCGCC AGGGCTACGA GTACTCGCGC ACCGGCAACC CCACGCGCGC GGCCCTGGAG
GAGTGCCTGG CCGCCCTGGA GTCCGGGGTG CGCGGCCTGG CCTTCGCCTC CGGCATGGCC
GCCGAGGACA CCCTGCTGCG CACGGTGCTC TCGCCCGGCG ACCACCTGAT CATCCCCGGC
GACGCCTACG GCGGCACCTT CCGCCTGGTC TCCAAGGTGG TCGAGCGCTG GGGTGTGCAG
TGGGACGCGG TCGACCAGTC CGACCCCGAG GCCGTGCGCG CGGCCGTGCG GCCCAACACC
AGGGTGGTGT GGACCGAGAC GCCCACCAAC CCCCTGCTCA ACATCACCGA CATCGAGGCC
GTCGCGCAGA TCGCGCACGA CGCCGGCGCC CTGCACGTGG TCGACAACAC CTTCGCCTCG
TCCTACCTCC AGCAGCCGCT GACCCTGGGC GCGGACGTGG TCGTGCACTC CACCACCAAG
TACCTGGGCG GGCACTCCGA CGTGGTCGGG GGAGCGCTGG TGGTCTCCGA CGCCGAGCTG
GGCGAGCGGC TGGCCTTCCA CCAGAACACC ATGGGCGCGG TCCCGGGGCC GTTCGACTCC
TGGCTGACCC TGCGCGGGAT CAAGACCCTG GGCGTGCGCA TGGACCGGCA CAGCGCCAAC
GCCGAGAAGG TGGTGGCGGC CCTGGAGGGC CACCCCGCGG TGCGCCGGGT GTTCTACCCC
GGGTTGGACG CCCACCCGGG GCACAAGACC GCCGAACGGC AGATGAGGGC CTTCGGCGGC
ATGGTCTCCT TCGCCCTGCG CGACGGTGAG AAGGCGGCGC TCGCCCTGTG CGAGCGCACC
GAGGTCTTCA CCCTCGGCGA GTCCCTGGGC GGGGTGGAGT CCCTGATCGA GCACCCGGGT
CGGATGACGC ACGCGTCCAC CGCGGGCTCC CCGCTGGAGG TCCCGGCCGA CCTGGTGCGG
ATCTCCGTGG GCATCGAGTC CGCCGACGAC CTGGTGGCGG ACCTGCTCCA GGCCCTGGAG
GGCTAG
 
Protein sequence
MKFDGFETLA IHAGQEPDAG TGSVVVPIYQ TSTYAQDGVG GLRQGYEYSR TGNPTRAALE 
ECLAALESGV RGLAFASGMA AEDTLLRTVL SPGDHLIIPG DAYGGTFRLV SKVVERWGVQ
WDAVDQSDPE AVRAAVRPNT RVVWTETPTN PLLNITDIEA VAQIAHDAGA LHVVDNTFAS
SYLQQPLTLG ADVVVHSTTK YLGGHSDVVG GALVVSDAEL GERLAFHQNT MGAVPGPFDS
WLTLRGIKTL GVRMDRHSAN AEKVVAALEG HPAVRRVFYP GLDAHPGHKT AERQMRAFGG
MVSFALRDGE KAALALCERT EVFTLGESLG GVESLIEHPG RMTHASTAGS PLEVPADLVR
ISVGIESADD LVADLLQALE G
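
The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any length or composition check. The following is a minimal sketch of that step, not the database's own method: `clean_sequence` and `gc_percent` are hypothetical helper names, and the demo uses only the first and last lines of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Demo input: first and last lines of the gene sequence above (not the full gene).
demo = """
ATGAAGTTCG ACGGGTTTGA AACGCTGGCC ATCCACGCGG GGCAGGAGCC GGACGCCGGA
GGCTAG
"""

dna = clean_sequence(demo)
print(len(dna), round(gc_percent(dna), 1))
```

Pasting the complete gene sequence into `demo` would allow the result to be compared against the listed Gene Length and GC content fields.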