Gene Ndas_0637 details


Gene Information

Locus tag: Ndas_0637
Symbol:
ID: 9244479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 783641
End bp: 784927
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: glutamate/cysteine ligase family protein
Protein accession: YP_003678589
Protein GI: 297559615
COG category:
COG ID:
TIGRFAM ID:
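
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the record does not state either convention. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 783641
END_BP = 784927


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Both values agree with the Gene Length (1287 bp) and Protein Length (428 aa) reported above.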


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.730981
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCACC ACATGACCAC GGAAGACGTC CACGAGTACA TCAACGGCGT TTGCTTCAAG 
ACCGGCCCTC CCGGCAAGGT CGGAGCCGAG ACCGAATGGC TGGTCGCCGA CTCCGCGGAC
CCGACCGCAC CCGTTCCCGT CGACCGCCTC GCGGCCCTGG TGGAGTCCTG CGGCCCGCCG
CCCTCGGGAA GCGGCGTCAC CTTCGAACCC GGCGGACAGC TCGAACTCAG CTCGCCCGCC
CTTCCGGGAC CGGCCCGGGC GCACGCGGCG CTCTCCGCCG ACCTGGACCA CGTCGGCAAG
GCCCTCGCCG AGGCCGGGCT CCACCTGGTC GAGACGGCCC TCGACCCCGC CCGCCCGCCC
CGGCGCCAGC TCCGTGAAGC CCGCTACACG GCCATGGAGC GCTTCTTCGC CCACCACCGC
CAGCCCAGCG GCTACACCAT GATGTGCAGC ACCGCCTCGC TCCAGGTGTG CCTGGACACG
GGGGAGGACG CCGCCGACGT CCGCGACCGC TGGGAACTCG TCCACCGGCT CGGTCCGGTG
CTCGTCGCCG CTTTCGCCAA CTCCGCCGTC TGGCGGGGCC GCCCCACCGG GTGGAAGTCC
ACCCGGTGGG CGATCTGGGC GGCCACCGAC GCCACCCGCA CCCGGCCCGT CCTGGACGCG
GGCAGCCCCG CCGACCCCGC CACGGCGTGG GCCGAGTACG CGCTGGCCGC CAGGGTCATG
GCCGTCCCCG GAGGCGAGGG CCCCTGGACC CCCGACCCGG GCGTGACCTT CTCCCAGTGG
CTCGACGGGC AGGGTCCGCG CCCGGCCACC CGGGCCGACC TGGAGTTCCA CCTCAGCACT
CTCTTCCCGC CCGTGCGCCC GCGCGGCTGG TGGGAACTGC GGATGATCGA CGCGCTGCCC
GTGCGCTGGT GGCCGGTCCC GGTGGCGCTC GCCGCCGCCC TGGTCGACGA CCCCCGCGCC
CGCGCCGCGG CCGAGGAGGC CACCGAGGAG CTGTGCGGGG GGCGCTCCCC CGACCGCCAC
CTGTGGCTGC GCTCGGCGCG CCTGGGCATG GCCGACCCCG ACGTCGCCCG GTGCGCCCGC
GCCTGCTTCG ACGCCGCCAT CGAGGCGCTG CCCCGCATGG GCGCCGCCTC CCTGGCCGCG
CTGGTCGACG ACTACGCCGA CCACCACACG CGGCGCGGGC TCAGCCCCGC CGACACCCGG
CCGACCGTGC CCGTACCGCG CCCGGCGCGG GGGGAGCGCG TACGGTCCGC GGCCGCGGGC
CCCGGACACG TGGAAGGAGC ACGGTGA
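
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction like the one behind the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste all lines for the whole gene.
first_line = "ATGGCCCACC ACATGACCAC GGAAGACGTC CACGAGTACA TCAACGGCGT TTGCTTCAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Running the same functions on the full 1287-base sequence allows a direct comparison with the reported GC content.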
 
Protein sequence
MAHHMTTEDV HEYINGVCFK TGPPGKVGAE TEWLVADSAD PTAPVPVDRL AALVESCGPP 
PSGSGVTFEP GGQLELSSPA LPGPARAHAA LSADLDHVGK ALAEAGLHLV ETALDPARPP
RRQLREARYT AMERFFAHHR QPSGYTMMCS TASLQVCLDT GEDAADVRDR WELVHRLGPV
LVAAFANSAV WRGRPTGWKS TRWAIWAATD ATRTRPVLDA GSPADPATAW AEYALAARVM
AVPGGEGPWT PDPGVTFSQW LDGQGPRPAT RADLEFHLST LFPPVRPRGW WELRMIDALP
VRWWPVPVAL AAALVDDPRA RAAAEEATEE LCGGRSPDRH LWLRSARLGM ADPDVARCAR
ACFDAAIEAL PRMGAASLAA LVDDYADHHT RRGLSPADTR PTVPVPRPAR GERVRSAAAG
PGHVEGAR
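
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as starts). It translates the first codon literally, which is adequate here because the gene starts with ATG. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 27 bases of the gene sequence above.
print(translate("ATGGCCCACC ACATGACCAC GGAAGACGTC"))  # MAHHMTTEDV
print(translate("CCCGGACACG TGGAAGGAGC ACGGTGA"))     # PGHVEGAR
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.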