Gene Noca_4836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4836 
Symbol 
ID4595438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp165812 
End bp166780 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content69% 
IMG OID639772623 
ProductD-cysteine desulfhydrase 
Protein accessionYP_919283 
Protein GI119714141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2515] 1-aminocyclopropane-1-carboxylate deaminase 
TIGRFAM ID[TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGC CGGACCTGCC CCGTCTGCAC CTGGTGCTGG CGCCGACGCC GCTCGTCCAC 
GCGCCCCGCC TGTCCGAGGC CGTCGGTGTT GAGGTGTGGT TCAAGCGAGA CGACCTGACC
GGCCGAGGCC TCGGTGGCAA CAAAGTCCGC ACCCTGGAGT ACCTCCTCGG GGACGCTGTG
GCCAAGGGAT GCGACGCACT GGTGACCGGG GCCGGCCCGC AGTCCAACTG GGCGATGCTC
GCGGCACTGT CCGCCCGCAC CGCCGGGTTC TCGCCGCACC TTGTCTTCTA TGGTGACCCA
CCTGAGGCGA GCGGCAACCT GTTGCTCACC CAGGTGACGT GTACAGACAT CCGCTACACC
GGTGAGCTCG ACCGCTGCTC TGTCGACTCC ATGCTGGGCA AGGTCGCCGA CGAGCTCGTC
GCGGCCGGTC GCTTTCCCTA TGTCGTGCCG AGAGGCGGGG CCACCCCGTT GGGGTGCCTG
GGCTATCTCC GGGCCGCTGT GGAGCTCGTG CGGCAACTGC CGGAGGTGGG TGTCGACCCC
GCAACGCTCT GGGTGCCGAC CGGATCCGGC GGCACCCAGG CAGGACTTCT CGCCGGGGCG
CACTGGTTGG GATGGGACGT TGCGGTGGTC GGGGTCGCCA CCAGCCGGAC TCCTGAAGAG
GCCCAGGTCC GCGTTGGCGA GCTCGCCTCG GCCACTCTCG AGCTGCTCGA CGCCGATGAC
ACGGCTCGAG CAGCGCCGCA CGTCCTCGGC GGGTTCCTCG GCGACGGCTA TGGCGAAGTC
TCGCCAGCGG GTACCGCGGC GGCCGAGCTC GTCGCGCGGA CCGAGGGCAT CTTCCTCGAC
CCGGTGTTCG GCGCCAAGGC GATGGCGGCT CTCTTGGCTG AAGTCCGGCA TGGAACCGTC
CGAGGGCCGG TGATCTTCCT GGTCACCGGA GGCGCACCCA CCCTCTTCAT GAAGGGCACG
GAACTGTGA
 
Protein sequence
MNLPDLPRLH LVLAPTPLVH APRLSEAVGV EVWFKRDDLT GRGLGGNKVR TLEYLLGDAV 
AKGCDALVTG AGPQSNWAML AALSARTAGF SPHLVFYGDP PEASGNLLLT QVTCTDIRYT
GELDRCSVDS MLGKVADELV AAGRFPYVVP RGGATPLGCL GYLRAAVELV RQLPEVGVDP
ATLWVPTGSG GTQAGLLAGA HWLGWDVAVV GVATSRTPEE AQVRVGELAS ATLELLDADD
TARAAPHVLG GFLGDGYGEV SPAGTAAAEL VARTEGIFLD PVFGAKAMAA LLAEVRHGTV
RGPVIFLVTG GAPTLFMKGT EL