Gene Noca_3142 details


Gene Information

Locus tag: Noca_3142
Symbol:
ID: 4600127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3345262
End bp: 3346401
Gene length: 1140 bp
Protein length: 379 aa
Translation table: 11
GC content: 77%
IMG OID: 639777748
Product: cysteine desulfurase
Protein accession: YP_924331
Protein GI: 119717366
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
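
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3345262
end_bp = 3346401
gene_length_bp = 1140
protein_length_aa = 379

# Assumption: coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```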


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCAGCG CAGACCCGGC GGGCGCTCCG CGGCGTCCGA CCTATCTCGA CTCGGCGTCC 
TCCGAGCCGC TCCACCCGGC GGCCCGGGAC ACCCTGCTCG CCGCGCTGGA GCGGGGGTAC
GCCGACCCGC GCCGGCTGCA CGGCCCGGCC CGCGACGCCC GCCTCCTGCT CGACAACGCC
CGCGCCGTGG TCGCCGAGTG TCTCGGGGTG CGCCCCGACG AGGTCACCTT CACGTCCTCG
GGCACCGACG CCGTCCACCG CGGGCTCCTG GGCCTGGTGC GGGCCTCGCG CCGCGGCGAC
GGCGTCGCCT ACTCCGCCGT CGAGCACTCC GCGGTGCTGC GGGCGGTGGC GTGGGGTGGC
ACCGGGCACG AGGTCGGCGC GCGGCCCGAC GGGCGGGTCG ACCCCGGGCT CCTCGCCGAG
GCCGCGGCGG CCGACGGGGT CGGGGTCGTC GCCCTGCAGA GCGCCAACCA CGAGGTCGGC
ACGGTCCAGC CGGTCGGCGA GCTCGAGCCC CGCGACGGCG TACCGGTCTT CGTCGACGCC
TGCGCGTCCA TGGGCCGGCT GCCACTCCCG GACGGTTGGA ACGTGGCGGC CGGGTCCGCG
CACAAGTGGG GCGGCCCGGC AGGGGTCGGG GTGCTGCTGG TGCGCAAGGG CACCCGATGG
CTCAACCCGT TCCCCGGGGA CGACCGGATC GACGAGCGCG CCGACGGGTT CGAGAACGTG
CCCGCCGCCC TCGCCGCCGC GGCGGCGCTC CGGGCGGTCG TCGCCGAGCG GGCCACCGTC
AACCCGCGCC AGCACGACCT GGTCGACCGG ATCCGCGCCG CCGCGGCGAA GATCCCCGAC
GTCGAGGTCG TCGGCGACCC GGTCGACCGG CTCCCCCACC TGGTCACCTT CTCCTGCCTG
TACGTCGACG GCGAGGCGCT GGTCACCGAG CTGGACCGGC GGGGGTACGG CGCGGCCAGC
GGCTCGGCGT GCACCTCCTC GACCCTGACC CCGAGCCGGG TGCTCGAGGC GATGGGCGTG
CTCACCCACG GCAACCTGCG GGTCTCCCTG ACCCGGGACA CCACCGAGCA GGACGTCGAG
GGCTTCCTCG AGGTGCTGCC ACAGGTGGTC CGCGACATCC GCGCCGAGGC CGGCCTGTGA
 
Protein sequence
MTSADPAGAP RRPTYLDSAS SEPLHPAARD TLLAALERGY ADPRRLHGPA RDARLLLDNA 
RAVVAECLGV RPDEVTFTSS GTDAVHRGLL GLVRASRRGD GVAYSAVEHS AVLRAVAWGG
TGHEVGARPD GRVDPGLLAE AAAADGVGVV ALQSANHEVG TVQPVGELEP RDGVPVFVDA
CASMGRLPLP DGWNVAAGSA HKWGGPAGVG VLLVRKGTRW LNPFPGDDRI DERADGFENV
PAALAAAAAL RAVVAERATV NPRQHDLVDR IRAAAAKIPD VEVVGDPVDR LPHLVTFSCL
YVDGEALVTE LDRRGYGAAS GSACTSSTLT PSRVLEAMGV LTHGNLRVSL TRDTTEQDVE
GFLEVLPQVV RDIRAEAGL
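
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any length or composition check. The following is a minimal sketch of that clean-up together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example input: the first two ten-base blocks of the gene sequence above.
blocks = "GTGACCAGCG CAGACCCGGC"
seq = clean_sequence(blocks)
print(len(seq), gc_fraction(seq))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared against the GC content field reported in the record.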