Gene Noca_3807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3807 
Symbol 
ID4599030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4022533 
End bp4023732 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content74% 
IMG OID639778415 
Productcysteine desulfurase family protein 
Protein accessionYP_924994 
Protein GI119718029 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTCG ACGTCGACCG CATCCGCAAA GACTTCCCGG CCCTCGACTC CGGCACCGCC 
TACTTCGACG GGCCGGGCGG CAGCCAGGTG CCGCGGCAGG TGGCCGAGGC GGTGGCGGGG
ACGATGACGT CCGGGATCTC CAACCGTGGC CAGGTCACGG CGGCCGAGCA GCGCGCCGAG
GACGTCGTGG TCGGTGCGCG GGCGGCGGTC GCGGACCTGC TCGGCTGCGA CCCGGGCGGG
GTGGTGTTCG CGCGGTCGAT GACGCAGGCG ACGTACGACG TCTCCCGCGC GCTCGCCAAG
GAGTGGGGGC CGGGTGACGA GGTGGTGGTC ACTCGCCTGG ACCACGACGG GAACATCCGG
CCGTGGGTGC AGGCGGCGCA GGCGGCGGGC GCGACCGTGC GGTGGGCCGG GTTCGACCAG
GAGACCGGCG AGCTGGGCGT CGACGACGTC CGCGAGCAGC TGTCGGCAAG GACCAAGCTG
GTCGCGGTGA CGGGTGCGTC GAACGTCCTC GGCACCCGGC CCGACGTGCC GGCGATCGCG
GCTGCGGTGC ACGAGGTGGG TGCCCTGCTC TACGTCGACG GGGTGCACCT GACCCCGCAC
GTGCCCGTGG ACGTCGCGGC GATCGGCGCC GACTTCTACG CGTGCTCGCC GTACAAGTTC
CTGGGCCCGC ACCACGGCAT CGTGGTCGCC GCACCGGAGC TCCTGGAGCG GATCCACCCG
GACAAGCTGG TGCCGGCCGG CGACCAGGTC CCGGAGCGGT TCGAGCTCGG GACGCTGCCC
TACGAGCTGC TCGCCGGCAC CACGGCTGCA GTCGACTACC TCGCGGGCCT CGCCTCGGAC
GCGGCGGACC GGCGGACCCG GGTGCTGGAG TCGATGCGGG CAGTGGAGCA GCACGAGGAG
GCCCTGTTCG CGAGGCTGTT GGACGGGCTG CGCGGGATCG ACGCCGTCAC GCTGTACGGC
GACCCGGAGC GGCGCACCCC GACCGCGTTC TTCTCCGTCG CCGGCCGGGC GGACCAGGAG
GTCTACGAGC GCTTGGCCGC CGCGGGGGTG AACGCTCCGG CGAGCAGCTT CTACGCGATC
GAGGCATCGC GGTGGATCGG CCTCGGCGAC ACCGGCGCGG TGCGGGCCGG GCTCGCGCCG
TACAGCAGCG CCGACGACGT CGAGCGGCTG CTCGCGGGGG TCGCCGAGAT CGCCGGGTGA
 
Protein sequence
MTFDVDRIRK DFPALDSGTA YFDGPGGSQV PRQVAEAVAG TMTSGISNRG QVTAAEQRAE 
DVVVGARAAV ADLLGCDPGG VVFARSMTQA TYDVSRALAK EWGPGDEVVV TRLDHDGNIR
PWVQAAQAAG ATVRWAGFDQ ETGELGVDDV REQLSARTKL VAVTGASNVL GTRPDVPAIA
AAVHEVGALL YVDGVHLTPH VPVDVAAIGA DFYACSPYKF LGPHHGIVVA APELLERIHP
DKLVPAGDQV PERFELGTLP YELLAGTTAA VDYLAGLASD AADRRTRVLE SMRAVEQHEE
ALFARLLDGL RGIDAVTLYG DPERRTPTAF FSVAGRADQE VYERLAAAGV NAPASSFYAI
EASRWIGLGD TGAVRAGLAP YSSADDVERL LAGVAEIAG