Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3807 |
Symbol | |
ID | 4599030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4022533 |
End bp | 4023732 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778415 |
Product | cysteine desulfurase family protein |
Protein accession | YP_924994 |
Protein GI | 119718029 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTCG ACGTCGACCG CATCCGCAAA GACTTCCCGG CCCTCGACTC CGGCACCGCC TACTTCGACG GGCCGGGCGG CAGCCAGGTG CCGCGGCAGG TGGCCGAGGC GGTGGCGGGG ACGATGACGT CCGGGATCTC CAACCGTGGC CAGGTCACGG CGGCCGAGCA GCGCGCCGAG GACGTCGTGG TCGGTGCGCG GGCGGCGGTC GCGGACCTGC TCGGCTGCGA CCCGGGCGGG GTGGTGTTCG CGCGGTCGAT GACGCAGGCG ACGTACGACG TCTCCCGCGC GCTCGCCAAG GAGTGGGGGC CGGGTGACGA GGTGGTGGTC ACTCGCCTGG ACCACGACGG GAACATCCGG CCGTGGGTGC AGGCGGCGCA GGCGGCGGGC GCGACCGTGC GGTGGGCCGG GTTCGACCAG GAGACCGGCG AGCTGGGCGT CGACGACGTC CGCGAGCAGC TGTCGGCAAG GACCAAGCTG GTCGCGGTGA CGGGTGCGTC GAACGTCCTC GGCACCCGGC CCGACGTGCC GGCGATCGCG GCTGCGGTGC ACGAGGTGGG TGCCCTGCTC TACGTCGACG GGGTGCACCT GACCCCGCAC GTGCCCGTGG ACGTCGCGGC GATCGGCGCC GACTTCTACG CGTGCTCGCC GTACAAGTTC CTGGGCCCGC ACCACGGCAT CGTGGTCGCC GCACCGGAGC TCCTGGAGCG GATCCACCCG GACAAGCTGG TGCCGGCCGG CGACCAGGTC CCGGAGCGGT TCGAGCTCGG GACGCTGCCC TACGAGCTGC TCGCCGGCAC CACGGCTGCA GTCGACTACC TCGCGGGCCT CGCCTCGGAC GCGGCGGACC GGCGGACCCG GGTGCTGGAG TCGATGCGGG CAGTGGAGCA GCACGAGGAG GCCCTGTTCG CGAGGCTGTT GGACGGGCTG CGCGGGATCG ACGCCGTCAC GCTGTACGGC GACCCGGAGC GGCGCACCCC GACCGCGTTC TTCTCCGTCG CCGGCCGGGC GGACCAGGAG GTCTACGAGC GCTTGGCCGC CGCGGGGGTG AACGCTCCGG CGAGCAGCTT CTACGCGATC GAGGCATCGC GGTGGATCGG CCTCGGCGAC ACCGGCGCGG TGCGGGCCGG GCTCGCGCCG TACAGCAGCG CCGACGACGT CGAGCGGCTG CTCGCGGGGG TCGCCGAGAT CGCCGGGTGA
|
Protein sequence | MTFDVDRIRK DFPALDSGTA YFDGPGGSQV PRQVAEAVAG TMTSGISNRG QVTAAEQRAE DVVVGARAAV ADLLGCDPGG VVFARSMTQA TYDVSRALAK EWGPGDEVVV TRLDHDGNIR PWVQAAQAAG ATVRWAGFDQ ETGELGVDDV REQLSARTKL VAVTGASNVL GTRPDVPAIA AAVHEVGALL YVDGVHLTPH VPVDVAAIGA DFYACSPYKF LGPHHGIVVA APELLERIHP DKLVPAGDQV PERFELGTLP YELLAGTTAA VDYLAGLASD AADRRTRVLE SMRAVEQHEE ALFARLLDGL RGIDAVTLYG DPERRTPTAF FSVAGRADQE VYERLAAAGV NAPASSFYAI EASRWIGLGD TGAVRAGLAP YSSADDVERL LAGVAEIAG
|
| |