Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_5203 |
Symbol | |
ID | 8450834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5798103 |
End bp | 5800928 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645044234 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003204458 |
Protein GI | 258655302 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACC CGGCCGGCGG GATCGCCCCG ACGGGCGCGG CGCTGCCGGA CTGGTTGCCG GACGAGGCGA CGCTGAACCG GTTGGCCGGC GAGTTCTTCG CCGCCCTGCC CGGGACGGCG CCGGCGGCCG GGTCCAGCCT GGACGCACCG GATCCGGCGT CCGGGTCGGC GCCCCGGGCC GGCACGGGCC ACACGCCGGG CGGGGTGGAG CACGCGCCCC GGGTCGACGT GTCGGCACGG TCGAACGAGA TCTCCCAGGT GCCGGGGGAC GGCGGGCCGG GTGCCGCCCA GCCGATCACC CAGCCGACCC CGCCGTCGGT GCCGCCCAGC CTGCCCGGGG TGGCGATCGG CGAGGCGCCG GCCGCGACCG GATTCGGCCC CGGCCGGTCG GTTCCCGACC TCGGAGAGGC GGCCACCGCG GTGCTGGGCG CGGCCGGGTT GGGCCTGACC GTGCCGGAAG GGCCGATCGT GCCCGGGCTG ACCGGCCTGC AGCTGGGGGC GCCGGGTGAG CTGACGCCGG TCGGTGCCGG CCTGGCCGAA CCCCGGCCCG ACGCCGGCGG ACCCGGGGTC GAGTCCGCCC TCAGCGGTCT GACCGGACCG GCCGGCTCCG CCGACGTCAC CGCGGCCGCC CCGGTCCGCG GCGGGTTCGG CCCGCCTGCG GCCGCGCCGG ATCCGGCCGG GGTGCCGTTC GCGGTGTCGG CGCTGTCCGT GCCGTTGCCG GGCGGCGAGC AGGTGCCGCC GCTGGCCGGG TCGGCCGCGC CCCTGCCCAG CGCCGGTGTC CCGGGTGTCC CGCCCAGCGA CCAGGTGGCG GCCGGTCTCG GCGCCCCCAG CGGCCCGCCC GACGTGACCG CGGCATCCGC ACTGCTCACC GGTGCCCAGC TGGGACCGCT GCCGCTGGGA CCGGCCGGTG TCCCCGATCA GCCGCCGACC GTGCCGGGAC TGCCCGGCCA GCCGGTGGAC GCGGCCGAGG CGATCAGCCA GGTCCCGGTG GCCGCGCCGT TCACCCCGGC GACCGGGCTG CACCCGCCCG CCGTTCCGGC GGGGCCTGTC CTTCCCACGG CGACCGCCGG CGCGCTGCCG GCCGATCGTT CGTCCGGGGG ATCGACTGGC GGATCGACGG GGGGATCGAC GGGGGGATCG ACTGGGGGAT CGACTGAGCC GGCGCCCGGC CTGGCCGCGC CTGGGCTGCC GTCGGTGCCG CAGCCCCCGC TGCCGGTGCC GGACCCGCCG GTGCCGGTGC CGACCCCGTC CAGCCCGTAC TACTTCCTGG CCGAGTCCTC GCCCTATCGC GCCGACGCCG ACGCCGGCCC CGGCCTGGAC TCGTTGGTCG CCGCCGCGCT GAGTCCGGTC GAAACGGACC CGGCCCAGGT GGGGGTCCCC GACCTGGACC TATCGAGTCT GGACCGACCG AACCTGGACC TGGCCGGCAC GGGCCTGGGA ACCGGGGCAC CGGCAATCCC GCTGGCCGGC CCCGCGCTGC CCACCGCCGG CTCCGTGCCA GCGTCCGCGT CGGCGCCGTC GTTCTACTTC GCTGATCGAG CCGAGCCGGC CCGGGACCGC ACGCCGCGGC CCGCCGCAGA CCTGGGCGCG GACCCGCACC CGCCGTTCGA CGTGCGGTTG GTGCGCCGGG ACTTCCCGAT CCTGGCCGAG CGGGTGAACG GGCATCAGCT GGTCTGGTTC GACAATGCCG CGACCACCCA GAAGCCGCAC GCGGTCCTGG ACCGGCTGGC CCACTTCTAC CGGCACGAGA ACTCCAACAT CCACCGGGCC GCGCACGAGC TGGCCGCCCG GTCGACCGAC GCCTACGAGG GGGCCCGCAA GACGGTCGCC CGGTTCGTGG GGGCCGAGTC GGAGAAGAAC ATCGTCTTCG TCCGGGGCGC CACTGAAGCG ATCAACCTGG TCGCCAAGAG CTGGGGCAAG GCCAATGTCC GCCGGGGCGA CGAGATCATC GTCTCGCATC TGGAGCACCA CGCGAACATC GTTCCGTGGC AGCAGCTGTG CGCGGAGACC GGCGCCAAGA TCCGGGTCAT CCCGGTCGAC GACTCCGGCC AGCTGCTGCT CGGCGAGCTG TCCCGGCTGC TCAACGAGAA GACCAAACTG GTCTCGGTCA CCCAGGTCTC CAACGCGTTG GGCACGGTCA CGCCGGTCGA TTCCGTGGTC GAGCTGGCCC ACCGGGCCGG CGCCTGCGTG CTGATCGACG GCGCCCAGTC GGTGCCGCAC GTGCGGGTGA ACATGCAGAC CCTGGGTCCG GACTTCTTCG TCTTCTCCGG CCACAAGATC TACGGGCCGA CCGGAATCGG CGTGCTCTAC GGCCGCACCG AGGTGCTCGA ATCCATGCCG CCGTGGGAGG GCGGCGGCAA CATGATCGCC GACGTGACGT TCGAGAAGAC GGTGTTCCAG CACCCGCCCA ACCGGTTCGA GGCCGGCACC GGCAACATCG CCGACGCGGT CGGGCTGGGC GCCGCCCTGG ACTACGTCAC CCGGATCGGC CTGGACACCA TCGCCCGGTA CGAGCACCAG CTGCTGGAGT ACGCGACCCC ACGGATGCTC GCCGTGCCCG GGCTGCGGTT GATCGGCACG GCCCGGGATA AGGCCAGCGT GCTCTCGTTC GTGCTCGACG GGTACCGCAC CGAGGAGGTC GGCGCCGCCC TCAACCAGAA GGGGATCGCG GTCCGCTCCG GCCACCATTG CGCGCAGCCG ATCCTGCGCC GCTTCGGCCT GGAGGCCACC GTCCGGCCCT CGATCGCCTT CTACAACACC ACCGGGGAGA TCGACCGGAT GGTCGCGGTG CTGCACGAGC TGGCCGCCGA TCGCGGTCGC CGCTGA
|
Protein sequence | MTDPAGGIAP TGAALPDWLP DEATLNRLAG EFFAALPGTA PAAGSSLDAP DPASGSAPRA GTGHTPGGVE HAPRVDVSAR SNEISQVPGD GGPGAAQPIT QPTPPSVPPS LPGVAIGEAP AATGFGPGRS VPDLGEAATA VLGAAGLGLT VPEGPIVPGL TGLQLGAPGE LTPVGAGLAE PRPDAGGPGV ESALSGLTGP AGSADVTAAA PVRGGFGPPA AAPDPAGVPF AVSALSVPLP GGEQVPPLAG SAAPLPSAGV PGVPPSDQVA AGLGAPSGPP DVTAASALLT GAQLGPLPLG PAGVPDQPPT VPGLPGQPVD AAEAISQVPV AAPFTPATGL HPPAVPAGPV LPTATAGALP ADRSSGGSTG GSTGGSTGGS TGGSTEPAPG LAAPGLPSVP QPPLPVPDPP VPVPTPSSPY YFLAESSPYR ADADAGPGLD SLVAAALSPV ETDPAQVGVP DLDLSSLDRP NLDLAGTGLG TGAPAIPLAG PALPTAGSVP ASASAPSFYF ADRAEPARDR TPRPAADLGA DPHPPFDVRL VRRDFPILAE RVNGHQLVWF DNAATTQKPH AVLDRLAHFY RHENSNIHRA AHELAARSTD AYEGARKTVA RFVGAESEKN IVFVRGATEA INLVAKSWGK ANVRRGDEII VSHLEHHANI VPWQQLCAET GAKIRVIPVD DSGQLLLGEL SRLLNEKTKL VSVTQVSNAL GTVTPVDSVV ELAHRAGACV LIDGAQSVPH VRVNMQTLGP DFFVFSGHKI YGPTGIGVLY GRTEVLESMP PWEGGGNMIA DVTFEKTVFQ HPPNRFEAGT GNIADAVGLG AALDYVTRIG LDTIARYEHQ LLEYATPRML AVPGLRLIGT ARDKASVLSF VLDGYRTEEV GAALNQKGIA VRSGHHCAQP ILRRFGLEAT VRPSIAFYNT TGEIDRMVAV LHELAADRGR R
|
| |