Gene Namu_5203 details


Gene Information

Locus tag: Namu_5203
Symbol:
ID: 8450834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5798103
End bp: 5800928
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 75%
IMG OID: 645044234
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003204458
Protein GI: 258655302
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
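
The coordinate and length fields in this record agree with one another, which can be checked with a few lines of Python. This is only a minimal sketch: it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5798103
END_BP = 5800928
GENE_LENGTH_BP = 2826
PROTEIN_LENGTH_AA = 941


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```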


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACC CGGCCGGCGG GATCGCCCCG ACGGGCGCGG CGCTGCCGGA CTGGTTGCCG 
GACGAGGCGA CGCTGAACCG GTTGGCCGGC GAGTTCTTCG CCGCCCTGCC CGGGACGGCG
CCGGCGGCCG GGTCCAGCCT GGACGCACCG GATCCGGCGT CCGGGTCGGC GCCCCGGGCC
GGCACGGGCC ACACGCCGGG CGGGGTGGAG CACGCGCCCC GGGTCGACGT GTCGGCACGG
TCGAACGAGA TCTCCCAGGT GCCGGGGGAC GGCGGGCCGG GTGCCGCCCA GCCGATCACC
CAGCCGACCC CGCCGTCGGT GCCGCCCAGC CTGCCCGGGG TGGCGATCGG CGAGGCGCCG
GCCGCGACCG GATTCGGCCC CGGCCGGTCG GTTCCCGACC TCGGAGAGGC GGCCACCGCG
GTGCTGGGCG CGGCCGGGTT GGGCCTGACC GTGCCGGAAG GGCCGATCGT GCCCGGGCTG
ACCGGCCTGC AGCTGGGGGC GCCGGGTGAG CTGACGCCGG TCGGTGCCGG CCTGGCCGAA
CCCCGGCCCG ACGCCGGCGG ACCCGGGGTC GAGTCCGCCC TCAGCGGTCT GACCGGACCG
GCCGGCTCCG CCGACGTCAC CGCGGCCGCC CCGGTCCGCG GCGGGTTCGG CCCGCCTGCG
GCCGCGCCGG ATCCGGCCGG GGTGCCGTTC GCGGTGTCGG CGCTGTCCGT GCCGTTGCCG
GGCGGCGAGC AGGTGCCGCC GCTGGCCGGG TCGGCCGCGC CCCTGCCCAG CGCCGGTGTC
CCGGGTGTCC CGCCCAGCGA CCAGGTGGCG GCCGGTCTCG GCGCCCCCAG CGGCCCGCCC
GACGTGACCG CGGCATCCGC ACTGCTCACC GGTGCCCAGC TGGGACCGCT GCCGCTGGGA
CCGGCCGGTG TCCCCGATCA GCCGCCGACC GTGCCGGGAC TGCCCGGCCA GCCGGTGGAC
GCGGCCGAGG CGATCAGCCA GGTCCCGGTG GCCGCGCCGT TCACCCCGGC GACCGGGCTG
CACCCGCCCG CCGTTCCGGC GGGGCCTGTC CTTCCCACGG CGACCGCCGG CGCGCTGCCG
GCCGATCGTT CGTCCGGGGG ATCGACTGGC GGATCGACGG GGGGATCGAC GGGGGGATCG
ACTGGGGGAT CGACTGAGCC GGCGCCCGGC CTGGCCGCGC CTGGGCTGCC GTCGGTGCCG
CAGCCCCCGC TGCCGGTGCC GGACCCGCCG GTGCCGGTGC CGACCCCGTC CAGCCCGTAC
TACTTCCTGG CCGAGTCCTC GCCCTATCGC GCCGACGCCG ACGCCGGCCC CGGCCTGGAC
TCGTTGGTCG CCGCCGCGCT GAGTCCGGTC GAAACGGACC CGGCCCAGGT GGGGGTCCCC
GACCTGGACC TATCGAGTCT GGACCGACCG AACCTGGACC TGGCCGGCAC GGGCCTGGGA
ACCGGGGCAC CGGCAATCCC GCTGGCCGGC CCCGCGCTGC CCACCGCCGG CTCCGTGCCA
GCGTCCGCGT CGGCGCCGTC GTTCTACTTC GCTGATCGAG CCGAGCCGGC CCGGGACCGC
ACGCCGCGGC CCGCCGCAGA CCTGGGCGCG GACCCGCACC CGCCGTTCGA CGTGCGGTTG
GTGCGCCGGG ACTTCCCGAT CCTGGCCGAG CGGGTGAACG GGCATCAGCT GGTCTGGTTC
GACAATGCCG CGACCACCCA GAAGCCGCAC GCGGTCCTGG ACCGGCTGGC CCACTTCTAC
CGGCACGAGA ACTCCAACAT CCACCGGGCC GCGCACGAGC TGGCCGCCCG GTCGACCGAC
GCCTACGAGG GGGCCCGCAA GACGGTCGCC CGGTTCGTGG GGGCCGAGTC GGAGAAGAAC
ATCGTCTTCG TCCGGGGCGC CACTGAAGCG ATCAACCTGG TCGCCAAGAG CTGGGGCAAG
GCCAATGTCC GCCGGGGCGA CGAGATCATC GTCTCGCATC TGGAGCACCA CGCGAACATC
GTTCCGTGGC AGCAGCTGTG CGCGGAGACC GGCGCCAAGA TCCGGGTCAT CCCGGTCGAC
GACTCCGGCC AGCTGCTGCT CGGCGAGCTG TCCCGGCTGC TCAACGAGAA GACCAAACTG
GTCTCGGTCA CCCAGGTCTC CAACGCGTTG GGCACGGTCA CGCCGGTCGA TTCCGTGGTC
GAGCTGGCCC ACCGGGCCGG CGCCTGCGTG CTGATCGACG GCGCCCAGTC GGTGCCGCAC
GTGCGGGTGA ACATGCAGAC CCTGGGTCCG GACTTCTTCG TCTTCTCCGG CCACAAGATC
TACGGGCCGA CCGGAATCGG CGTGCTCTAC GGCCGCACCG AGGTGCTCGA ATCCATGCCG
CCGTGGGAGG GCGGCGGCAA CATGATCGCC GACGTGACGT TCGAGAAGAC GGTGTTCCAG
CACCCGCCCA ACCGGTTCGA GGCCGGCACC GGCAACATCG CCGACGCGGT CGGGCTGGGC
GCCGCCCTGG ACTACGTCAC CCGGATCGGC CTGGACACCA TCGCCCGGTA CGAGCACCAG
CTGCTGGAGT ACGCGACCCC ACGGATGCTC GCCGTGCCCG GGCTGCGGTT GATCGGCACG
GCCCGGGATA AGGCCAGCGT GCTCTCGTTC GTGCTCGACG GGTACCGCAC CGAGGAGGTC
GGCGCCGCCC TCAACCAGAA GGGGATCGCG GTCCGCTCCG GCCACCATTG CGCGCAGCCG
ATCCTGCGCC GCTTCGGCCT GGAGGCCACC GTCCGGCCCT CGATCGCCTT CTACAACACC
ACCGGGGAGA TCGACCGGAT GGTCGCGGTG CTGCACGAGC TGGCCGCCGA TCGCGGTCGC
CGCTGA
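
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as the GC content field can be recomputed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short string in the example is just the first three blocks of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Example on the first three blocks of the gene sequence only.
fragment = clean_sequence("GTGACCGACC CGGCCGGCGG GATCGCCCCG")
print(len(fragment), gc_fraction(fragment))
```

Pasting the full gene sequence into `clean_sequence` should give a 2826-base string, whose GC fraction can then be compared with the reported GC content.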
 
Protein sequence
MTDPAGGIAP TGAALPDWLP DEATLNRLAG EFFAALPGTA PAAGSSLDAP DPASGSAPRA 
GTGHTPGGVE HAPRVDVSAR SNEISQVPGD GGPGAAQPIT QPTPPSVPPS LPGVAIGEAP
AATGFGPGRS VPDLGEAATA VLGAAGLGLT VPEGPIVPGL TGLQLGAPGE LTPVGAGLAE
PRPDAGGPGV ESALSGLTGP AGSADVTAAA PVRGGFGPPA AAPDPAGVPF AVSALSVPLP
GGEQVPPLAG SAAPLPSAGV PGVPPSDQVA AGLGAPSGPP DVTAASALLT GAQLGPLPLG
PAGVPDQPPT VPGLPGQPVD AAEAISQVPV AAPFTPATGL HPPAVPAGPV LPTATAGALP
ADRSSGGSTG GSTGGSTGGS TGGSTEPAPG LAAPGLPSVP QPPLPVPDPP VPVPTPSSPY
YFLAESSPYR ADADAGPGLD SLVAAALSPV ETDPAQVGVP DLDLSSLDRP NLDLAGTGLG
TGAPAIPLAG PALPTAGSVP ASASAPSFYF ADRAEPARDR TPRPAADLGA DPHPPFDVRL
VRRDFPILAE RVNGHQLVWF DNAATTQKPH AVLDRLAHFY RHENSNIHRA AHELAARSTD
AYEGARKTVA RFVGAESEKN IVFVRGATEA INLVAKSWGK ANVRRGDEII VSHLEHHANI
VPWQQLCAET GAKIRVIPVD DSGQLLLGEL SRLLNEKTKL VSVTQVSNAL GTVTPVDSVV
ELAHRAGACV LIDGAQSVPH VRVNMQTLGP DFFVFSGHKI YGPTGIGVLY GRTEVLESMP
PWEGGGNMIA DVTFEKTVFQ HPPNRFEAGT GNIADAVGLG AALDYVTRIG LDTIARYEHQ
LLEYATPRML AVPGLRLIGT ARDKASVLSF VLDGYRTEEV GAALNQKGIA VRSGHHCAQP
ILRRFGLEAT VRPSIAFYNT TGEIDRMVAV LHELAADRGR R
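
The gene starts with GTG while the protein starts with M, which is consistent with translation table 11: GTG can act as an initiation codon and is then read as methionine. The following sketch translates a block-formatted CDS under that rule. It assumes the standard amino-acid assignments (which table 11 shares with table 1) and, as a simplification, treats only ATG, GTG and TTG as initiators. The name `translate_cds` is illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified set of initiation codons for table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at '*'."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("GTGACCGACC CGGCCGGCGG GATCGCCCCG"))  # MTDPAGGIAP
```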