Gene Arth_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1235 
Symbol 
ID4446264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1358585 
End bp1359916 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content67% 
IMG OID639689043 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_830729 
Protein GI116669796 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTCATGG TATCCACTCC CGCTACCGCA GACACCCAGG TCGCGCTAAG TGATGTTGAA 
GTCCTGCGTA TCCGCAACGA CTTTCCCGTC CTGGACCAGG AAGTCAACGG CAGGCCCCTT
GTTTACCTGG ATTCCGGCGC CACTTCTCAG AACCCCCGCA GCGTCCTCGA AGCCGAGCAG
GAGTTCTACG AACTGAGGAA TGCCGCGGTG CACCGCGGTG CCCACCACCT TGCCGTCCAG
GCCACCGACG CCTTCGAGGA TGCCCGCGCC ACTGTGGCCG GTTTCGTCGG CGTGGCGGAG
GACGAACTGG TCTGGACAGC GAACGCCACC GCCGGGCTCA ATCTCCTCGC GTACGCCTTC
TCGAACGCCA GTGTCGGCAC CGTCCGGGGC GAGGCCGGCC GCTTCGCCCT CGGTCCGGGG
GACGAGATTG TGGTGACGGA GATGGAACAC CACGCGAACC TGATTCCCTG GCAGGAGCTC
TGCCGGCGCA CCGGTGCCAC CTTGAAATTC ATCCCCATCG ACGACGACGG CGCGTTGCGG
CTTGAGGAGG CGGCGCGGCT TATCACCGGG CGTACCAAGG TCCTGGCGTT CACCCATGCG
TCAAATGTGC TTGGAACCAT CAATCCGGTG CCCGAGCTCG TGCGGCTGGC CCGGGCGGCA
GGGGCCCTCG TTGTGCTGGA CGCCTGCCAG TCGGCGCCGC ACCTGCCCTT GGACTTCAAG
GCCCTGGACG TGGACTTCGC GGTATTCTCC GGCCACAAGA TGCTGGCGCC CACGGGGATC
GGCGGCGTGT ACGGGCGGCG CGAATTGTTG AATGCCATGC CTCCGTTCCT GACCGGGGGT
TCCATGATCA CGACCGTGAC GATGGAAAAG GCCGAGTACC TTCCGGCGCC CCAGCGGTTC
GAGGCCGGCA CCCAGCCCAT CTCGCAGGCT GTGGCGCTCG CGGCGGCCGC GAACTACCTG
CGCGAAACCA GCATGGAACG AATCGCCGGC TGGGAAGCGT CCCTGGGCCA GCGGCTGGTC
ACGGGGCTGA GCGCCATTGA CGGAGTCCGG GTCGTGGGGC CCGCTGCCGG CGTAGAACGG
CTCGGCCTGG CAGCCTTCGA CGTGGCCGGC GTGCATGCGC ACGACGTCGG GCAGTACCTG
GACAGCATGG GCATCGCCGT CCGCGTGGGT CACCACTGCG CGCAACCGCT CCACCGCCGG
CTGGGCCTGA CCGCCACCAC GCGGGCGAGC ACCTATTTGT ACAACACAAC GGAAGAAGTG
GACCTGCTGA TCGAAGCTGT GGCCCAGGTC CGGCCCTACT TCGGCGTAGA AGGCACGGGG
ACATCCAAAT GA
 
Protein sequence
MVMVSTPATA DTQVALSDVE VLRIRNDFPV LDQEVNGRPL VYLDSGATSQ NPRSVLEAEQ 
EFYELRNAAV HRGAHHLAVQ ATDAFEDARA TVAGFVGVAE DELVWTANAT AGLNLLAYAF
SNASVGTVRG EAGRFALGPG DEIVVTEMEH HANLIPWQEL CRRTGATLKF IPIDDDGALR
LEEAARLITG RTKVLAFTHA SNVLGTINPV PELVRLARAA GALVVLDACQ SAPHLPLDFK
ALDVDFAVFS GHKMLAPTGI GGVYGRRELL NAMPPFLTGG SMITTVTMEK AEYLPAPQRF
EAGTQPISQA VALAAAANYL RETSMERIAG WEASLGQRLV TGLSAIDGVR VVGPAAGVER
LGLAAFDVAG VHAHDVGQYL DSMGIAVRVG HHCAQPLHRR LGLTATTRAS TYLYNTTEEV
DLLIEAVAQV RPYFGVEGTG TSK