Gene Sare_3306 details

Gene Information

Locus tag: Sare_3306
Symbol:
ID: 5703660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3816455
End bp: 3817759
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 69%
IMG OID: 641272733
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_001538100
Protein GI: 159038847
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
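
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based inclusive coordinates (the record does not state this) and that the CDS ends in a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3816455
END_BP = 3817759
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length (1305 bp) and protein length (434 aa) agree with the coordinates.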


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000348872
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCACCA TCGCGATTCC ACCGGGCATG CCGCAGTACG GCGACGTGCC GCGCTACGAC 
GTGGCGGCGG TGCGCGCCGA CTTCCCGATC CTGGACCGGA CGGTTAACGG GCACCCGCTG
GTCTACCTGG ACAGCGCGAA CACGTCGCAC AAGCCACGGC AGGTGCTCGA CGTACTGCGG
GAGCACTACG AGCGGCACAA CGCCAACGTG TCGCGTTCGG TTCACACGCT GGGCACCGAG
GCCACCGAGG CGTACGAGGG GGCGCGGGCC AAGGTCGCCG CCTTCATCAA CGCTCCGAAC
CCGGACGAGG TGGTGTTCAC CAAGAACTCC ACCGAGGCGA TCAACATCGT GGCGTACGCC
TTCTCGAACG CGTCGCTGCG CCCCGACGCC GACCCGCGGT TCCGGTTGGG CCCCGGAGAC
GAGGTGGTGA TCTCCGAGAT GGAGCACCAC TCGAACATCG TCCCGTGGCA GCTGATCTGC
GAGCGTACCG GCGCCACGTT GCGCTGGTTC CCGGTCACCG ACCACGGTCG ACTCGACGAG
TCGGGTCTGG CGGACCTGGT CACCGAGCGG ACGAAGATCG TCTCACTGGT GCACATGTCC
AACATCCTCG GCACGGTCAA CGCCACGTCC CGGATCACCC AGCGGGTCCG TGAGGTCGGC
GCACTGCTGC TGCTCGACTG TTCGCAGTCG GTGCCGCACA TGCCGATGGA CGTGGTCGAC
TACGACGCGG ACTTCATCGT CTTCACCGGG CACAAGATGT GTGGCCCGAC CGGTATCGGA
GTGCTCTGGG GCCGGTCCGA GCTGCTCGCG GCGATGCCGC CGGTGCTCGG CGGCGGGTCG
ATGATCGAGA CGGTGGCGAT GTCGGGGTCG ACCTTCGCCG CGCCGCCGGC CCGGTTCGAG
GCGGGCACCC CACCGATCGC CGAGGCGGTC GCGCTGGGCG CGGCGGTGGA CTACCTGTCC
GGCGTCGGCA TGCGGGCCAT CCAGTGGCAC GAGAAGCATC TCACGGCGTA CGCCCTGGAC
GCTCTGGCGA CGGTGCCCGG GTTACGGGTC TTCGGGCCGA CCGTGCCGGT GGGTCGGGGT
GGCACGATCT CGTTCGCGCT GGGCGACATC CACCCGCACG ACGTTGGGCA GGTGCTCGAC
TCGCTGGGTG TGCAGGTGCG GGTCGGTCAC CACTGTGCCC GTCCGGTCTG CACCCGGTTC
GGCGTGCCCG CGATGACCCG GGCCTCGTTC TACCTCTACA CCACCACGGA GGAGATCGAC
GCCTTGGTGG CGGGTCTGGA GCGGGTGCGG AAGGTGTTCG ACTGA
 
Protein sequence
MTTIAIPPGM PQYGDVPRYD VAAVRADFPI LDRTVNGHPL VYLDSANTSH KPRQVLDVLR 
EHYERHNANV SRSVHTLGTE ATEAYEGARA KVAAFINAPN PDEVVFTKNS TEAINIVAYA
FSNASLRPDA DPRFRLGPGD EVVISEMEHH SNIVPWQLIC ERTGATLRWF PVTDHGRLDE
SGLADLVTER TKIVSLVHMS NILGTVNATS RITQRVREVG ALLLLDCSQS VPHMPMDVVD
YDADFIVFTG HKMCGPTGIG VLWGRSELLA AMPPVLGGGS MIETVAMSGS TFAAPPARFE
AGTPPIAEAV ALGAAVDYLS GVGMRAIQWH EKHLTAYALD ALATVPGLRV FGPTVPVGRG
GTISFALGDI HPHDVGQVLD SLGVQVRVGH HCARPVCTRF GVPAMTRASF YLYTTTEEID
ALVAGLERVR KVFD
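
The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a short script. This is a sketch under stated assumptions: the codon assignments are those of NCBI translation table 11 (identical to the standard code for internal codons), alternative start codons are not handled (this gene begins with ATG, so that does not matter here), and the record does not say how its GC percentage was computed or rounded. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); assumed to match table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, dropping a terminal stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCACCA TCGCGATTCC ACCGGGCATG"))  # MTTIAIPPGM
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing spaces) gives a quick consistency check. `gc_percent` on the same input can be compared with the GC content field.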