Gene Sare_1756 details


Gene Information

Locus tag: Sare_1756
Symbol:
ID: 5705083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2028746
End bp: 2029957
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 74%
IMG OID: 641271259
Product: cysteine desulfurase family protein
Protein accession: YP_001536634
Protein GI: 159037381
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily
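
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both points are assumptions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Start bp and End bp as listed in this record.
length = gene_length(2028746, 2029957)
print(length, protein_length(length))
```

Under these assumptions the result is 1212 bp and 403 aa, which agrees with the Gene Length and Protein Length fields.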


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.115577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0371464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTCG ACATCGCCCG CGCCCGGGCC GCCTATCCCG CCCTGGCCGA GGGACACGTC 
CACTTCGACG GTGCCGGTGG CACCCAGACC GCCGCGCCGG TGATCGCCGC GGTGGCCGAG
ACGATGGGTA CGGCGCTCGG CAACCGCAGT GGTGGCAACC TACCCGGCCG ACGCTCGGTG
GAACTGGTGT CCGCCGCCCG GACGGCCGTG GCCGACCTGC TCGGCGCGGT CCCGGAGGGG
GTGGTGCTGG GCCCGAGCGC GACCGCGCTG ACGTACACCC TGGCCCGCGC CCTCGGGGCG
ACCTGGCGGC CGGGCGACGA GGTGGTGGTG TCCCGACTCG ACCACGATGC CAACGTTCGG
CCGTGGATCC AGGCGGCCGA GGCGGCCGGC GCGACGGTAC GGTGGGCCGA GTTCGACGAG
CACACCGGCG AACTGCCCGC CGGCCAGTAC GCCGACCTGG TCAACGAGCG GACCCGGCTG
GTGGCGGTCA CGGCCGGCAG CAACGCGATC GGCACGATCC CGGACGTGGC GGCGATCGCC
AAGTCGGCTC ACGCCGCCGG CGCGTTGGTC TGCGTGGACG GCGTGCACTC GGTACCGCAC
GGTCCGACCG ACCTCACCGC GCTGGGAGCG GACTTCCTCG TCACCAGTGC CTACAAGTGG
TCCGGCCCGC ACCTGGCCGC GGTAGCAGCG GACCCGACGT GCTGGCAGCA CCTGCACCCG
GCGAAGCTGC GCCCCTCCGC CGACACGGTG CCCGACCGGT TCGAGTACGG CACGCCCAGC
TTTCCCCTGT TGGCCGGGGT GGCCGTCGCC GTGGACCACC TCGCCGGGCT GGACCCGACG
GCTACCGGAA CCCGGCGGGA GCGGCTGCGG ACCAGTCTGA GCGCAGTCCG CACGTACGAG
GAGGGACTGT TGGACCGGCT GCTCGACGGT CTCGCCGCGG TGTCCGGGGT CACCGTGCTC
GGCTCACCGG GCCGGCGCTG CCCCACGGTC TCGTTCCGGT TGGCCGGCCG GTCTCCGGCC
GACACCCAGG CGGCGCTGGG CGCGGCGGGG GTCTGCCTGT CCGCCGGCGA CTACTACGCC
TACGAGTACT TCCAGACGTT GGGACTGCGG GACAGCGGCG GGGCGGTGCG GGTCAGCCTG
TACCACTACA ACACCGTCGC CGAGGTGGAT CGCCTGCTCA ACGAGTTGGC GACCCTGACC
ACCGGCGGCT GA
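
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation such as GC content. The following is a minimal sketch, assuming GC content is simply the fraction of G and C bases. The helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence; paste the full block for the whole gene.
first_line = "ATGCCGTTCG ACATCGCCCG CGCCCGGGCC GCCTATCCCG CCCTGGCCGA GGGACACGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100), "% GC")
```

Running the same helpers on the complete sequence block gives a value that can be compared with the GC content field of the record.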
 
Protein sequence
MPFDIARARA AYPALAEGHV HFDGAGGTQT AAPVIAAVAE TMGTALGNRS GGNLPGRRSV 
ELVSAARTAV ADLLGAVPEG VVLGPSATAL TYTLARALGA TWRPGDEVVV SRLDHDANVR
PWIQAAEAAG ATVRWAEFDE HTGELPAGQY ADLVNERTRL VAVTAGSNAI GTIPDVAAIA
KSAHAAGALV CVDGVHSVPH GPTDLTALGA DFLVTSAYKW SGPHLAAVAA DPTCWQHLHP
AKLRPSADTV PDRFEYGTPS FPLLAGVAVA VDHLAGLDPT ATGTRRERLR TSLSAVRTYE
EGLLDRLLDG LAAVSGVTVL GSPGRRCPTV SFRLAGRSPA DTQAALGAAG VCLSAGDYYA
YEYFQTLGLR DSGGAVRVSL YHYNTVAEVD RLLNELATLT TGG
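
The protein sequence can be related to the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which to my understanding translation table 11 shares with the standard code (the two differ mainly in permitted start codons). The `translate` function and the table constants are hypothetical names, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCCGTTCG ACATCGCCCG CGCCCGGGCC"))
print(translate("ACCGGCGGCT GA"))
```

The two fragments translate to MPFDIARARA and TGG, matching the first ten and last three residues of the protein sequence listed here.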