Gene Sare_3533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3533 
Symbol 
ID5704601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4074175 
End bp4075326 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content75% 
IMG OID641272960 
Productcysteine desulfurase 
Protein accessionYP_001538326 
Protein GI159039073 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000228802 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCACCT CTCCGGTCTA CCTGGACGCC GCCTCCGCCG CACCGCTGCA CCCCGTCGCA 
CGGCAGGCGC TGCTGGCCGC GCTCGACGAC GGCTGGGCCG ACCCGCAACG GCTCTACACC
CAGGCCCGCC GCGCACGGCA ACTCCTCGAC GCGGCGCGGG AGGCCGCCGC GGCCACGCTC
GGCGTACGGC CGGACGAACT CTCCTTCGCC CCGAGCGGTA CTGCTGCCGC GCATTCGGCC
GTCCTCGGCG GGCTCACCGG ACGTCGCCGG GTCGGTTCCG GCCTCGTGCA CTCGGCGATC
GAGCACTCAG CGGTACTACA TGCCGCCGAT CGGCATGCGG CCGGCGGCGG CGCGGTGACG
TCGGTCCCGG TGGATCGGAT CGGCCGCATC GACCCGGATA CCTGGTCCGC GGCGGTTCGG
GCGCCCGGCG TGGCGCTCGC CGCACTGATC GCCGCGAGTC ACGAAGTGGG CACGGTGCAG
CCCGTCGCCG CGGCGGGCGC CGCCTGCGCC GCGGCCGGGG TACCGCTCTA CGTTGACGCA
GCGCAGGTGG TCGGGCACGG GCCGGTGCCG GTCGGCTGGT CGCTGCTGAC CGCGAGTGCC
CACAAGTGGG GCGGGCCGCC GGGAGTCGGG CTGCTGGTGG TTCGCAAGGG CACCCGCTGG
GAGTCGCCGT GGCCGGTGGA CGAACGCGAG GCCGGGCGTG TCCCGGGAGT GGTGAACCTG
CCGGCGGTCG TCGCGGCGGC GGCGAGCCTG CGCGCGGCTG CCGCCGACGC GGACGCGCGG
GCGGCCCGAC TCACCCCCCT GGTGGATCGG ATCCGTACCC GGGTGGTGAC GGACGTACCG
GACGTGGAGG TGGTCGGCGA TCCCGATCAC CGGCTCCCCC ACCTGGTGAC CTTCTCCTGC
CTGTACGTCG ACGGTGAGGC GCTGCTGCAG GCGCTGGACC GGCGGGGCTT CGCCGTCTCC
TCCGGGTCGT CGTGCACGTC GTCGACGCTG CGTCCGTCGC ACGTGCTGGC GGCGATGGGG
GTGCTGTCGC ACGGCAATGT TCGGGTCTCG CTGCACCGGG ACACCACCGA GGCCGAGGTG
GAACGGTTCC TGGCCGAGTT GCCGGGGGTC GTGGCTGAGC TGCGGGCCGA GGCGGGCGTG
GTGGGGCTGT GA
 
Protein sequence
MSTSPVYLDA ASAAPLHPVA RQALLAALDD GWADPQRLYT QARRARQLLD AAREAAAATL 
GVRPDELSFA PSGTAAAHSA VLGGLTGRRR VGSGLVHSAI EHSAVLHAAD RHAAGGGAVT
SVPVDRIGRI DPDTWSAAVR APGVALAALI AASHEVGTVQ PVAAAGAACA AAGVPLYVDA
AQVVGHGPVP VGWSLLTASA HKWGGPPGVG LLVVRKGTRW ESPWPVDERE AGRVPGVVNL
PAVVAAAASL RAAAADADAR AARLTPLVDR IRTRVVTDVP DVEVVGDPDH RLPHLVTFSC
LYVDGEALLQ ALDRRGFAVS SGSSCTSSTL RPSHVLAAMG VLSHGNVRVS LHRDTTEAEV
ERFLAELPGV VAELRAEAGV VGL