Gene Sare_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2122 
Symbol 
ID5704748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2444790 
End bp2446097 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content70% 
IMG OID641271607 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001536978 
Protein GI159037725 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0104519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCGGA CCGCACCGAC CCAGGAGGGT GGCTCGTTGA CCCGGCCCGA CCCCGCGCCA 
CCGGGGTCCG AGATCGGCCG GCTCCGGGCC GACTTTCCGA TCTTCGGTAG GCGGGTGCAC
GGTCACCCGC TGGTGTATTT GGACTCGGCC AGTACCTCCC AGATACCGCT ACCGGTACTC
GACCGGATGC GGCGGCACGA ACAGTGGCAC AACGGGAACG TGGGCCGTGC CGTCCACACC
CTGGGTAGCG AGGCCACCGA GGCGTACGAG GAGGCCCGCG CCAAGCTCGC CGCGTTCATC
GACGCCCGGT CGCCCGATGA GATCGTGTTC ACCCGTAACA CCACCGAGGC GATCAACCTG
GTGGCACACG CGTTCGGCGG CGCGGGCGGC GGTGACCAGC GGTTCCGCCT GGGCCCCGGC
GACGAGATAG TGGTCTCGGA GATGGAGCAT CACTCCAACC TCGTGCCGTG GCAGTTGCTG
TGTCAGCGCA CCGGCGCCAC GCTCCGCTGG ATCGGACTCA CCGACGACGG CCGCCTGGAC
CTGTCCGGCC TCGATGAGCT GATCAACGAG CGCACCCGAC TGGTGTCGTA CGTGCACGTC
TCGAACATCC TCGGCACGGT CAACCCCACC CGGCCGATCG TCGACCGCGC TCGTGCCGTC
GGTGCGATCA CCATGTTGGA TGCCTCCCAG TCCGTGCCGC ACATGCCCGT TGACGTCGCC
GCCCTGGATG TCGACTTCGT GGCCTTCACC GGGCACAAGA TGTGCGGCCC GACTGGCATC
GGCGCACTGT GGGGCCGGGC TGACCTACTG GAGGTGATGC CGCCGTTCCT GGCCGGAGGC
GGCATGGTCG GGACGGTGTC GATGGAGGGC ACGGCCTTCG TGCCGCCGCC GGCCCGGTTC
GAGGCGGGCA CACCGGCGAT CACCCCGGCG GTCGGGCTCG GCGCCGCGGT GGACTACCTG
TCAGCGGTGG GTATGGCCGC GGTGCACCGC CACGAACAGC AGCTCACGGC GTATGCGCTG
GCCGCGCTCG CCGAGGTGCC GGGGCTGCGG GTATTCGGTC CGACCGATCC GGCGCACCGC
GGCGGTACGA TCTCCTTCGC CGTGCAGGGG GTGGACCCGA CCGTTGTCGG GCGACAGCTC
GACGCGGTCG GGGTGCAGGT GCGCGTCGGC CGGCACTGCG CTGGGCCGGT GTGTGCCCGG
TACGGCGTAC CAGCCATGGC CCGGGCCTCC TTCTACCTGT ACACGACGAC GGACGACGTC
GACGCGCTGG TCACGGCCCT CGCGGACATC CGCCGGCGGT TCGGGTAG
 
Protein sequence
MNRTAPTQEG GSLTRPDPAP PGSEIGRLRA DFPIFGRRVH GHPLVYLDSA STSQIPLPVL 
DRMRRHEQWH NGNVGRAVHT LGSEATEAYE EARAKLAAFI DARSPDEIVF TRNTTEAINL
VAHAFGGAGG GDQRFRLGPG DEIVVSEMEH HSNLVPWQLL CQRTGATLRW IGLTDDGRLD
LSGLDELINE RTRLVSYVHV SNILGTVNPT RPIVDRARAV GAITMLDASQ SVPHMPVDVA
ALDVDFVAFT GHKMCGPTGI GALWGRADLL EVMPPFLAGG GMVGTVSMEG TAFVPPPARF
EAGTPAITPA VGLGAAVDYL SAVGMAAVHR HEQQLTAYAL AALAEVPGLR VFGPTDPAHR
GGTISFAVQG VDPTVVGRQL DAVGVQVRVG RHCAGPVCAR YGVPAMARAS FYLYTTTDDV
DALVTALADI RRRFG