Gene Sare_2654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2654 
Symbol 
ID5703575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3025945 
End bp3028290 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content73% 
IMG OID641272112 
Productradical SAM domain-containing protein 
Protein accessionYP_001537482 
Protein GI159038229 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0138301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCTCA GCCAGTTCGT TCTGAAGATT GTCAGCCGCT GCGACCTGTC CTGCGACCAC 
TGCTACGTCT ACGAGCACCC CGACCAGTCC TGGCGCCGTC AGCCGCGAAC CATGGCGCCC
GCGACGGTGA CGGTCGCCGC CCGCCGCATC GCTGAGCACG CCACCGCGCA CCAACTCGGC
ACCGTACGCG TCGTCCTGCA CGGAGGCGAG CCCCTCCTCG CGGGTGCGTC CGGGATCGAC
GCGGTCGCCG GTGAGCTCCG CCGACACCTC GATCCGGTAA CCCGGCTGGA CCTACGGATG
CAGTCCAACG GGGTCCTGCT CACCGAGGAG GTGGCCGAGG TCCTCGTCGC CCACGGCATC
ACGGTCGGTG TCTCACTCGA CGGTGATCGC GCGGCCAACG ATCTGCACCG CCGCTACGCG
AGCGGGGCGA GTAGCCACGC AAAGGTACTC CGGGCCCTCG CCCTGCTGCG TCGCCCGGAG
TTCCGGCCGA GCTACGCCGG CCTGCTCTGT ACGGTCGACC TCACCAACGA CCCGGTCACC
GTCTACCGGG CGCTACTGGC CGAAGACCCA CCGCGCATCG ACTTTCTCCT GCCGCACGCC
AACTGGGACC GTCCCCCACC ACGCCCGGCC GGTGCACGCA CCGCGTACGC GGACTGGCTG
CTCGCGGTGC ACCGGGCCTG GACGGCCGAT GGCCGTCCGG TGCCGGTCCG GTTGCTGGAC
TCGCTGCTGG CCATGGCCGA GGGCGACAGT AGCGGCACCG AGGCGGTCGG GCTCGCCGCC
GCCGACCTCG CCGTGATCGA AACGGATGGT AGCTACGAAC AGGTCGACTC GCTCAAGTCC
GCCTACGACG GTGCGCCCGC CACCGGACTG GACGTCTTCG ACCACGCGAT CGACAAGGTC
GCGGTCCATC CGCTGATAGC CATCCGACAG GCCGGACTGG CCGGGCTCTG CGGCACCTGC
CGGGCCTGTC CGGTTGTCCG GCAGTGTGGG GGTGGCCTGT TCACCCACCG CTACCGCAGC
GACACCGGCT TCGACAACCC CTCGGTCTAC TGCGCGGACC TGGCCCACCT CGTCCAGGGT
GTGACCGACC CGCCACCGAT GGCCCAGCCC CGGTCGCCGG CAGCCGGCGG TCCTGCCGAC
CTGGTCATCC CGACCCCCCG CGTCCCGGGT GGAGCGGCGG CACACCCCGC GCCCACCAGG
ACCGAGCCGG ACGCACTGGA CCCGGCGGTG CTCGACGACC TCGGCAGCGG GTACGGCAGC
ACCGCATCGG TACGCCAGCT CGCCGCCGTG CACCTGGCCA AGACCCGCGC GCTGCTGGTC
GCACTCAGCC CCGCCGTTGC CGGGCAGGCG GTCACCGCAC CCGCGTGGGA TCTGCTGGTC
GACTTCGACG TCACCGCCCC GGCGGCGGTG CGCACGGTGC TGGCGCATCC GTTCGTCCGC
CGCTGGGCGT ACCGGTGCCT GGCGGTGCCG GCGTCGGCCG ACCTCGGGCA CCTCGCCCGC
ATCGCCGCGG CGGTGGCCGT GCGGGCCGGC GCCGCCGTGG ATCTGGACGT ACCGGTTCGC
GACGGCCAGC TCAGCCTGCC CACGCTCGGC GTGCTCCGGA TGCCGGACGT GGCCGGTCCG
GTCCGGCTCG CCATCGCCGA CGGCGGCTTC CGGGTAAGCG GCGACGAGCG GAGCGGACCG
GTCGCGTCCG GCCATCGGCC GCCCGGCTGG CGACCGGCCC GCCGGGTGGA CACCTCCGGC
ACCCTGATCG AGGACACCGA CCCGTACCGG GGCAGCTACC AGGACCTGGT GGTGGCGCCA
CGCCTGTCCG CCGGCGCGGC GGGGCGTTGG GCCGCCCAGC TGGCTTCGGC CGATAAGCGC
GACGACATCG GCGGGTACGC CCCCGGCGTG CGAGGTCTGC TACGGGCGGT CGTACCGCTG
CGCCCGGACC GCCGCGGCCG GCAGCGCAGC GCCACCGCGG CGTCCGCTTT CGGCGCGGTG
GCGATGACGC CGGTACCCGA CGCCGCCGCC CTGGCCGTGC TGCTCGTCCA CGAGGTGCAA
CACCTCAAGC TCGACGGGGT GCTCGACGTG TGCGAGCTGG TCGACCGACG CGACACCCGG
TTGCTCACCG TTCCGTGGCG GGAGGACCCG CGCCCGGTGG AGGGTGTGCT GCACGGCACG
TACGCCCATC TGGCGGTCGC CGACATATGG CGACACCGGG CCGGGGCCCA GGCGACGGCG
CGGTACCGCC GGTACCGTGC CTGGACCGAC CAAGCGCTCG ACGAACTACT CGGGCTCGGT
GCCCTGACGC CGGTGGGGCA GCGGTTCGCC GGCCGGATGC GCGCCACGGT GGACAGCTGG
CCATGA
 
Protein sequence
MALSQFVLKI VSRCDLSCDH CYVYEHPDQS WRRQPRTMAP ATVTVAARRI AEHATAHQLG 
TVRVVLHGGE PLLAGASGID AVAGELRRHL DPVTRLDLRM QSNGVLLTEE VAEVLVAHGI
TVGVSLDGDR AANDLHRRYA SGASSHAKVL RALALLRRPE FRPSYAGLLC TVDLTNDPVT
VYRALLAEDP PRIDFLLPHA NWDRPPPRPA GARTAYADWL LAVHRAWTAD GRPVPVRLLD
SLLAMAEGDS SGTEAVGLAA ADLAVIETDG SYEQVDSLKS AYDGAPATGL DVFDHAIDKV
AVHPLIAIRQ AGLAGLCGTC RACPVVRQCG GGLFTHRYRS DTGFDNPSVY CADLAHLVQG
VTDPPPMAQP RSPAAGGPAD LVIPTPRVPG GAAAHPAPTR TEPDALDPAV LDDLGSGYGS
TASVRQLAAV HLAKTRALLV ALSPAVAGQA VTAPAWDLLV DFDVTAPAAV RTVLAHPFVR
RWAYRCLAVP ASADLGHLAR IAAAVAVRAG AAVDLDVPVR DGQLSLPTLG VLRMPDVAGP
VRLAIADGGF RVSGDERSGP VASGHRPPGW RPARRVDTSG TLIEDTDPYR GSYQDLVVAP
RLSAGAAGRW AAQLASADKR DDIGGYAPGV RGLLRAVVPL RPDRRGRQRS ATAASAFGAV
AMTPVPDAAA LAVLLVHEVQ HLKLDGVLDV CELVDRRDTR LLTVPWREDP RPVEGVLHGT
YAHLAVADIW RHRAGAQATA RYRRYRAWTD QALDELLGLG ALTPVGQRFA GRMRATVDSW
P