Gene Sare_2465 details


Gene Information

Locus tag: Sare_2465
Symbol:
ID: 5706263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2823977
End bp: 2825113
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 67%
IMG OID: 641271931
Product: radical SAM domain-containing protein
Protein accession: YP_001537301
Protein GI: 159038048
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
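
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the listed 1137 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates as listed in the record above.
START_BP = 2823977
END_BP = 2825113


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1137 378
```

Both values agree with the Gene Length and Protein Length fields of the record.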


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGCA TCGGCCTGAT CAACGCCACC GGTCAGCCGG CGGTCGCCGG CAGCTTCCAC 
ACGTTGGTCG TGCAGCCAAC GAGCTTCTGC AACCTCGACT GCACCTACTG CTACCTGCCC
GACCGCAGGT CTCTGCGTCT GATGAGCGGC GCGGTCGCGC AGGCCTGTGC CGAGTCGATA
GCCCAGCAGA ACAGCGGTCA TCCGGTGAGT GTCGTATGGC ACGGGGGCGA ACCCACCGCC
ACACCCATCG GGCTGTTCCG GGGCCTGCTG GCCCCGTTCG AGCAGTTGCG GCGCGCGGGG
ATGGTGCGTC ACGAGATCCA AACGAACGCC ACGCTGATCA ACCGCCAATG GTGCGAGCTG
TTCACCACTT ACGGGTTCGA GGTCGGGGTC AGCATCGACG GACCGAGCGC GTTGAACCGT
AACCGCCTCG ACCGGGCCGG TAACGCGACG GATGCCCGTA TCCTGCGCGG CCTGCAGACC
CTGGCGGAAG CGGGGCTGGG GTACTCGGTG ATCTCTGTGG TCACACCGGA GACCATCGAC
CATGCGGAAG CCCTTGTCGA CTTCTTCACC GACCTGCCCG GCTGCGAGTC GGTGGGCTTC
AACATCGAGG AGCAGGAGGG CGCCGACCGG CCGCCGGTGT CGGAGGATAC CGCGTACCGG
TTTTGGCAGC GTCTCATTGC ACGTCGTGTC GGTGGTAGCC CGCTGCGGAT CCGCGATGTG
GACCGGCTGG CTGACTATGT CGCCGCTACC CGCGCCGGGC GCGTTGACCA TGCGCCGTAC
GAGCCGATCC CGACCGTTTC TTGGGATGGG CAGGTGGTAC TGCTGTCACC GGAGTTGCTC
GGCATCACCG AGCCGCGATA CGGCGACTTC ATCGCCGGCA ACGTCCTCCA GCAACCCATC
ACCGCCATGC TCGCCGGCGC CGGCGACCTG GGCTACGTCG CCGAGTTCGT CACGGCCCTC
AACGACTGCG CCGACCACTG CGCCTTCTAC GACTTCTGCC GGGGCTCCCA GGCCGGCAAC
CGCTACTTCG AGCACGCGAC GTTCACCGCC CGCGAGACCA CCTACTGCCG AACCACGCGA
CAGGCGATCG TCCGCGCCGC CGCGGACCAG CTCACCCCTC AAGGAGAAGC GCCATGA
 
Protein sequence
MNGIGLINAT GQPAVAGSFH TLVVQPTSFC NLDCTYCYLP DRRSLRLMSG AVAQACAESI 
AQQNSGHPVS VVWHGGEPTA TPIGLFRGLL APFEQLRRAG MVRHEIQTNA TLINRQWCEL
FTTYGFEVGV SIDGPSALNR NRLDRAGNAT DARILRGLQT LAEAGLGYSV ISVVTPETID
HAEALVDFFT DLPGCESVGF NIEEQEGADR PPVSEDTAYR FWQRLIARRV GGSPLRIRDV
DRLADYVAAT RAGRVDHAPY EPIPTVSWDG QVVLLSPELL GITEPRYGDF IAGNVLQQPI
TAMLAGAGDL GYVAEFVTAL NDCADHCAFY DFCRGSQAGN RYFEHATFTA RETTYCRTTR
QAIVRAAADQ LTPQGEAP
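
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below illustrates this for the first ten codons only. The codon dictionary is a deliberately partial excerpt covering just the codons in this prefix, and the function name is hypothetical.

```python
# Partial codon table: only the codons that occur in the first 30 bases
# of the gene sequence above (an excerpt, not the full genetic code).
CODONS = {
    "GTG": "V", "ATG": "M", "AAC": "N", "GGC": "G",
    "ATC": "I", "CTG": "L", "GCC": "A", "ACC": "T",
}

# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading an allowed start codon as Met."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as Met
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First three 10-base groups of the gene sequence, as printed in the record.
print(translate_prefix("GTGAACGGCA TCGGCCTGAT CAACGCCACC"))  # MNGIGLINAT
```

The result matches the first ten residues of the protein sequence listed above.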