Gene Sare_2936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2936 
Symbol 
ID5705241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3325917 
End bp3328061 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content66% 
IMG OID641272385 
Productperoxidase 
Protein accessionYP_001537753 
Protein GI159038500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.208315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGCTG GTGTGAGCCT GCCCGGCGTC GCTGCGGCCG ACGCCTCCCA CGACGGGCCG 
GATCGGCGCC TTCTGGGTGT GTGGGAGGTG CAGAGCCTGG ACGGCGCCGA CAACAACCCC
AGACACCCGA CGTGGGGCAT GGCCAACACG AACTACCCTC GGATGGCCGC CGCCAACTAT
GCCGACGGCC TGGGCGAGCC GGTCAATGCC CCGAACCCCC GTTACATCAG CAACCGAGTC
ATCAATGACA CGGGGCTGTC GTTGTACTCC GAGGGCAACG TGAGTCAATG GGGATTCGTC
TGGGGACAGT TCCTCGACCA CACCTTCGCC GAACGCCTCG GCCGCCGCGA GGTATCCGAC
CCCGCCGAGC CGGCGCCAAT CGTCGTCGAC GACTCCGACC CCATGGAGTT CTACCGAACC
AACCTCGGGT TCATTCCGTT CGACCGATCG GCCATAGCTC CCGGCAGCGG CATCGACGGT
CCGCGGGAAC AGATCAACAC ACACAGTTCC TACGTCGATG GTGCCACCAT CTATGGTCAG
ACCGAGGAGC GGCTGGACTG GCTACGCGTT GGGTCGGTCG ACGGTGACCC GCGAAACAAC
AACGCGCGGC TCCTGATGTC GGCCGACGAC TACCTGCCGC GCCGCGACGC CCGCGGTAAC
CCCGACAGCG CGCCGCTCAT GGTCGTGGGC AGCAACGTTC CGGCGAGGGT GGCGGTCGCC
GGCGATGCCC GGGCCAACGA GAACCCGCCG CTGCTCGCCA CGCACACACT CTTCGCCCGC
GAGCACAACC GCATCGTGGC ACGCCTGCCG CGATGGTTGT CCGAGGAGGA CAAGTTTCAG
ATCGCGCGGG CGGTCGTCAT CGCCGAGCAG CAGTACATAA CCTTCGAGGA GTTCCTGCCC
GCGCTGGGGG TGACCCTACA GCCGTACCGC GGGTACCGCC CGACGGTCAA CAGCTCCCTC
TCCAACGAAT TCGCCACCGT CGCGTACCGC GCGCACAGTC AGATCCGCGG CGACTTCCGT
CTGGAAGCCG AGGCAGGCCG CTACTCGCCC GAGGACCTGG ACCGACTGGA GTCCCTCGGC
GTCCTCATCG AGGCTGATGG TGACACAGTC GGCATGACGA TTCCGCTCAG CGAGGACGCG
TTCTTCAACC CGGACATGCT CGAGCTTCTT CAGCTGGGAC CGCTCCTGGA GGGCATTGGC
CGCAACACCC AGCACAACAA CGATGAGTTG ATCGACAATC TACTACGCAG CGTCGTGTTC
GACATCCCGG TGCCAGAGAA CCCGACCTGC GCGGACGAGC CGGATCTGCC CGCCTGTATC
AGGGGAGTCA ACGACCTCGC GGCAATCGAC ATCGCTCGCG GCCGCGATCA TGGCATGCCG
ACCTACAACC AGCTTCGCGT GGCCATGGGA CTGCCAGCGA AGACCTCGTT CGCGGCCATT
ACCGGTGAGG AGTCCGAGGA GTTCCCGGCC GACCCGCTGC TGACTGCCGG CGATGAGATC
AACGACCCGG ACAGCCTCGA CTTCGTGGCG ATCTACAACG GTGATGGTGA GCCGACCACG
CCGGAGACAG GCGATGCCAC GAGCGCGCAG CGGCGCGCGC CGTTGGCCGC CAGGCTGAAG
GCCATCTACG GCAGTGTCGA CAGTGTGGAT GCCTTCGTGG GCATGTTGTC CGAGCCTCAC
GTGCCGGGGA CCGAGTTCGG CGAGTTGCAG CTCACCGTCT GGCGAGACTC TTTCACGGGC
CTGCGTGATG GTGACCGATT CTTCTACGCC AACGATCCGC TGCTGCGTCA CGTCCGGCGC
GCGTTCGGGA TCGACTACCG CACGTCGCTG GGCGATCTGA TCGCACGCAA CACCAGTGTC
CCGCGGTCGG CGATGCCTGA CAACGTGTTC CTGACGAAAC AGCGGGAGGA CCCGACGCAC
CCCTGGTGCC GATGGCTGCC CCCAGCATGG TGTGAGTGGC TCGACCGGCA CGCGACGGCC
GCCAAGGTCG AGGCGCGATT CGTGAAGGAT GGATACACCA GGTCGACCCA TCACCGGGTG
GCTTCGCGGA ATGGCCATGC TCCCGGCTGG CTCGCAGGTG TCGATGCCCG AGTGGTGCCG
CGCCGCTCGC CGCAGGGGTG GCGGCGCGAC GGAGCCCACT GCTGA
 
Protein sequence
MFAGVSLPGV AAADASHDGP DRRLLGVWEV QSLDGADNNP RHPTWGMANT NYPRMAAANY 
ADGLGEPVNA PNPRYISNRV INDTGLSLYS EGNVSQWGFV WGQFLDHTFA ERLGRREVSD
PAEPAPIVVD DSDPMEFYRT NLGFIPFDRS AIAPGSGIDG PREQINTHSS YVDGATIYGQ
TEERLDWLRV GSVDGDPRNN NARLLMSADD YLPRRDARGN PDSAPLMVVG SNVPARVAVA
GDARANENPP LLATHTLFAR EHNRIVARLP RWLSEEDKFQ IARAVVIAEQ QYITFEEFLP
ALGVTLQPYR GYRPTVNSSL SNEFATVAYR AHSQIRGDFR LEAEAGRYSP EDLDRLESLG
VLIEADGDTV GMTIPLSEDA FFNPDMLELL QLGPLLEGIG RNTQHNNDEL IDNLLRSVVF
DIPVPENPTC ADEPDLPACI RGVNDLAAID IARGRDHGMP TYNQLRVAMG LPAKTSFAAI
TGEESEEFPA DPLLTAGDEI NDPDSLDFVA IYNGDGEPTT PETGDATSAQ RRAPLAARLK
AIYGSVDSVD AFVGMLSEPH VPGTEFGELQ LTVWRDSFTG LRDGDRFFYA NDPLLRHVRR
AFGIDYRTSL GDLIARNTSV PRSAMPDNVF LTKQREDPTH PWCRWLPPAW CEWLDRHATA
AKVEARFVKD GYTRSTHHRV ASRNGHAPGW LAGVDARVVP RRSPQGWRRD GAHC