Gene Sare_2202 details

Gene Information

Locus tag: Sare_2202
Symbol:
ID: 5708197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2535865
End bp: 2538111
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 69%
IMG OID: 641271683
Product: catalase
Protein accession: YP_001537054
Protein GI: 159037801
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0753] Catalase
TIGRFAM ID:
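
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2535865
end_bp = 2538111

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 2247 748
```

Under these assumptions the computed values agree with the listed gene length (2247 bp) and protein length (748 aa).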


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.366789
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCCA ACAAGCCCGC CAAGGCCGTC GAGGGTGTCG TGCAGGCCGC GGCCGAGGAG 
GGGGCCGATG TCACCCGCGC GGACGTTCCC GGCGCACCAG GTAGCGCTCC GCCCAGCGTC
GAGGAACCAA CGGAACCGCA TGACCCGCTG CCGTCCAGGA AGGAGCAGGG CACCCCGGAG
ACGCGAACCC CGACGGGCGC GACCACCGGC TTGCCCCCCG CCGCCAATGG CCAGCAGGGT
GCGTTTCTCA CGACGGCGCA GGGGGCACGG CTGCGCGACA CCGACCACTC CCTCAAGGCT
GGACCGCGCG GCCCAATCCT GATGCAGGAC CACCACTTCC GCGAGAAGAT CACGCACTTC
GACCACGAGC GCATCCCCGA GCGCGTCGTG CATGCCCGCG GGGCCGGGGC GCACGGCGTC
TTCACCAGCT ACGGCGCCGC GGAGGTCACC CGGGCCGGCT TTCTGCGCAA AGGGAAGGAG
ACCCCGGTCT TCGTCCGGTT CTCCACCGTC CTCGGCTCCC GCGGGTCGGC CGACACGGTC
CGCGACACCC GTGGCTTTGC GACGAAGTTC TACACCGACG AGGGCAATTT CGACCTGGTC
GGTAACAACA TGCCGGTCTT CTTCATCCAG GACGCGATCA AGTTTCCCGA CATCATCCAC
GCCGGGAAGC CACACCCGGA CCGGGAGATC CCGCAGGCGC AGAGCGCGCA TGACACGTTC
TGGGACTTCG TCTCGTTGCA CACGGAGGCG CAGCACCACG CCATGTGGAA CATGTCCGAC
CGGGGCATTC CACGCTCGTT TCGGATGATG GAAGGTTTCG GCGTACACAC CTTCCGATTG
GTCAACGCGG CCGGTGAGAC GGCGCTGGCG AAGTTCCACT GGAAGCCGAA GCTTGGCGTG
CACTCCCTGA CCTGGGAGGA GGCACAGATG ATCAGCGGTA TGGACCCGGA CTTCCATCGG
CGTGACCTCT ACGATGCCAT CGAGGCCGGT GCCTACCCCG AGTGGGAACT CGGCCTGCAG
ATCGTTCCGG ACAGCCCGGA GGAGACCTTC GCCGGCGTCG ACCTGCTCGA CCCGACAAAG
TTCGTCCCGG AGGAGTTGGC GCCGGTGCAG CCGGTCGGCA TGTTGGTCCT CAACCGAATG
CCGACGAACT TCTTCGCCGA GACCGAGCAG GTCGCCTTCC ACGTGGGCAG CCTGGTGGCG
GGCATCGACG TCACCAACGA CCCGCTGTTG CAGGGGCGGC TCTTCTCGTA CGTGGACACG
CAGCTGACCC GGTTGGCCGG GCCGAACTTT CCGCAGATTC CGATCAACCG ACCGCACGCT
CCGGTCAACG ACCTGCTGCG GGACGGCTTC CACCAGCAGG CGGTACACAC GGGGGTGGCG
CCGTACCGGC CGAACTCGCT TGACGGTGGC AACCCCTTCC CGGCCGGGGA CGACGAGAAC
GCGTTCCTCG ACGTCCCGGT GACCGTCGCT GAGGCCCCGA AGGTACGCGC CAACCCGGTC
TCCTTCGACG ACCACTTCAG CCAGGTCCGG CTGTTCTGGC TCAGCATGTC GCCGGTGGAG
CGGGAACACA TCATCCGCGC CTACACCTTT GAGCTCAGCA AGTGCTACCA CCAGGAGATC
AAGGAACGGC AGCTGCGGTG CCTGGCCAAC ATCGACCCGG TGCTCTGTGC ACAGGTCGCC
GCCGGCCTCG GCCTCCCCGC GCCGGAGCCG ACGGTGCCGC CGGTCGACGT CGAGCCCAGC
CCCGCCCTGT CCCAGTTGGG CCGGACATGG CCGACGGACG GCCGGACGGT CGGCATCGTG
GTCGACGCGG ACAGCGACCT CGACGGCGTC GACGGCGTCC GCAGCGCCGT CTTCGCCGCG
GGCATGGTGC CGCTGCTGGT CGCGGCGCAC GGCGGTAAGG TCGGCGAGTT GCCGGTGCAA
CGGACCTTCG CCACTGGGCG CGCGACCGAA TTCGACGCAC TGCTGCTGGC CGGCGCGCCG
GCGCCAGCGC CGGACGCCCT GCCGGCCGCG GACGCCGGCC ACGCCACGCT CGTCGATCCA
CGCGTGCGGT TGCTGGTGCA GGAAAGCTGG CGGCACGCCA AGGCGATCGG TGGTTGGGGC
GCGGGCGTCA GCGTGTTGGA GCAGGCCGGG GTGCTGGGTA CTCCCGGTGT GGTGACCGGC
GGTTCGGGGA CCGACGTTCT CGCCGGGGTG CAACGCCTGA TGGCCGCACA CCGGGTATGG
GAACGCTTCC CCACCGCGGT CACCTGA
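
The gene sequence above is laid out in blocks of ten bases, so it needs to be joined before any analysis. As a minimal sketch, the hypothetical helpers below strip the whitespace and compute the G+C fraction, which is the quantity reported as "GC content" in the record. Only the first line of the sequence is used here as a demonstration; pasting the full sequence works the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as a small demonstration.
first_line = "ATGGATCCCA ACAAGCCCGC CAAGGCCGTC GAGGGTGTCG TGCAGGCCGC GGCCGAGGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```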
 
Protein sequence
MDPNKPAKAV EGVVQAAAEE GADVTRADVP GAPGSAPPSV EEPTEPHDPL PSRKEQGTPE 
TRTPTGATTG LPPAANGQQG AFLTTAQGAR LRDTDHSLKA GPRGPILMQD HHFREKITHF
DHERIPERVV HARGAGAHGV FTSYGAAEVT RAGFLRKGKE TPVFVRFSTV LGSRGSADTV
RDTRGFATKF YTDEGNFDLV GNNMPVFFIQ DAIKFPDIIH AGKPHPDREI PQAQSAHDTF
WDFVSLHTEA QHHAMWNMSD RGIPRSFRMM EGFGVHTFRL VNAAGETALA KFHWKPKLGV
HSLTWEEAQM ISGMDPDFHR RDLYDAIEAG AYPEWELGLQ IVPDSPEETF AGVDLLDPTK
FVPEELAPVQ PVGMLVLNRM PTNFFAETEQ VAFHVGSLVA GIDVTNDPLL QGRLFSYVDT
QLTRLAGPNF PQIPINRPHA PVNDLLRDGF HQQAVHTGVA PYRPNSLDGG NPFPAGDDEN
AFLDVPVTVA EAPKVRANPV SFDDHFSQVR LFWLSMSPVE REHIIRAYTF ELSKCYHQEI
KERQLRCLAN IDPVLCAQVA AGLGLPAPEP TVPPVDVEPS PALSQLGRTW PTDGRTVGIV
VDADSDLDGV DGVRSAVFAA GMVPLLVAAH GGKVGELPVQ RTFATGRATE FDALLLAGAP
APAPDALPAA DAGHATLVDP RVRLLVQESW RHAKAIGGWG AGVSVLEQAG VLGTPGVVTG
GSGTDVLAGV QRLMAAHRVW ERFPTAVT
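
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own method: it builds the codon table by hand instead of using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The function and constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequence can be pasted as-is.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 27 bases of the gene sequence listed above.
print(translate("ATGGATCCCA ACAAGCCCGC CAAGGCCGTC"))  # MDPNKPAKAV
print(translate("GAACGCTTCC CCACCGCGGT CACCTGA"))     # ERFPTAVT
```

Both fragments match the corresponding ends of the protein sequence above, and the final TGA codon is the stop codon that is excluded from the 748 aa protein length.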