Gene Strop_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2054 
Symbol 
ID5058517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2324947 
End bp2327223 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content69% 
IMG OID640474319 
Productcatalase/peroxidase HPI 
Protein accessionYP_001158885 
Protein GI145594588 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.886153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA CTCAGGACAA CGCCCCCGTC AGCGCGCAGG GCGTGGATCA GAAGGCGGCG 
GCCGGCTGTC CGGTCGCGCA CGACTCAGTC ACCGCGCACG GCAGCGAGAG CGAGAGCCCG
GCGATCGACT CTCCCAGCGC GGTCGGTGGT GGCCGCCCAC GTACCAACCG GGACTGGTGG
CCCAACCAGC TTGACCTGTC GGTGCTCTCC ACCAACTCGG CGAAGGTCAA CCCGTTGGGC
GAGGACTTCT CCTACGCCAA GGAGTTCGCC AAGCTCGACG TCGAGGCCCT CAAGCGGGAC
ATCACCGAGG TGCTCACCAC CTCGCAGGAC TGGTGGCCGG CCGACTTCGG CCACTACGGC
GGTCTGATGA TCCGGCTGAG CTGGCACGCT GCCGGCACCT ACCGCATCCA CGACGGCCGC
GGTGGCGCCG GTGACGGCGG CCAGCGGTTC GCGCCGCTGA ACAGCTGGCC GGACAACGTC
AATCTGGACA AGGCCCGGCG GCTGTTGTGG CCGGTCAAGC AGAAGTACGG TCAGAAGATC
TCCTGGGCTG ACCTGCTCGT GCTCGCCGGC AACGTCGCCC TGGAGTCGAT GGGCTTCAAG
ACGTTCGGGT TCGGTTTCGG CCGGGAGGAC GTCTGGGAGC CCGAGGAGAT CTTCTGGGGC
CCGGAGGACA CCTGGCTCGG CGACGAGCGC TACGTCTCTG AGAAGGAGTT CTCGGCGGGC
GTCGGGGCGA CCGAGATGGG TCTGATCTAC GTCAACCCGG AGGGCCCGCG CGGCAACGCG
GACCCGGCCT CGGCGGCGCA CTTCATCCGG GAGACCTTCC GCCGGATGGC GATGAACGAC
GAGGAGACCG TGGCCCTCAT CGCCGGTGGC CACACCTTTG GCAAGACCCA CGGTGCCGGG
GTCGCCGACG ATCATGTGGG TCCCGAGCCC GAGGGCGCCC CCCTGGAGGC GCAGGGCCTG
GGCTGGATGA GCAGCCACGC CAGCGGAGTG GGTGCAGACA CGATCTCCAG CGGCCTCGAG
GTGACGTGGA CCGACCGGCC GACGCAGTGG AGCAACCGCT TCTTTGAGAT CCTGTTCGGC
TACGAGTGGG AACTCACCAC CAGCCCCGGT GGCGCGAAGC AGTGGGTCGC CAAGGACGCC
GAGGCGATCA TCCCCGACGC GTACGACTCG ACCAAGAAGC ACAAGCCGAC CATGCTCACG
ACCGACCTGT CGCTGCGCGT TGACCCGGCG TACGAGCGGA TCTCGCGTCG CTTCCTGGAG
AACCCGGACG AGTTCGCGCT GGCCTTCGCC AAGGCCTGGT ACAAGTTGCT GCACCGCGAC
ATGGGCCCGG TCAGCCGGTT CCTCGGGCCG TGGGTGCCGC AGACGCAGCT GTGGCAGGAC
CCGGTACCCG CCGTTGACCA CGAGCTCGTC GGCGCAGCCG ACATCGCCGC CCTCAAGGCG
AAGGTGCTTG AGTCCGGCCT GACGACCACC CAGTTGGTCT CCACCGCGTG GGCCTCCGCG
GCCAGCTTCC GCCACACCGA CAAGCGCGGT GGCGCCAACG GTGCCCGGAT CCGCCTCGAG
CCGCAGCGCA GCTGGGAGGT CAACCAGCCC GAGCAACTCG CCACCGTCCT GCCGGCGCTG
GAGGAGATCC AGCGGGAGTT CAACGCCGCT GGTGGCGCGA AGATCTCGCT CGCAGATCTG
ATCGTGCTGG CCGGCTCAGC CGCGGTCGAG AAGGCGGCGC GGGACGCCGG CGTCGAGGTG
ACCGTGCCGT TCCGGCCAGG TCGCACCGAC GCCACCCAGG AGCAGACCGA TGTCGACTCC
TTCCGGGTGC TCGAACCGCG GGCTGACGCG TTCCGTAACT ACCTGCGTCC GGGTGAGAAG
ACCCAGCCGG AGGTGCTGCT CGTTGACCGT GCCTACCTGC TCAACCTGAC CGCACCCGAG
ATGACCGTCC TCATCGGCGG CCTGCGGGCG CTCGAGGCCA ACGCCGGCGG CAGCCGGCAC
GGCGTTCTCA CCGACCGTCC CGGTGTGCTC ACCAACGACT TCTTTACCAA CCTGCTCGCC
TCGGGCGCGC GGTGGAAGGC ATCGGAGTCC ACTGAGCACG CCTACGAGAT CCGGGATGTG
GCCACCGACA AGGTGAAGTG GACCGCCAGC GCGGTCGACC TCATCTTCGG CTCGAACTCG
CAGCTGCGGG CCCTGGCCGA GGTGTACGCC AGCGAGGACG CGCGGGAGAA GTTCGTGCAG
GACTTCGTCG CGGCCTGGAC CAAGGTCATG GAGCTCGACC GGTTCGACCT CGCCTGA
 
Protein sequence
MSDTQDNAPV SAQGVDQKAA AGCPVAHDSV TAHGSESESP AIDSPSAVGG GRPRTNRDWW 
PNQLDLSVLS TNSAKVNPLG EDFSYAKEFA KLDVEALKRD ITEVLTTSQD WWPADFGHYG
GLMIRLSWHA AGTYRIHDGR GGAGDGGQRF APLNSWPDNV NLDKARRLLW PVKQKYGQKI
SWADLLVLAG NVALESMGFK TFGFGFGRED VWEPEEIFWG PEDTWLGDER YVSEKEFSAG
VGATEMGLIY VNPEGPRGNA DPASAAHFIR ETFRRMAMND EETVALIAGG HTFGKTHGAG
VADDHVGPEP EGAPLEAQGL GWMSSHASGV GADTISSGLE VTWTDRPTQW SNRFFEILFG
YEWELTTSPG GAKQWVAKDA EAIIPDAYDS TKKHKPTMLT TDLSLRVDPA YERISRRFLE
NPDEFALAFA KAWYKLLHRD MGPVSRFLGP WVPQTQLWQD PVPAVDHELV GAADIAALKA
KVLESGLTTT QLVSTAWASA ASFRHTDKRG GANGARIRLE PQRSWEVNQP EQLATVLPAL
EEIQREFNAA GGAKISLADL IVLAGSAAVE KAARDAGVEV TVPFRPGRTD ATQEQTDVDS
FRVLEPRADA FRNYLRPGEK TQPEVLLVDR AYLLNLTAPE MTVLIGGLRA LEANAGGSRH
GVLTDRPGVL TNDFFTNLLA SGARWKASES TEHAYEIRDV ATDKVKWTAS AVDLIFGSNS
QLRALAEVYA SEDAREKFVQ DFVAAWTKVM ELDRFDLA