Gene Plim_1663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1663 
Symbol 
ID9138364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2146482 
End bp2148863 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content55% 
IMG OID 
Productcatalase/peroxidase HPI 
Protein accessionYP_003629693 
Protein GI296121915 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAGC ATTCGTGGTT GAGATCGATG CGACGGTCAA TGGCAGTCAC GGCACTGGCG 
GCGTCGTGTG GTTTGCTGGC CACCAGCGGG CTGGTGACAA GTGCCGATGA TAAGGCGGCT
GCTAAACCAG GTGCACCCGG TGGTCAATGC CCCGTGATGG GGAATGTGAA TCCAGCTTCT
GCCCGGAATA CTGCCGCTGG AGCCATGTCA AATCGCGACT GGTGGCCCGA GCAATTGAAT
CTGTCGATTC TGCATCAGAA CTCGGCGAAG AGTAACCCTA TGGGGCCAAA CTTCAGCTAT
GCAGAAGAAT TCAGCAAACT TGATCTGGTG GCTGTGAAGA AGGACATCAA AGAACTTCTC
TCGACATCGC AGGATTGGTG GCCGGCCGAC TTTGGTAATT ATGGGCCACT GATGATTCGT
ATGGCCTGGC ACAGTGCCGG CACTTATCGA ATTACGGATG GTCGTGGTGG TGCAGGTTAC
GGCACACAGC GATTTGCTCC GCTGAACAGT TGGCCTGATA ATGCCAACCT GGATAAAGCC
CGCCGCCTGC TCTGGCCCAT CAAGCAGAAG TATGGCAACA AGATCTCGTG GGCTGACCTG
ATGATCCTCA CTGGGAATGT AGCCATTGAA TCAATGGGTG GTGAAACACT CGGGTTTGCC
GGTGGTCGCG AAGATGTCTG GGAACCCCAG GAAGATATCT ACTGGGGTCC CGAATCCAAG
TGGCTGGGTG ACAGCCGGTA TACGGGTGAT CGTGTTCTCG AAAAGCCATT GGCAGCCGTG
CAGATGGGGC TGATTTATGT AAATCCTGAA GGTCCCGATG GCAAGCCTGA TCCATTGGCG
GCGGCTCGCG ACATTCGTGA AACATTTGCC CGGATGGCCA TGAATGATGA AGAGACCGTC
GCTCTGATTG CTGGCGGCCA TACATTCGGC AAAGCTCACG GCGCAGCCAC GCCGGAAGGA
AACGTGGGCC CTGCACCGGA AGGTGCCCCC ATCCAGGAAC AAGGCCTGGG CTGGAAGAAC
ACCTTCGGTA AAGGAAATGG AAAAGACACC ATCACCAGTG GTCTGGAAGG TGCCTGGACG
ACGACGCCTA CAAAATGGTC GAACGGTTAC TTCGATAACC TGTTTGGCTA CGAATGGGAA
TTGACCAAGA GCCCTGCGGG TGCCTGGCAA TGGACACCCA AAGAAAAGGC AGCACAAGGG
ACTGTCCCCG ATGCTCATGA CCCCAAAAAG TCTCACGCTC CGATGATGTT CACGACGGAC
ATCGCTCTGA AGACCGATCC TGCGTATGCG AAGGTCTCGA AGAAGTTTCA TGAGAATCCT
GCCGAGTTCA AGCAGGCATT TGCCAAGGCT TGGTATAAGC TGACTCACCG CGATATGGGG
CCAGTCAGCC GCCTGCTGGG GCCTGAAGTG GCTGCACCGC AGATCTGGCA GGATCCAGTC
CCTGCGGTCA ATCACGAGCT GATCAATGCT CAGGATATTG ATTCGCTCAA AGGGACAATT
CTCGCTTCCG AACTGACCAT TCCTCAAATG GTGCGAACCG CCTGGGCTTC GGCGTCGACT
TTCCGAGGCA GTGACAAGCG TGGTGGAGCC AATGGGTCCC GTATTCGCCT GGCTCCCCAG
AAAGACTGGA AGGTCAATCA GCCAGCCGAA CTGGCCAAAG TCTTGAAGGT CTACGAGCAG
ATCCAGAAAG ACTTCAACTC CGCTCAAAAG ACCAATAAGA AAGTCTCGCT GGCAGATCTG
ATTGTGCTGG GTGGTTGTGC CGGTATTGAA GAAGCGGCCA AGAAGGCTGG AAATCCAGTG
AAAGTTCCTT TCGCGCCCGG ACGCACTGAC GCCACGGCTG AAATGACGGA TGCGGAATCC
TTTGCAGTCC TCGAACCCAA GGCTGATGGC TTCCGGAACT TCTTCGGGCA TGACCTTGAC
CGCCGGGGTG AAGAACTGCT GGTTGATCGG GCTCAACTGT TGACACTCAC TGCACCCGAA
ATGACAGTGC TGGTCGGTGG CATGCGAGTG CTTGATACGA ACGTGGGCTT CCCGGGGATG
GGGGTGTTTA CAAAGAATCC CGGAACTCTC ACCAACGATT TCTTTGTGAA TCTGCTGGAT
ATGAACACCA CCTGGCAGAC CTCGCCCATG TGTGAGCACT TCTTTGAAGG ACGCGACCGC
AAGACGGGAC AGGTCAAATG GACGGCTTCA TCGGTCGATC TGGTCTTTGG CTCGAATTCG
CAGTTGAGAG CGATTTCGGA AGTCTATGCC AGTGGCGACG GCAAGCAGAA GTTCTTGAAT
GACTTCGTCG CTGCATGGAC AAAAGTCATG AATCTTGATC GCTTTGATCT GGATCCCAAG
CTCAAAAAGG CCAACGTTCA GGCGGCACTG GGACAAAGAT AG
 
Protein sequence
MTKHSWLRSM RRSMAVTALA ASCGLLATSG LVTSADDKAA AKPGAPGGQC PVMGNVNPAS 
ARNTAAGAMS NRDWWPEQLN LSILHQNSAK SNPMGPNFSY AEEFSKLDLV AVKKDIKELL
STSQDWWPAD FGNYGPLMIR MAWHSAGTYR ITDGRGGAGY GTQRFAPLNS WPDNANLDKA
RRLLWPIKQK YGNKISWADL MILTGNVAIE SMGGETLGFA GGREDVWEPQ EDIYWGPESK
WLGDSRYTGD RVLEKPLAAV QMGLIYVNPE GPDGKPDPLA AARDIRETFA RMAMNDEETV
ALIAGGHTFG KAHGAATPEG NVGPAPEGAP IQEQGLGWKN TFGKGNGKDT ITSGLEGAWT
TTPTKWSNGY FDNLFGYEWE LTKSPAGAWQ WTPKEKAAQG TVPDAHDPKK SHAPMMFTTD
IALKTDPAYA KVSKKFHENP AEFKQAFAKA WYKLTHRDMG PVSRLLGPEV AAPQIWQDPV
PAVNHELINA QDIDSLKGTI LASELTIPQM VRTAWASAST FRGSDKRGGA NGSRIRLAPQ
KDWKVNQPAE LAKVLKVYEQ IQKDFNSAQK TNKKVSLADL IVLGGCAGIE EAAKKAGNPV
KVPFAPGRTD ATAEMTDAES FAVLEPKADG FRNFFGHDLD RRGEELLVDR AQLLTLTAPE
MTVLVGGMRV LDTNVGFPGM GVFTKNPGTL TNDFFVNLLD MNTTWQTSPM CEHFFEGRDR
KTGQVKWTAS SVDLVFGSNS QLRAISEVYA SGDGKQKFLN DFVAAWTKVM NLDRFDLDPK
LKKANVQAAL GQR