Gene Information |
Locus tag | Franean1_1533 |
Symbol | |
ID | 5669937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1835124 |
End bp | 1837355 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240453 |
Product | catalase/peroxidase HPI |
Protein accession | YP_001505879 |
Protein GI | 158313371 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
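The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 1835124
END_BP = 1837355


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "2232 743"
```

Both numbers match the Gene Length and Protein Length rows of the record.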
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0328118 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGAGA ATCACGAGAC AGTTGTTTCC GAACTGAACG AGGAAAGCGG TGGCGGCTGC CCGGTCGCGC ATGAGCGGGC TCCGCATCCC ACCCAGGGTG GCGGGAACCG TGGTTGGTGG CCGAACCGGC TCAACCTGAA GATCCTCGCC AAGAATCCCG CGGTGGCCAA TCCGCTCGGT GAGGAGTTCG ACTACGCCGC GGCGTTCCGG ACGCTCGACC TCCCCGCCGT CAAGCGGGAC ATCGCGCAGG TGCTGACGAC GTCGCAGGAC TGGTGGCCGG CCGACTACGG TCACTACGGC CCGTTCATGA TCAGGATGGC GTGGCACAGC GCGGGTACCT ACCGCATCAG CGATGGCCGT GGTGGCGGCG GCGCCGGGCA GCAGCGTTTC GCGCCGCTCA ACAGCTGGCC GGACAACGGG AACCTGGACA AGGCGCGCCG CCTGCTGTGG CCGGTGAAGA AGAAGTACGG TCAGGCGCTC TCCTGGGCGG ACCTGATGAT TCTCGCCGGC AACGTCGCCC TGGAATCGAT GGGTTTCACG ACCTTCGGCT TCGCCGGTGG CCGCGAGGAT GTCTGGGAGC CGGACGAGGA CGTCTACTGG GGCCCGGAGA CCACCTGGCT CGGCGACGAG CGCTACACCG GTGACCGGGA GCTCGAGAAT CCGCTCGGGG CGGTCCAGAT GGGCCTCATC TACGTCAACC CGGAGGGCCC GAACGGCACC CCGGACCCGC TGGCCGCGGC CCGCGACATC CGCGAGACGT TCCGCCGCAT GGCGATGGAC GACGAGGAGA CGGTGGCGCT CATCGCCGGT GGCCACACGT TCGGCAAGAC CCACGGCGCG GGCGACCCGG ACAACGTCGG TCCGGAGCCC GAAGGTGCCC CCCTTGAGAC GCAGGGCCTC GGCTGGAAGA ACGCCTTCGG TACCGGCAAG GGCGCGGACG CGATCACCAG CGGGCTCGAG GGTGCCTGGA CTCCCACCCC GGTGAGCTGG GACAACAGCT TCTTCGAGAC GCTTTTCGGC TACGAGTGGG CGCTGACGAA GAGCCCGGCC GGGGCCTACC AGTGGAAGCC GAAGGGCGGC GCGGGCGCTG GCACCGTCCC CGATGCCCAC GACGCGGCGA AGAGCCACGC CCCGACGATG CTGACGACCG ACCTCGCCCT GCGGTTCGAC CCGGTGTACG AGCCGATCTC GCGGCGGTTC CTGGAGCACC CGGACGAGCT CGCGGACGCG TTCGCCCGGG CGTGGTTCAA GCTGACCCAC CGTGACATGG GGCCGGTCGC GCGCTACCTC GGCCCGGAGG TCCCGGCCGA GACGCTGCTG TGGCAGGACC CGGTGCCGGC GGTGGACCAC GAGCTCGTCG ACGCCGCGGA CGTCGCCGCG CTGAAGGTCC GGGTCCTCGC CTCGGGCCTG TCGGTCTCCG AGCTGGTCGC GACCGCGTGG GCGTCGGCCT CGACGTTCCG TGGCGGTGAC AAGCGCGGTG GTGCCAACGG CGCGCGCATC CGCCTCGAGC CGCAGCGCGG CTGGGAGGTG AACGAGCCGG ACCGGCTGGC GGCGGTGCTG GGGACGCTGA CCGGCATCCA GGAGGAGTTC CACGCCGCCC GGACCGACGG CCGGCGGGTC TCGCTCGCCG ACCTGATCGT GCTCGCCGGT GGTGCCGCTG TCGAGCAGGC AGCCCGCGAA GCCGGCTTCG ACGTCGAGGT CCCGTTCACC CCGGGCCGGA CCGACGCGTC CCAGGAGCTG ACCGACGTCG AGTCGTTCGC GGCGCTCGAA CCGGCCGCGG ACGGGTTCCG TAACTACCTC 
GGGAAGGGCC AGCGCCTGCC GGCCGAGTAC CTGCTGCTCG ACCGGGCGAA CCTGCTGACC CTGAGCGCCC CCGAGCTGAC GGTCCTCGTC GGTGGCCTGC GGGTGCTGGG GGCGAACTTC CGGCAGTCCT CGCTGGGGGT CCTCACCGCG ACGCCCGGGG TGTTGACCAA CGACTTCTTC GCCAACCTGC TCGACCTGGG CACGACGTGG CGCCCGAGCG GCGAGGACGA CAACGTCTTC GAGGGCCGCG ACGCCGCCAC GGGCGAGCTG ACCTGGACCG GTAGCCGCGT CGACCTCGTC TTCGGCTCGA ACTCCGAGCT GCGCGCGTTC GCGGAGGTGT ACGCGAGCGA CGACGCGCGG GAGAAGTTCG TACGCGACTT CGTCGCGGCC TGGGCCAAGG TGATGAACCT CGACCGCTAC GACCTCGCCT GA
|
Protein sequence | MSENHETVVS ELNEESGGGC PVAHERAPHP TQGGGNRGWW PNRLNLKILA KNPAVANPLG EEFDYAAAFR TLDLPAVKRD IAQVLTTSQD WWPADYGHYG PFMIRMAWHS AGTYRISDGR GGGGAGQQRF APLNSWPDNG NLDKARRLLW PVKKKYGQAL SWADLMILAG NVALESMGFT TFGFAGGRED VWEPDEDVYW GPETTWLGDE RYTGDRELEN PLGAVQMGLI YVNPEGPNGT PDPLAAARDI RETFRRMAMD DEETVALIAG GHTFGKTHGA GDPDNVGPEP EGAPLETQGL GWKNAFGTGK GADAITSGLE GAWTPTPVSW DNSFFETLFG YEWALTKSPA GAYQWKPKGG AGAGTVPDAH DAAKSHAPTM LTTDLALRFD PVYEPISRRF LEHPDELADA FARAWFKLTH RDMGPVARYL GPEVPAETLL WQDPVPAVDH ELVDAADVAA LKVRVLASGL SVSELVATAW ASASTFRGGD KRGGANGARI RLEPQRGWEV NEPDRLAAVL GTLTGIQEEF HAARTDGRRV SLADLIVLAG GAAVEQAARE AGFDVEVPFT PGRTDASQEL TDVESFAALE PAADGFRNYL GKGQRLPAEY LLLDRANLLT LSAPELTVLV GGLRVLGANF RQSSLGVLTA TPGVLTNDFF ANLLDLGTTW RPSGEDDNVF EGRDAATGEL TWTGSRVDLV FGSNSELRAF AEVYASDDAR EKFVRDFVAA WAKVMNLDRY DLA
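The sequence rows are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch, not a definitive implementation: `clean_sequence`, `gc_fraction`, and `first_and_last_codon` are hypothetical helper names, and only the first three blocks of the gene sequence are used as sample input. Under translation table 11, GTG is an accepted initiator codon and is read as methionine at the start position, which is why the gene begins with GTG while the protein begins with M.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record layout."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def first_and_last_codon(seq: str) -> tuple:
    """Return the first and last three bases of a cleaned CDS."""
    return seq[:3], seq[-3:]


# Sample input: the first three blocks of the gene sequence above.
# Paste the full Gene sequence field here to compare with the GC content row.
sample = clean_sequence("GTGTCCGAGA ATCACGAGAC AGTTGTTTCC")
print(len(sample), round(gc_fraction(sample), 2))
print(first_and_last_codon(sample)[0])  # first codon of the gene
```

Running `gc_fraction` on the full cleaned gene sequence should give a value comparable to the reported GC content, though the exact rounding used by the source database is not stated in the record.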