Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1918 |
Symbol | |
ID | 3906867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2253077 |
End bp | 2255311 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879256 |
Product | catalase/peroxidase HPI |
Protein accession | YP_481023 |
Protein GI | 86740623 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0108605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.336609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGA ACCATGACGC AGTCGTGTAC AACACGAACG CGGAGAACGG CGGTGGCTGC CCGGTCGCGC ACAGGCGCGC CCCGCACCCC ACCCAGGGCG GCGGAAACCG CGGCTGGTGG CCGAACCGGC TCAACCTGAA GATCCTCGCC AAGAACCCCG CCGTGGCCAA CCCCCTCGGC GGGGAGTTCA CCTATGCCGA GGCCTTCCGG ACCCTCGACC TCGCCGCCGT GAAGCAGGAC ATCGCGGCGG TGCTGACTAC CTCGCAGGCC TGGTGGCCGG CCGACTACGG TCACTACGGC CCGTTCATCA TCCGGATGGC GTGGCACAGC GCGGGCACCT ACCGCATCAG CGACGGTCGC GGTGGTGCGG GTGCCGGCCA GCTGCGTTTC GCTCCCCTCA ACAGCTGGCC GGACAACGCG AACCTCGACA AGGCTCGCCG CCTGCTGTGG CCGGTCAAGA AGAAGTACGG TCAGAAGATC TCATGGGCCG ATCTGATGAT CCTCGCCGGC AACGTCGCCC TGGAGTCGAT GGGCTTCGAG ACCTTCGGCT TCGCCGGCGG TCGGGTGGAC GTCTGGGAGC CTGACGAGGA CGTCTACTGG GGCCCCGAGA CCACCTGGCT CGACGACGAG CGCTACACCG GCGACCGGGA GCTCGAGAAT CCCCTCGCCG CCGTCCAGAT GGGTCTCATC TACGTCAACC CGGAGGGCCC GAACGGCAAC CCGGACCCGA TCGCCGCGGC GCGCGACATC CGCGAGACGT TCCGCCGGAT GGCGATGAAC GACGAGGAGA CGGTCGCCCT GATCGCCGGC GGCCACACCT TCGGCAAGAC CCACGGCGCG GCCAACCCGG ACGAGCACGT CGGCCCGGAG CCCGAGGGCG CCCCCATCGA GGAGCAGGGC TTCGGCTGGA CGAGCACCTT CGGCACCGGC AGGGGTGGGG ACACGATCAC CAGCGGGCTT GAGGGTGCGT GGACGAACAC CCCGGTGAGC TGGGACAACA GCTTCTTCGA GATCCTGTTC AGCTACGAGT GGGAGCTGAC GAAGAGCCCC GCCGGTGCGA ACCAGTGGAA GCCGAAGGAC GGTGCCGGTG CCGGCACCGT CCCCGACGCC CACGACGCAG CGAAAAGCCA CGCCCCCACG ATGCTGACGA CGGACCTCGC CCTCCGGTTC GACCCGATCT ACGAGCCCAT CTCGCGGCGC TTCCTGGAGA ATCCGAGCGC GTTCGCGGAC GCGTTCGCCC GGGCGTGGTT CAAGCTGACG CATCGTGACC TGGGGCCGGT CGCGCGCTAC CTCGGCCCGG AGGTCCCGAC CGAGACGCTG CTGTGGCAGG ACCCGCTCCC GGCGGTGGCC CACGAGCTCA TCGACGCCGC GGACGTCGCC ACTCTCAAGG GTCAGATCCT TGCCTCGGGC CTGTCGGTCT CCCAGCTGGT CTCCACCGCG TGGGCGTCGG CCTCGACGTT CCGCGGTGGT GACAAGCGCG GCGGCGCCAA CGGTGCGCGC ATCCGCCTCG AACCACAGCG CGGTTGGGAG GTCAACGAAC CCGACCAGCT GGCGGCGGTC CTGCGCACGC TGACGAGAAT CCAGGAGGTC TTCAACGCCG CCCAGACCGG CGGCAAGCAG GTCTCACTCG CCGACCTGAT CGTGCTCGCC GGTGGTGTTG CCGTCGAGCA GGCCGCCGCG AACGCCGGCT TCGACGTCGA GGTCCCCTTC GCACCGGGAC GTACCGACGC GTCGCAGGAG CAGACCGACG TGGAGTCGTT CGCGGTGCTC GAGCCGACGG CGGACGGGTT CCGCAACTAC CTGGGGAAGG GCCACCGCCT GCCGGCTGAG TACCTTCTGC TCGACCGGGC GAACCAGCTG ACCCTGAGCG CCCCCGAGCT GACGGTCCTC GTCGGTGGTC TGCGGGTCCT GGGCGCCAAC TACCAGCAGT CGCCGCTCGG CGTCTTCACC GCGACCCCCG GGTCGCTGAC GAACGACTTC TTCGTCAACC TGCTCGAGCT GGGCACGACG TGGAAGACGA CGTCCGAGGA CGCGAACACC TTCGAGGGCC GCGATGCCGC CACGGGCAAG GTCAGGTGGA CCGGCAGCCG CGCCGACCTC GTCTTCGGTT CAAACTCCGA ACTGCGCGCG CTCGCGGAGG TCTACGCGAG CGACGACGCG CGGGAGAAGT TCGTGCACGA CTTTGTCGCG GCGTGGGTCA AGGTGATGAA CCTCGACCGG TTCGACCTCG TCTGA
|
Protein sequence | MSENHDAVVY NTNAENGGGC PVAHRRAPHP TQGGGNRGWW PNRLNLKILA KNPAVANPLG GEFTYAEAFR TLDLAAVKQD IAAVLTTSQA WWPADYGHYG PFIIRMAWHS AGTYRISDGR GGAGAGQLRF APLNSWPDNA NLDKARRLLW PVKKKYGQKI SWADLMILAG NVALESMGFE TFGFAGGRVD VWEPDEDVYW GPETTWLDDE RYTGDRELEN PLAAVQMGLI YVNPEGPNGN PDPIAAARDI RETFRRMAMN DEETVALIAG GHTFGKTHGA ANPDEHVGPE PEGAPIEEQG FGWTSTFGTG RGGDTITSGL EGAWTNTPVS WDNSFFEILF SYEWELTKSP AGANQWKPKD GAGAGTVPDA HDAAKSHAPT MLTTDLALRF DPIYEPISRR FLENPSAFAD AFARAWFKLT HRDLGPVARY LGPEVPTETL LWQDPLPAVA HELIDAADVA TLKGQILASG LSVSQLVSTA WASASTFRGG DKRGGANGAR IRLEPQRGWE VNEPDQLAAV LRTLTRIQEV FNAAQTGGKQ VSLADLIVLA GGVAVEQAAA NAGFDVEVPF APGRTDASQE QTDVESFAVL EPTADGFRNY LGKGHRLPAE YLLLDRANQL TLSAPELTVL VGGLRVLGAN YQQSPLGVFT ATPGSLTNDF FVNLLELGTT WKTTSEDANT FEGRDAATGK VRWTGSRADL VFGSNSELRA LAEVYASDDA REKFVHDFVA AWVKVMNLDR FDLV
|
| |