Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6791 |
Symbol | |
ID | 5675104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8272731 |
End bp | 8274254 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245640 |
Product | catalase |
Protein accession | YP_001511031 |
Protein GI | 158318523 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0753] Catalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTTC AGAAGCCTCG TCCGACGACG ACCGATGCCG GAATACCCGT CGCCAGCGAT GAGCACTCGC TGTCCGTGGG GCCGGACGGC CCGCTGCTGC TGCAGGATCA CTACCTGATC GAGCAGATGG CGAACTTCAA CCGGGAACGC ATCCCGGAGC GTCAGCCGCA CGCGAAGGGC GGTGGCGCGT TCGGCACCTT CCAGGTCACC CAGGACATGA GCGCGTTCAC CCGGGCCGCG GTGTTCCAGC CCGGTACCGA GACGGATGTG ATCATTCGGT TCTCGACGGT GGCCGGGGAG CGCGGCAGCC CCGACACCTG GCGCGACCCA CGCGGTTTCG CGGTCAAGTT CTACACCAGC GAGGGCAATC TCGACATCGT CGGGAACAAC ACGCCGGTCT TCTTCATCAG GGATCCGCTG AAGTTCCAGC ACTTCATCCG GTCCCAGAAG CGCCGGGCCG ACAACAACCT GCGTGACCAC GACATGCAGT GGGACTTCTG GACGCTGTCA CCGGAGTCCG CGCACCAGGT CACGTGGCTG ATGGGTGACC GCGGCATTCC GCGCACCTGG CGACACATGA ACGGATACTC CAGCCACACG TACATGTGGA TCAACGCCGC CGGCGAGCGG TTCTGGGTGA AGTACCACTT CAAGACCGAT CAGGGCATCG AGTTCTTCAC CCAGGACGAG GGTGACCAGA TGGCCTCGGT AGACACCGAC TACCACCAGC GCGACCTCTT CGAGCACATC GCCGCCGGGG ACTTCCCGAG CTGGACGCTG AAGATGCAGA TCATGCCGTT CGAGGAGGCC AAGACCTACC GGTTCAACCC CTTCGACCTG ACCAAGGTCT GGCCGCACGG CGACTACCCG CTGCACGAGG TCGGCCGGCT GACACTCAAC CGCAACATCA CCGATTTTCA CACGGAGATG GAGCAGGCCG CGTTCGAGCC GAACAACATC GTCGCGGGCA CCGGGCTGTC GCCGGACAAG ATGCTGCTCG CTCGCGGGTT CAGCTACGCC GACGCCCACC GGGCCCGCCT TGGTGTGAAC TACAAGCAGA TCCCGGTGAA CTCCGCGAAG GTGCCCGTCC ACAGCTACTC CAAGGACGGC GCGATGCGGG TGCAGAACGT CACCGACCCG GTGTACGCGC CGAACTCCTA CAGCGGCCCG GTGGCCCAGC CGGAGCTGAC CGACGACGGC GGCCACTGGT ACTCCGACGG CGAGATGGTT CGCGCCGCTT ACACCTCGCG CCCGGAGGAC GACGACTGGG GGCAGGCCGG GACGATGGTC CGCGATGTCC TCGACGACGC CGCGCGGCAG CGGCTGGTCG ACAACGTCGT CGGGCACCTG CTCGACGGTG TCAGCGAACC CGTGCTGGTC CGCGCCTTCG AGTACTGGCG CAATGTCGAC AAGGACCTCG GCACCCGGAT CGAGGCAGAC GTACGGGCCA AGCAGGACGA GACCGATCCG AAGGCCGCGC AGCAGGCCAA CCCGGCCCGG TCCAGCGCGC AGGCCAAGGC CTGA
|
Protein sequence | MTVQKPRPTT TDAGIPVASD EHSLSVGPDG PLLLQDHYLI EQMANFNRER IPERQPHAKG GGAFGTFQVT QDMSAFTRAA VFQPGTETDV IIRFSTVAGE RGSPDTWRDP RGFAVKFYTS EGNLDIVGNN TPVFFIRDPL KFQHFIRSQK RRADNNLRDH DMQWDFWTLS PESAHQVTWL MGDRGIPRTW RHMNGYSSHT YMWINAAGER FWVKYHFKTD QGIEFFTQDE GDQMASVDTD YHQRDLFEHI AAGDFPSWTL KMQIMPFEEA KTYRFNPFDL TKVWPHGDYP LHEVGRLTLN RNITDFHTEM EQAAFEPNNI VAGTGLSPDK MLLARGFSYA DAHRARLGVN YKQIPVNSAK VPVHSYSKDG AMRVQNVTDP VYAPNSYSGP VAQPELTDDG GHWYSDGEMV RAAYTSRPED DDWGQAGTMV RDVLDDAARQ RLVDNVVGHL LDGVSEPVLV RAFEYWRNVD KDLGTRIEAD VRAKQDETDP KAAQQANPAR SSAQAKA
|
| |