Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2949 |
Symbol | |
ID | 3903764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3483997 |
End bp | 3486126 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637880270 |
Product | catalase |
Protein accession | YP_482036 |
Protein GI | 86741636 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG0693] Putative intracellular protease/amidase [COG0753] Catalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.962169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.093907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC CCGGGACGTC GCGCCTCCCC CGGGACGACG CGAAGCAGCA GCAACTCGAC GCCGTCCGAA TCGGCGACGA CGGCGCCCGC ATGACCACCG ACCAGGGGAT CGGTGTCGAA CACACCGACG ACTCGCTCGC GGCGGGGGAA CGCGGTCCGA CGCTGCTGGA GGACTTCCAC TTCCGGGAGA AGCTCACTCG GTTCGATCAC GAGCGCATCC CCGAGCGGGT CGTCCACGCC CGTGGTGCCG GTGCCTACGG GTACTTCGAG GCGTACGAGT CGCTCGCCGA CGTGACCCGG GCGCACTTCC TGGGCGAGGA CGGCCGGCGT ACCCCGGTGT TCGTACGGTT CTCGACCGTC GGCGGTTCCC GCGGTTCCGC GGACACGGTC CGGGACGTAC GCGGATTTGC CGTAAAATTT TATACCGAGG AGGGTAACTT CGATCTCGTC GGTAACAACA TGCCGGTGTT CTTCATTCAG GACGGCATCA AGTTCCCCGA CTTCGTCCAT GCGGTGAAGC CGGAGCCGCA CAACGAGATC CCGCAGGCAT CCTCCGCGCA CAACACCCTG TGGGACTTCG TCAGTCTCGT CCCCGAGTCG ATGCACATGA TGATGTGGCT GATGTCGGAC CGGGCGCTGC CGCGCAGCTA CCGGATGATG CAGGGCTTCG GCGTGCACAC CTTCCGGTTC GTCGATGCCG CCGGGCAGGG GACCTTCGTC AAGTTCCACT GGCGGCCGAA GCTCGGCACG CACTCCCTGG TCTGGGACGA GACACAGAAG ATCGCGGGCA AGGACCCCGA CTTCAACCGG CGGGACCTGT GGGAGTCGAT CGAGAACGGT ACGTTCCCGG AGTGGGAACT CGGCGTGCAG CTCGTGCCGG AATCCGACGA GCACGCGTTC GACTTCGATC TTCTCGACGC GACGAAGATC ATCCCTGAGG AGCGGGTGCC GGTGCGGCCG GTGGGACGGC TGGTGCTCAA CCGCAACCCC GGCAATTTCT TCGCCGAGAC CGAGCAGGTG GCGTTCTGCG TACAGAACGT CGTCCCCGGC ATCGACTTCA CGAACGACCC GCTGCTGCAG GCCCGGCTGT TCTCCTACCT GGACACCCAG CTGATCCGGC TCGGCGGCCC GAACTTCGCC CAGCTGCCGG TCAACCGGCC GGTCGCGGCG GTGCACAACA ACTCCCGCGA CGGTTACGGG CAGCATCGCA TCCAGCAGAG CCAGACGTCG TACTTCCCGA ACTCGATCAG CGGCGGTTGC CCGGTGCTCT CCGATCCGGC GCACGGCGGG TACGTGCACT ACGCCGAGCG GGTCGACGGG AACACGATCC GTAAGCGGAG CGAGAGCTTC AAGGACTTCT ACAGCCAGGC CACCCTGTTC TGGAACAGCA TGTCCTCCTG GGAGCGGCGG CACATCGTCG ACGCGTTCAG CTTTGAGCTC GGCAAGGTCG ACTACACCCA GATCAAGGAG CGTGTCCTGG GACATCTCGC CCAGGTGGAC CACGAACTCG CGGCCGGGGT CGCCGAGAAC CTCGGACTGC CGGTGCCACC GGAGAGCACC CCGAACCACG GACGATCCTC ACCCGCGCTG AGTCAGGCCG ACCAGCCCAG CGGGGTGGCC ACCCGCAGGA TCGCGGTGCT CGCGGCGGAC GGCGTGGACG AGGTGGCATT GCGTTCGGCC ACCGCCGCGC TGCGCGAACA GGGCGCGATC CTGGAGGTTC TCGCGCCGCA CGGAGGCATG CTCGCCACGG TGTCCGGCGA CGCGTTGCCG GTGGACCGAA CGCTGGTGAC CATGTCCTCG GTCCTCTACG ACGCGGTGTT CGTCGCCCCG GGCGAGCGCG GTGTCACCGC ACTGACACAC AACGGCGAGG CCGTGCACTA CGTCGGCGAG GCGTACAAGC ACGCGAAGCC GATCGGCGCG GTCGGCGCGG GGGTGTCGCT GCTCGAGATC GCATCCTTGC CGGGCGCGCG GGTCGCCGAC CAGGGTGACG CGGTGGTGTC CGACCGGGGT ATCGTGACGG TCCGCGACCT TGCGGGGCCG TCCGTGCTCG GCGACTTCGG TTCCGCGTTC GCCACGGCCG TGGCCGCCCA TCGGCACTTC GACCGCGCGC TGGAGGCCGT CGCCGCGTAG
|
Protein sequence | MTDPGTSRLP RDDAKQQQLD AVRIGDDGAR MTTDQGIGVE HTDDSLAAGE RGPTLLEDFH FREKLTRFDH ERIPERVVHA RGAGAYGYFE AYESLADVTR AHFLGEDGRR TPVFVRFSTV GGSRGSADTV RDVRGFAVKF YTEEGNFDLV GNNMPVFFIQ DGIKFPDFVH AVKPEPHNEI PQASSAHNTL WDFVSLVPES MHMMMWLMSD RALPRSYRMM QGFGVHTFRF VDAAGQGTFV KFHWRPKLGT HSLVWDETQK IAGKDPDFNR RDLWESIENG TFPEWELGVQ LVPESDEHAF DFDLLDATKI IPEERVPVRP VGRLVLNRNP GNFFAETEQV AFCVQNVVPG IDFTNDPLLQ ARLFSYLDTQ LIRLGGPNFA QLPVNRPVAA VHNNSRDGYG QHRIQQSQTS YFPNSISGGC PVLSDPAHGG YVHYAERVDG NTIRKRSESF KDFYSQATLF WNSMSSWERR HIVDAFSFEL GKVDYTQIKE RVLGHLAQVD HELAAGVAEN LGLPVPPEST PNHGRSSPAL SQADQPSGVA TRRIAVLAAD GVDEVALRSA TAALREQGAI LEVLAPHGGM LATVSGDALP VDRTLVTMSS VLYDAVFVAP GERGVTALTH NGEAVHYVGE AYKHAKPIGA VGAGVSLLEI ASLPGARVAD QGDAVVSDRG IVTVRDLAGP SVLGDFGSAF ATAVAAHRHF DRALEAVAA
|
| |