Gene Information |
Locus tag | Francci3_1102 |
Symbol | |
ID | 3905773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1313124 |
End bp | 1316426 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637878434 |
Product | putative DNA methyltransferase |
Protein accession | YP_480211 |
Protein GI | 86739811 |
COG category | [R] General function prediction only |
COG ID | [COG4889] Predicted helicase |
TIGRFAM ID | |
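The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. Both assumptions are inferred from the numbers rather than stated by the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 1313124
end_bp = 1316426
gene_length_bp = 3303
protein_length_aa = 1100

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so the number
# of codons is one more than the number of amino acids.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # span matches the reported length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop = protein length
```

Under these assumptions all three checks hold for this record: 3303 bp is 1101 codons, that is 1100 amino acids plus one stop codon.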
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.822518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGG CAACGGATCA TTGGCTGCTG AAGGCAGTAT CGGATTTTGG TCGGGCGTGT GGCACCAAAC TCTCCGGAGG TGGCAGCCCG GAAGCCTCGA TACGAAGCCC GCTGGAAGGG CTCCTGGCGG CAGTTGGCAG ACATCACAAG CTGTCCGAAC TTACCTGGCA TGACGAAGTG AGGCTTCCCG AACTCGGAGT GCGGCCCGAC TACGCAGTCC GGCTGAGTGG CCTGGTCACC GGATACATAG AAATCAAGAA GCCCGGCCTG TCCGTAGACC CTCAGAATTT TACCGGGCAC AACAAGGAGC AATGGGAACG GCTGCGAGAC CTTCCAAATC TACTTTACAC CAACGGCACC GAATGGCGGC TTTTTCGGGA CGGGATCCAG ATAGGAGAGA CCATTCACTT CACCGGAACT CTCCGGACAA GCAGGGAGCG GCTTCGGCCA CCGGATCCGG CCGCCTTCGA CGCACTGATC ACGACGTTCC TCGTCTGGAG CCCGCCACCA ATCCAGAACG TTGGCAGGCT CGTGCAGAAC ATCGCACCGT TGTGCCGCCT CCTCCGCGCT GCCGTCCTGG AGCAGCTCAC CGCCGAAGCG AAGTCGGACG CACCGGAGAA CGACGGCCAC GCACGTCCCT TCACCGGCCT CCAGAACGAC TGGCGGACGC TACTCTTCCC CGAGGCAGAC AGCCGGACCT TCGCCGACGG ATACGCCCAG ACGGTAACCT TCGCGCTGCT GCTCGCACAA ACGGAGGGAA TTCCTCTCCT GCAGGTCGGA TTTCATGATA TCGGCCGGAA GCTGAACGCC GGCCACGCAC TTATCGGCAG CGCGCTGCGA CTGCTCACCG ACAATATAAA CGAGCGTTTT TCGGTGACGC TCGATCTACT GACCCGCACC ATCTCCAGTG TCGATTGGCC TGCCATCCGG AACGGCAACC GGGATGCCTA CCTACATCTC TACGAAAACT TCCTGACCAG GTATGACGCA CAGCTACGAC AGCAGAGCGG TTCGTACTAC ACGCCGCGAG AGGTCGTGGA GCACATGGTC CGGTTGGCAG AGGATGTGCT GCGCACCCGG CTCGGCAAGG ACCATGGTTA CGCCGATCCG GACGTTCGCA TCGTCGACCC GGCGATGGGA ACCGGCACTT TCCTTCACGC GATCATCGAG AGAGTCGCAG AAACTGCCTC CGAAGGAGGG GGCGAAGGCA TGGAAATCGA CGCGGTCGCG CAGCTCGCGG AACGACTGTA TGGCTTCGAG CTGCAGATAG GTCCCTACGC GGTCGCCGAA CTCCGAACCT CAGACCTGTT AAGAGCGGAG GAGATCCCAG CGCCGCGAGA GGGACTCAAT CTCTTCCTAA CAGACACGCT TGATTCGCCG TTCAGTGATA CGCAGAAAGC ACTCTTCGGC TATCGGGAAC TGGCGGCTTC CCGACAGCGA GCCGATCAGG TAAAGGGCAA TGTTCCCGTG ACTGTTGTCA TCGGCAACCC CCCCTATGAC GACAAGGCGA AGAAACGCGG AAAATGGGCA GAGAAAAAAA TTCCAGGAGA GAATAGGACT CCGCTCGACG CTTTCCGTCA CCCCGGCAAC GGCCGGTACG AACACGTACT GAAGAACATG TACATCTACT TTTGGCGCTG GGCGACTTGG AAGGTCTTCG ATGCCCACGA GGCCGACCAA CATGGATTGG TCTGTTTCAT CACGCCATCG GGCTTTTCCA CTGGCCCTGG CGGCCGTGGC CTGCGCGACT ATCTCCGTCG CACCTGTCAC GAAGGATGGG TTATCAACCT CTCTCCCGAG 
GGACAGCGCG CGGATGTTGC CACCCGCGTC TTTCCAGCAG TCGCGCAGCC ATTGGGTATC TATATCTTCG TACGCCGCGC GGGGAGCTCT CCGGACGGCT CGACCCGGAT CCACTACCGC TCGATCTCGG GCCGACGCGA GGAGAAGTTC AGACAGCTGG CCCATGTGGA GATTGACGAC GGCGGCTGGA GGGATACGCA TCGCGAGCGA TCTCGTCCGT TCACGCCGAC TACGCAATCG GCGTGGGAGG ACTTTCCAGA GATTTCCGAT ATATATCCGT GGGGAAGTCC TGGAGTAAAG GCAAACCGCT CCTGGGTGGC TGCTCCCAGC GCGGAGATCC TGCGTCGCAG GTGGGCGCGT TTGGTCCGCG AGGACGACCC GGACGCGAAA GAGGAGCTTT TCAAGGAGAC TCGGGACCGC ACGCTGCTAC GTGCGGTGTC GCCGCTCCCC GGATACTCTG GACCTCGAGT GCCTATCGGG CGGGAGAGTA GCCTGACCCC ACCTGTGATC CGGGTCGGGT TCCGCAGCTT CGATCGACAG TGGCTGATTG CCGACCCCCG TGTGCTGGAT TTCGCCCGAC CAGACCTGTG GGCCTCACTT CACGACGACC AGATCTTCCT CAACCAACAG TCCTCGCATG AGATCGAAAG CGGACCGGCG GTTGTGGCGA CCGCATTACT TCCAGACACC CACCATTTTA ATGGGCGGGG TGGAAGAATT CTGCCGCTAC TGCAACCGGA TGGCTCAGCC AACGTTCCAG GCGGACTGCT ATCCTATCTT GCCAGCGCAT TCGGACTGGA GAAGGTTACC GTCTTGGATC TGGCTGCTTA TACAGTCGCA GTCGCGGGGC ACTCGGCCTT CACCGAGCAG TTCGCGGAAG AACTGCTCAC GCCCGGCGTT CGACTTCCTG TCACACGGAA CCTCGAGAGC TGGCGAAAGG CGTTGTCGAT CGGAGCCGAG ATCCTGTGGG CTTCGACCTA CGGCGAAAGG TGTGCAGATC CGGATGCTGG TCGTCCCACC CGCGACGTCA GGTTCAAGAG CGGGGATCCA CGTCAGGTCC GATACTTGAC CAATATCCGG AAACAGATCC CAGAGAATTT TCACCACGAC CCAACAACAA ACACTCTACA TGTTGGCGCA GGGTCTTTTG GGAACGTTCC AGAGGAGGTA TTGACCTTCA ACGTCGGCGC GATGCCAGTA GTTAGGAAAT GGTTTGGATA TCGCAAGTTT TCCCCCAACA GCAAGAAGAC GAGCCCGCTC GACGATATTC ACGTCGACAG CTGGCCCCGT GAGTGGGCCG CAGAACTCGT CGAGCTCCTG TCGGCCCTAC GCCGCCTCGT GGATCTGGCA CCGGCGCAGC GCGATCTTTT GGCCGAGGTT CTGGCGGGTC CCATGGTGAC CCAGCAGGAT CTCGCGGTTG CGGGAGTGCT CCCTGTGCGT CCCGCCGCAC GGAAGCCGCG GTATGAAAAC CCGGTCGGGC TGTTCGTGGA CGGAGACGCC TGA
|
Protein sequence | MSAATDHWLL KAVSDFGRAC GTKLSGGGSP EASIRSPLEG LLAAVGRHHK LSELTWHDEV RLPELGVRPD YAVRLSGLVT GYIEIKKPGL SVDPQNFTGH NKEQWERLRD LPNLLYTNGT EWRLFRDGIQ IGETIHFTGT LRTSRERLRP PDPAAFDALI TTFLVWSPPP IQNVGRLVQN IAPLCRLLRA AVLEQLTAEA KSDAPENDGH ARPFTGLQND WRTLLFPEAD SRTFADGYAQ TVTFALLLAQ TEGIPLLQVG FHDIGRKLNA GHALIGSALR LLTDNINERF SVTLDLLTRT ISSVDWPAIR NGNRDAYLHL YENFLTRYDA QLRQQSGSYY TPREVVEHMV RLAEDVLRTR LGKDHGYADP DVRIVDPAMG TGTFLHAIIE RVAETASEGG GEGMEIDAVA QLAERLYGFE LQIGPYAVAE LRTSDLLRAE EIPAPREGLN LFLTDTLDSP FSDTQKALFG YRELAASRQR ADQVKGNVPV TVVIGNPPYD DKAKKRGKWA EKKIPGENRT PLDAFRHPGN GRYEHVLKNM YIYFWRWATW KVFDAHEADQ HGLVCFITPS GFSTGPGGRG LRDYLRRTCH EGWVINLSPE GQRADVATRV FPAVAQPLGI YIFVRRAGSS PDGSTRIHYR SISGRREEKF RQLAHVEIDD GGWRDTHRER SRPFTPTTQS AWEDFPEISD IYPWGSPGVK ANRSWVAAPS AEILRRRWAR LVREDDPDAK EELFKETRDR TLLRAVSPLP GYSGPRVPIG RESSLTPPVI RVGFRSFDRQ WLIADPRVLD FARPDLWASL HDDQIFLNQQ SSHEIESGPA VVATALLPDT HHFNGRGGRI LPLLQPDGSA NVPGGLLSYL ASAFGLEKVT VLDLAAYTVA VAGHSAFTEQ FAEELLTPGV RLPVTRNLES WRKALSIGAE ILWASTYGER CADPDAGRPT RDVRFKSGDP RQVRYLTNIR KQIPENFHHD PTTNTLHVGA GSFGNVPEEV LTFNVGAMPV VRKWFGYRKF SPNSKKTSPL DDIHVDSWPR EWAAELVELL SALRRLVDLA PAQRDLLAEV LAGPMVTQQD LAVAGVLPVR PAARKPRYEN PVGLFVDGDA
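The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal Python sketch, assuming the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and using the amino acid assignments of translation table 11, which are the same as those of the standard code. The helper name `translate` and the constants are illustrative and not part of the record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 shares these amino acid assignments with the
# standard genetic code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 12 bases of the gene sequence above.
first_30 = "ATGTCCGCGG CAACGGATCA TTGGCTGCTG"
last_12 = "GGAGACGCC TGA"

print(translate(first_30))  # MSAATDHWLL
print(translate(last_12))   # GDA*
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final three residues followed by the TGA stop codon.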