Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1684 |
Symbol | |
ID | 3903071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2020477 |
End bp | 2022474 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879022 |
Product | resolvase-like protein |
Protein accession | YP_480789 |
Protein GI | 86740389 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.560816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGTG CGACCTCGCG TCGCAAGAGC GCGAACCGCA CCCCCCAGCC CGCCGTCGAC CCGCTCGACA CCGTCCGGGT CGGGATCTAC CTGCGGCGCT CCACCGACGA CGAGCACCAG CCTTACTCCA TCGAAGCCCA GGAAGAACGA CTCCGTTCCT ACATCGACTC CCAACCCGGC TGGGCCATCG CCCTCCGGTT CTCCGACGAC GCCTCCGGCG CCACCACCGA ACGCGACGAC CTGCAACGCG CCCTGTCCGC GGCCCGCCAC GGACTGATCG ACGTCCTGCT CGTCTACCGG GTCGACCGAC TCTCCCGCAA CCTGCGCGAC ACCGTCACCC TGCTCGAAGA ACTTGACCAG GCTGGCGTCG TGTTCCGCTC GGCCACCGAG CCGTTCGACA CCGCGACCCC GATGGGCCGC ATGCTGCTCC AGATGCTCGC GATGTTCGCA CAGTTCGAGC GCGACACGAT CATCGACCGG GTCATCGCCG GCATGGAACG CAAAGCCGCC AAGGGCCTAT GGATGGGCGG CAACCGGCCC TTCGGCTACC AGGTCGACCG CGCCAACTGG AAACTGCTCG TCGACGAGAA GGAAGCGCCC GTCGTCCGCC TGATCTTCAA CCTCTACGTC AAGGAACGGG TCGGCACCCG CGCCATCGCC AAGACCCTCA ACGAGCGCGG CCACCGCACC ACCACCGGCG GACCCTGGTC CGGACACCAG GTCCTCCGCG TCCTGGACAA CCGCATCTAC CTCGGCGAGC TGACCTTCCG CGAGATCACC GTCACCGACA CCCACAAGCC GATCATCGAG GCGGCCCAGT TCGCTGAGGC CGAGAAGATC CTCACCATCC GCAGCGACGG ACACACCCAC CGAGCCGCCA GCGACTCCGA CTACTACCTC ACCGGCCGCA TGCGCTGCCC GCAATGCGCC AAAGCGATGC TCGGCTCCAA CGCCGGGGGA CGCAACCGCA CCTACCGCTA CTACACCTGC TTCACCCGCC TGCGCTACAG CCGCGACCGC TGCGACGCCC CCCGCCTCGA CGCCGACGCC CTCGACCAAG CCGTCCTCAC CGCACTCGCC GCCTTCTACC GCGACCACCA GCAGCTCATC TCCGACGCCG TCCACCACGC CCGCCAGCGC CACCACGACG CCCACGCCGA CCGCACCGGC GAACTCGCCA CCGTCCAGGC CGACCTCACC CAAACCGACC AGGCCATCGA CCGCTACCTA TCCGCGTTCG AGCGCGGCAC CCTCGACGAA GAAACCCTCG CCACCCGACT CGCCACGCTG CGCACCAAGC AGAAGCAGCT CCGCCGCCGA CAGACCGAAC TCACCGCCCA GATCGACGAC GAACCCGTCA TGCCCCCACG CGCCACCCTC AGCAAGATCG CAGGCCACAT CGATACGATC ATCGAGGTCG GCACCGACCT TCAGCGCAAA GCCCTCGTCG AAGCCCTCAT CCACGAAGTC AAGATCCTCG GCCCCGGACG GCTCCAACCG GTCTTCAAAG TCCCCAGACC CGAGCCGAGC GAGACCGCCG CAGCCGCCCT ACCAGCCACA ACGCCCCCGA AGGGAGCGGT TCGTACAATG CCAAACATGG TGGAGCGGGT GGGACTCGAA CCCACGACCG ACGGAAACAC CGTCGCAACC ACGACGTCCA TACCGTCGGC ACGGTGCCGC ACGCGTCGTC GAGGTGTCAC CCTGTTCCCG TGGACACACC GTCGATCCAT CAGCCCCGCC AGCGCCGGTT CACCGACAGC GACGATCCCG GGCCGAACGA AGATCCCGAG CAGCTCCGGC TCTTCCACCG CTGGCGTGCG GCCCGGTCAC CGTCGTCGGT GGTTCATCTC GACTCGGCCG GCGCGGCCCG CCCCAGTCGG GCCACGATCG CGGCCGAGGC CGCGTACCTG GAGCGCGAGA GCGCACTCGG CGCGTACGCC GCCGAGGTGG AGGCAGGCAG CGGGCTCCAC GCCACGCGGG CCAGGCTCGT CAGCCTGA
|
Protein sequence | MARATSRRKS ANRTPQPAVD PLDTVRVGIY LRRSTDDEHQ PYSIEAQEER LRSYIDSQPG WAIALRFSDD ASGATTERDD LQRALSAARH GLIDVLLVYR VDRLSRNLRD TVTLLEELDQ AGVVFRSATE PFDTATPMGR MLLQMLAMFA QFERDTIIDR VIAGMERKAA KGLWMGGNRP FGYQVDRANW KLLVDEKEAP VVRLIFNLYV KERVGTRAIA KTLNERGHRT TTGGPWSGHQ VLRVLDNRIY LGELTFREIT VTDTHKPIIE AAQFAEAEKI LTIRSDGHTH RAASDSDYYL TGRMRCPQCA KAMLGSNAGG RNRTYRYYTC FTRLRYSRDR CDAPRLDADA LDQAVLTALA AFYRDHQQLI SDAVHHARQR HHDAHADRTG ELATVQADLT QTDQAIDRYL SAFERGTLDE ETLATRLATL RTKQKQLRRR QTELTAQIDD EPVMPPRATL SKIAGHIDTI IEVGTDLQRK ALVEALIHEV KILGPGRLQP VFKVPRPEPS ETAAAALPAT TPPKGAVRTM PNMVERVGLE PTTDGNTVAT TTSIPSARCR TRRRGVTLFP WTHRRSISPA SAGSPTATIP GRTKIPSSSG SSTAGVRPGH RRRWFISTRP ARPAPVGPRS RPRPRTWSAR AHSARTPPRW RQAAGSTPRG PGSSA
|
| |