Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3338 |
Symbol | |
ID | 3904124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3955805 |
End bp | 3960367 |
Gene Length | 4563 bp |
Protein Length | 1520 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637880663 |
Product | hypothetical protein |
Protein accession | YP_482424 |
Protein GI | 86742024 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member |
TIGRFAM ID | [TIGR02686] conjugative relaxase domain, TrwC/TraI family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.389694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCTA CGGTGAAGGT ATTGACGTTG CGGGCCCGGG ATGGGGAGAC GGTTGCTCGG GCGGCGCGGG CGGTGGTGGC CTACGTGGAG GGTGGGCAGC CGGGGGCGGT GGCGCCGTTG CGCCGGTACT ACGGCGAGGG GTTGGTGCCC GGGTGGGCGC GCGGGTCGGC GGCGTATCTG GTCGGGTTGG ACGCGGGCCG GCCGGTGGCG GGGGAGGCGT TGGAGCGGCT GCTGCGCGGG GAGCACGCGG TGACGGGGAG GCCGTTGCTG ACGGCATTGG GGTCGGCGGG TCGGGCATCG CCGCCCGTGG AGGGGCAGCG TTCGGCCGGG CCGGGCGGGG GGTTGTTGAC GTTGGCCCAG GCGGCGCGGC GGGCGGGGGT GAGCGCGGCC TATCTGCGTG CGCTGGCGGT ACGCACCGCG GCCATGGCCA CCGCGGAGCG CTCAGCCTCC CGCGGTGGCG ATGCCGATGC CGGCGGGTCG GCGCAGGGTG CCGATCGGGC GGTGCACGAG CGGACAGGCG CGGCCGGGAT GCGACGTGGC CCCTCGGCGG GCAGCGTGCC GGGGGATGAG GGCCGGGCGG TGGACAACGG GGTGGGGGAG GGGCGGGGTC CGTGGTTGGC GGCGGTGCGG GAGGCGGGGA CGGGCCGGTG GCTGGTGAGT GCGACGGAGG TGGATCGGTT CTGTGCCGCG CGGGTGCCGC CGGCGGTGGT GTTGGGCTAT GACGTGACGT GTTCGGCGCC GAAGTCGGTG TCGTTGCTGT GGGCGTTCGG GGATGAGGAG ATTCGCCGGG ATGTGGCCGC GGCGATGGAC GCCGGTGTGG AGGCGGTGCT CGGCTATCTG GAACGGCATG CGACGGTGGG CACCGTTGCC GGCCGGAACC GTCCGGGGGT GGGGGTGGCG GCGGTGTCGT ACCCGCATGA GGTGTCGCGG AGCGACGAGG CCCATCTGCA TGTGCATTCC ATCGTGGTGA ACGCGACCGC CGTCCCCGAT CTGGATGAGC AAGGCCGGCC GGTGGCCGAT GAGCAGGGCC GGGGGCGGGT GGACTGGCGG GCGTTGGACG GGGAGGTGTT CCTCTCCCAC GTGAAGACCG CTGGCTATGT GGGGGCGGCG GCGCTGCGCC ATGAGCTGTC CCGGCGGCGC GGACTGGCCT GGGGGCCGGT GCGTAACGGG GTGGCTGAAC TCGCCGCCTT CCCCGCGCAG CTGCTGGCGG CGTTCTCCAC CCGGCATGGG GAGGTCCAGG CCGAGTACGC CCAGCTCGTC GCCGACGGGT TGACGCCGGG TGGGGTGACC GAGGCGGCGG CGCAGCGCGG TTCCCGGGCG GCGAAGAAGG TCCTCGCTGA CGCGCAGGTC CGCCGCATCC AACACGAACG GCTGACGGCC GCCGGATGGA CGCCGCAGCG GGTCCGCGCC CTTGCCGCAC CCGCCTCCCG GAACCGGGCC CCGGTCGACG GTGAGGACCT GGCCGGGTTG TGCGACCTGC TGACCGGCCC GGCCGGATTG ACCGAGCACG ACTCGACGTT CGACCGCCGT GCCGTGGTAC GACGGGTGGC CGCCTGGGCC GCCGATCGGC TGCCCGCGGA CGAGGTCGAC CGGCTCACCG ACCAGGTGTT GGCCGACCGG CGGATCGTCC TGCTCGGCCA CAGCGCCGCC CGGGCCCGCC AGCAACCCGA GCCGGTGTAC ACGACGCAGG AACTGCTCGA GGTCGAGGAC ACCCTGCTGG CCCTCTGCCG GCAGGGCCGG GTTGAGGCCG GCGCGCAGCC GCGGATCCTC GTCGACCCGG CCACGCTCGA AGCCCATCTC GCCGCCGCGC AGCAGCGGCC GTCATCGGAC GGCCCGGGTG GCGTCGGCGG CGAGGACGGC GGCGGCCAGG GGAACGGCGG CCAGGGGAGC GGTGGGCCGT CGGGGCCGGC GCTGTCGGCC GAGCAGATCA CCCTCGTGCG CCGGCTGCTC ACCTCCGGGG ATCTGGTGCG GCCGGTCGTC GGGCCGGCGG GGTCGGGGAA GACCGAGGCG ATGCGCCTGC TGACCCGGAT CGTCCACGCC GGCGGCGGGC AGGTGTTCGC CGCCGCGCAC GGGGGCCGGC AGACCGAGGA ACTCACCGGC CGGATCGGGG TGGCCGGGCG GGTGGTGTCC GGCTGGCTGA CCCTGCTCGA TCACACTGAG GATCCGGGCC GGGTGTGGCC GGCGGGCAGC GTGGTCATCG TGGATGAGGC CACGCAGGTC TCGACCCGGG ACGCCGCCCG ACTGGCGCGG TATGCGTCTC GGACCGGGAC CGTGCTGATC CTGCTCGGTG ACCCGGCGCA GCTCGGCGCG GTCGGCGCCG GCGGCTGGTA CGCCCATCTC GTCGCCTCCA CCCCCGACGT GCCGGCGTTG GGCAGTCTGC ACCGGCAGAC CGGCGCGGCG CTTGCGCCGG TCCGCGCCGC CCTCGGCGCG CTGCGCGCCG AGGGCGGCGC GTCGGCGAGG AAGGCTCTCG AGCTGCTGGC CGCGGACGGG CGGATCCGCC TGTTCGACTC CCGCGAAGCA CTGCTCGCCC AGGTCGTCAA CGACTGGTAC ACCGAACGCA CCGCACCGCA CCCTCGAGGC GCCACCGACC CGGACAGTGC CACCGACCCG GGCAGCGTGG ACGGCGCGGG CAGCACGGAG GGCGACGGGT GGACGGCCGC CGGTCCGGGA CGGCGCCGGT CAGGGGGTAC GGTCCGGCCC CGACCCGCGG CAGCGTTGCA CATGATGGCC GAGCGGACCC GCGACGTGGA GATCCTCGCC CGCGCCGCCC GTGACCGTCT CGCCGCCGAC GGCACCCTGA CCGGACCGGT CCTGACCGTG GCGGGCCGGG ACTTCCAGGC CGGCGACGAG GTCATCACCC TGACCCAGAC GGGTCACACC CTGATCCCCG CCGGTAAGCC CGCCTCGGCC TACATCCGTA CTGGCACCCT CGGCCGGGTC ACCGCCGTCC ACCTGGATCC CGACCATCCT GACCGGCAGG CCCTCACCGT CCGCTTTCCC AGGAAGGGCA GCGTGCGGGT GCCGTGGGAG TACCTCACCC ACCGGTTCAC CGATGGTCGC ACCGGCGGTC TCGGCTACGC CTACGCGATC ACTGCGGCGA AAGCCGAAGG CTCCTCCCTG CCCACCGCAC GAGCCGTCGC CCCCGACGAC ACCAGCCGCG CCGGCCTCTA CGTCATGCTC TCCCGAGCCC GGACCGATCT CGCGGCCTAC GTCATCCGCC GGGCCGATCT CGAAGCCGAC CTTGACGAAG AAGACTGGCT GCCCGTCCTG CGTGACCCGA CCGGACCCCT CGAACGCTTC GCCGACCACC TCGCCCAGTC CCGCACCGAA CGCCTCGCCA GCGAGTACGA CCCGCTCGCC CACGCTGCCC ACCGGCTGCG GCGCACCCAC ACCCTCGCCG GCCTCGCCCG ACTCCCCGCA CCCCCGCCAT CCGGCGGCGC GCACCGCGCG GCCGGCGCGC CGCCCGCGCC GCCCGCGCCG CCCGCCGTGC TGCGACGCGC GGAACTCGCA GCGGAAGCGG CCCTGCGCAC CGCCGCCGTC GCCAACCCGC CCGCCGATCT CGTCGCTCGC ATCGGGCCAC GACCCGCCGC CGCCGGCGCC GACCGCGCTC TGTGGGACCG GGCCGTCGGC GCCCTCGCCG TCCACCACGC CCGCTACCGG CCGGCCGTCC CACCCCACGA CCCTGGACCC CCACTCTCAT CCGGCGAACC GGCCGGCACC CTACGGGCCC GGTGGATGCT GCACCATGAT CAGGCCACGC GCCTCGCTCG AACCTGGGCC GACGTCCTAC CCCGCCGGGC CCGCGCCCGC TTCCACAGCC GCGCCGAACA GATCCCCCGC GCCCGCGCCA TCGCCGGCCT GCACGCCCTG CTCGACAACG GCCATCAGCC CGCTGATCTT CTCGTCGCGT TGACCCGCGA GGACCAGAGC AGCGTCCGCA CCGGCGCCGC CGTGCTCGAC CACCGCGTCA CCGACCTCTG CCAGCAGCAG GGACTCCACC CCACCGACTA TCTCCTCCCC CCGCCGCGTC CGGCCCGCGA CGAGTGGAAC GAGCTCGTCG GCCTGCTCGA CACCTGCGAG ATCCACCACC TGGCCCGTCA CCCCACCGCC CAGCTCGCCG CCGAACGCCG CCACCTCCGC GACGCTCAAG GCGCGACCGT CCCGCGGGCG AGGCCACACG GAGAAGGCAG CCGGGCAAGC ACCGTCGAGG CACGCACCGG CCGCCAGGAC AGGCTCCGGC TGATCGAGGA AGCCCTCGAC CGGCAGATCG CGCACGCCGT GTTCCGGGCC GGTATCGACC CCGCCGACTA CCTCACCGGA CTCCTCGGTG CCCGTTCCAG CGCAGGCCTG GACGCCACCG GATGGGATAG CCGGGTCGAG GCCGTCGAGG GGTTCCGCCA CCGCGACCTC GGCCTGCCCT ACGGCACCCC GGCCACCACC GACGGCGAGA CCGACCCGCT GCGCCGCGCC GTCGGCGACC GGCCCACCGA TCCGGCCCTC GCGGAGGGCT ACCGGGGAAT ACGTGCGCTG ATCCGGGAGC ACACCCCAAC CCTCGATTTA TGA
|
Protein sequence | MIATVKVLTL RARDGETVAR AARAVVAYVE GGQPGAVAPL RRYYGEGLVP GWARGSAAYL VGLDAGRPVA GEALERLLRG EHAVTGRPLL TALGSAGRAS PPVEGQRSAG PGGGLLTLAQ AARRAGVSAA YLRALAVRTA AMATAERSAS RGGDADAGGS AQGADRAVHE RTGAAGMRRG PSAGSVPGDE GRAVDNGVGE GRGPWLAAVR EAGTGRWLVS ATEVDRFCAA RVPPAVVLGY DVTCSAPKSV SLLWAFGDEE IRRDVAAAMD AGVEAVLGYL ERHATVGTVA GRNRPGVGVA AVSYPHEVSR SDEAHLHVHS IVVNATAVPD LDEQGRPVAD EQGRGRVDWR ALDGEVFLSH VKTAGYVGAA ALRHELSRRR GLAWGPVRNG VAELAAFPAQ LLAAFSTRHG EVQAEYAQLV ADGLTPGGVT EAAAQRGSRA AKKVLADAQV RRIQHERLTA AGWTPQRVRA LAAPASRNRA PVDGEDLAGL CDLLTGPAGL TEHDSTFDRR AVVRRVAAWA ADRLPADEVD RLTDQVLADR RIVLLGHSAA RARQQPEPVY TTQELLEVED TLLALCRQGR VEAGAQPRIL VDPATLEAHL AAAQQRPSSD GPGGVGGEDG GGQGNGGQGS GGPSGPALSA EQITLVRRLL TSGDLVRPVV GPAGSGKTEA MRLLTRIVHA GGGQVFAAAH GGRQTEELTG RIGVAGRVVS GWLTLLDHTE DPGRVWPAGS VVIVDEATQV STRDAARLAR YASRTGTVLI LLGDPAQLGA VGAGGWYAHL VASTPDVPAL GSLHRQTGAA LAPVRAALGA LRAEGGASAR KALELLAADG RIRLFDSREA LLAQVVNDWY TERTAPHPRG ATDPDSATDP GSVDGAGSTE GDGWTAAGPG RRRSGGTVRP RPAAALHMMA ERTRDVEILA RAARDRLAAD GTLTGPVLTV AGRDFQAGDE VITLTQTGHT LIPAGKPASA YIRTGTLGRV TAVHLDPDHP DRQALTVRFP RKGSVRVPWE YLTHRFTDGR TGGLGYAYAI TAAKAEGSSL PTARAVAPDD TSRAGLYVML SRARTDLAAY VIRRADLEAD LDEEDWLPVL RDPTGPLERF ADHLAQSRTE RLASEYDPLA HAAHRLRRTH TLAGLARLPA PPPSGGAHRA AGAPPAPPAP PAVLRRAELA AEAALRTAAV ANPPADLVAR IGPRPAAAGA DRALWDRAVG ALAVHHARYR PAVPPHDPGP PLSSGEPAGT LRARWMLHHD QATRLARTWA DVLPRRARAR FHSRAEQIPR ARAIAGLHAL LDNGHQPADL LVALTREDQS SVRTGAAVLD HRVTDLCQQQ GLHPTDYLLP PPRPARDEWN ELVGLLDTCE IHHLARHPTA QLAAERRHLR DAQGATVPRA RPHGEGSRAS TVEARTGRQD RLRLIEEALD RQIAHAVFRA GIDPADYLTG LLGARSSAGL DATGWDSRVE AVEGFRHRDL GLPYGTPATT DGETDPLRRA VGDRPTDPAL AEGYRGIRAL IREHTPTLDL
|
| |