Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2575 |
Symbol | |
ID | 3904067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3036647 |
End bp | 3040372 |
Gene Length | 3726 bp |
Protein Length | 1241 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637879900 |
Product | hypothetical protein |
Protein accession | YP_481666 |
Protein GI | 86741266 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.633416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGGCC TGAATCTGGA GATCGACGCG CTGGTCGGCC CGGCGCTGGA TCTACTGGCC ACCGCGGAGC GCGACGGGGA CCCGGTTGCC GCCCGGGCCG CCGTCGGCCT GCTGGCTCAG GCATATGACC TCGCGCCCGG CCACTCCGGC GCGCCGGGCC TGGCGATGCT CATCGCCGAG TGGGAGGGCG AGCTCGCCCA GGCGGATCTG GGCAGCTGGG ACAGTGTGGT GCTGTGGTGG CGCCGGGGCT GGCCGGCGCC CCGGCCCGGC GGCCGCCCGT CCCTGAGCAT TGCCGAACTC GATGCCATGG AAGGCGCGCT GCACGGCGAG CGCTACGTCA ACGGTCGCGA TCCCGCCGAG CGGGACGCCG CCATGCGCCT GCTCGCCCCG CTGGCCGACG CGGGCAAGCT CGGTCCGCAG GACCTACTCG GCTACGCAGC GCTCCTGATC GTCTGGTACG AGCAGGACGC GGTGCCGGCC GCCCGCGACA CCGCGCTCGA CCTGCTGGCG CCCATGGTCA CCGCGGGCAC CGCCGACCCG GGCATCGTGG CCGACTACGC AGAGCTCGTC TTCGACCGCG CCGAACAGAG CGGTACCACC CGGGATTGGG CCCAGGCCGT CCACTGGCTG CGCGAATCCC TCCGCATCGG CGGCGCCCAA AACGTCGCCG ACACCTGGGA CAGCCTCGGC CACGCCTACT GGGCGTGGAG CCGGCTCGAT GACGACGACG CCCTGTTCGA CGAGGCGGTC GACTGTTTCA CCGCGGCGGT ACGGCACGGT CAGCCCGCCG AGCTCCTCGC GCCGGCAGCG CTGGTGTTCG TCGCGGACCG GCTGCGCGAG CAGCGGCGCA GCGCGGGCGG CGTGGACGAC GCGCTGCTGG CGGACATCCG CGCCGTGCTG GCACTCGCGG ACCAGATCGT CGACCGGCCG GAACTGGATA CCGAGATCCG CGCGACGGTG GCCATGGAGA TTCTCACGCT GCAGTACCTC ATCGTCGACG GGAACGAGAT CACCCGCCGG CTGGTGCAGG AGGGCCTGGA GGCGTTCCCG CTGGCCGGGG ACAGGTTTGA CCGCCTGCTC GGCTACGCCG ACGCGCACCC CGACCCGCCG CAGGGCTGGT CGAGCAGCCG GGCCGCGATG CGGGCGATGA CCAGTCTGCT GCGCGCCATG TTCGGCACCG GGCCCCGGCC CGGCGCGGAC TTCGGCCTGC AGCATGCCGC CGAGGCCATG CGCACGGCCG AGCACTCGGA CGACGCCGCC GCGTTGTTCT CCCTCATCTC CGCTGTCGCC GGTGGCTTCT CCGGCGACCT GGGCCGGGTG AAGGCCGCCG AGCAGCTGGT CGGGACCTCC GCGGACCCGA GCGCCACGGC GATCTCCCTG TTCGCCCGAC TGGGTCAGAT GCTGCTGGTC GGCATGCCGG CCGACGGCGC CCGCACCGCC GTCGAGCTGC TGTCCGTCTT GGCCCGGTCG CTGGCACAGG TTCTGGGCGG TGGCCTTCGG GGCAGGCTGC AGCCACCTGC CGCCGACACG GGCAAGCCCG GTCACCTCAT CGAGGTCAGC CAGTACCTCG ACGTCAGCCG GCGCCTCGAC GTCGGTCAGC TCCGGGAGAT CGCCGACGTC TTCGCCGCGT TCCCGGGCTG CGACGCCCTT GTCGCGCAGG CAGCGGCCGC GGAAGTGATC GCCGGCATCA TCGCGGCGAC TTCAGCCGGC GCGCCCGCGG ATCGCCTCGC CGAGTTGCGC GCCCGCCTCG CGGACGCCGA CCAGGCCTGG GCCGACAGCA GTGCGCAGCT ACCGCCACCC CCGCGGGCCA TGGGCACGCA GCTGTTCGCC CACGGCTGGC AGCTGTTCGC GGGAGTCACC GGGGACCGGT CCGCCGCCCG GCTCGCCGTG GCACGCTCCC AGGCGGTCCT GGACTGGTTC GACGGTCCTG ACCACCCGAT GTGGCCGATG GTCGTCATGA CCGCCGCCCG CGCCCGGCGA CTGCGCGGCG AGCCCGGCGA CCTGCCCGGA GGTCGCCGCC TGGCGCTGCG AGCCCTGCGC GGGCACGCCT GGCTCGTGCT GCTGCAGTCG GGCACCCGAG ACGCGCTGGA GGTCGCCCGC GGGGCGGCGG CGGATGCCCG GGTGGTGCTG GAGTGGTGTG TCGAGGACCT GCGGGGCGGC GACGAGCAGG CCCGCGACGA CCTGGTGACC GCCGTGGAGG CCGGCCGCGG CCTCGTCCTG CACGCGGCGG TGACCACCCG GTCGGCCGCG GAGCAGCTGC GTGAGCTTGG GCATGCCGAC CTCGCGCAGG CCTGGGCAGC CGCAGGCCTC GAACCGGCCG AGGCCGACCC GGCTGCGCAA GGCCCCGGCT GGTTCGCCGA CGGGGCCGAA CGGTCCGCGC TGCGCCGGGC TGTGCTGGCG GCGCTGAGCG AGTCCGCCGC CGGACCCGAC AGCCTGCTGC TTCCACCCAG CATCGGCGAG ATCCGCGCAG CGGTGGGCAC GCACGGCTCC GACGCACTCG TCTATCTGGT CCCCGGCGTC GACCGGCCCG GGCATATCAT CATCATCCCG GCTGACGGGA CCAGACCGGT GGAACATGTC GAGCACCCGG GGCTCGTCAG CGCCGACGAC TCCCCCATGG GCCGGTACCT GGCGGCGTAC GCGGGTTGGC GCCAGGCTGC AGGCGGTGAG GGTGCCAGGC AGGGTGTGGC CGAGCCAGCG GCCGCGTTCG CGGCGTCGCG CGCCGCGCTG GCGGCGTGGC GCGCTGCGCT GGCGGAGCTC ACCGAGTGGG CGGGTCGCAT CGCCGAGGTG CTGCTGGACC GGGTCTCGGC TGTCGCGGCC GACACCACCT GGCCGACCTT CGTAATCACC TCGGTGGGGG TGCTCGGCCT AGTGCCCTGG CACGCCGCCG TGCTGGACGT CGAGGCGCCC GGTCATCAAA CCGGCGGCGG CACCCGGCTC GCCCAGCGGG CGACGGTGTC CTACATCCCC TCGGCCCGGC TGTTGTGCCG GACGGTCGCG ATGCCCGCCG CTGCCGACGG CGACGTGCTG ATCGTCGGCA ATCCGGTGGG CGCGGCGTCC CACCCTGCCG GTGACGCCGC CTGGCATCTA CCATCGGCCT TCTACCCCCG CGCCATGCTG CTGGGCGGGG GGCCGTCCCG GTCGGGCCGC GAACCGGCCA CGGTGGCGGG CGCGGGGGCG TCCGGGGACG GGACGCCGGC CGAGGTGCTC GCCTGGCTGC GCCGGCCAGG CGCCCGCCGC CTAGTGCTCC ACCTGGCGTG CCACGGCTTC GCCGTCCCCG CGGACCCGGC GGCGTCGCAT CTGGAGCTCG CCGACGGACG GCCGCTGACC GCCCGCGAGC TGCTGGCCGA GCAGAGCGCC GCCACCCGCG TCGAGCGGGT GTTCCTCGCT GCCTGCTCGA CCAACGTCAC CGGTGCGGAC TACGACGAGG CGTTCAGCAT CGCGACCGCG TTCCTCGCCG CCGGCGCCCG CACCGTGTAC GGCTCTCTGT GGGATCTGCC GGACAGGTAC ACGGCCACCA TGATGTTCGT GTTGCACCAC AACCTGGTCG TGGAGGACTG TTCACCGGCC GAGGCCATGC AGGCCGCCCA GGACTGGGCG CTGGACCCCT ACCGGACGGC GCCGGCCACA ATGCCCGGCC CGGTCGCCGC GGCCATGGTG GACGACGCGT GGCTCCCCTT CGTTGACCCG GCCGCCTGGG CCGGGCTCGT CCACCTTGGC GCCTGA
|
Protein sequence | MDGLNLEIDA LVGPALDLLA TAERDGDPVA ARAAVGLLAQ AYDLAPGHSG APGLAMLIAE WEGELAQADL GSWDSVVLWW RRGWPAPRPG GRPSLSIAEL DAMEGALHGE RYVNGRDPAE RDAAMRLLAP LADAGKLGPQ DLLGYAALLI VWYEQDAVPA ARDTALDLLA PMVTAGTADP GIVADYAELV FDRAEQSGTT RDWAQAVHWL RESLRIGGAQ NVADTWDSLG HAYWAWSRLD DDDALFDEAV DCFTAAVRHG QPAELLAPAA LVFVADRLRE QRRSAGGVDD ALLADIRAVL ALADQIVDRP ELDTEIRATV AMEILTLQYL IVDGNEITRR LVQEGLEAFP LAGDRFDRLL GYADAHPDPP QGWSSSRAAM RAMTSLLRAM FGTGPRPGAD FGLQHAAEAM RTAEHSDDAA ALFSLISAVA GGFSGDLGRV KAAEQLVGTS ADPSATAISL FARLGQMLLV GMPADGARTA VELLSVLARS LAQVLGGGLR GRLQPPAADT GKPGHLIEVS QYLDVSRRLD VGQLREIADV FAAFPGCDAL VAQAAAAEVI AGIIAATSAG APADRLAELR ARLADADQAW ADSSAQLPPP PRAMGTQLFA HGWQLFAGVT GDRSAARLAV ARSQAVLDWF DGPDHPMWPM VVMTAARARR LRGEPGDLPG GRRLALRALR GHAWLVLLQS GTRDALEVAR GAAADARVVL EWCVEDLRGG DEQARDDLVT AVEAGRGLVL HAAVTTRSAA EQLRELGHAD LAQAWAAAGL EPAEADPAAQ GPGWFADGAE RSALRRAVLA ALSESAAGPD SLLLPPSIGE IRAAVGTHGS DALVYLVPGV DRPGHIIIIP ADGTRPVEHV EHPGLVSADD SPMGRYLAAY AGWRQAAGGE GARQGVAEPA AAFAASRAAL AAWRAALAEL TEWAGRIAEV LLDRVSAVAA DTTWPTFVIT SVGVLGLVPW HAAVLDVEAP GHQTGGGTRL AQRATVSYIP SARLLCRTVA MPAAADGDVL IVGNPVGAAS HPAGDAAWHL PSAFYPRAML LGGGPSRSGR EPATVAGAGA SGDGTPAEVL AWLRRPGARR LVLHLACHGF AVPADPAASH LELADGRPLT ARELLAEQSA ATRVERVFLA ACSTNVTGAD YDEAFSIATA FLAAGARTVY GSLWDLPDRY TATMMFVLHH NLVVEDCSPA EAMQAAQDWA LDPYRTAPAT MPGPVAAAMV DDAWLPFVDP AAWAGLVHLG A
|
| |