Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3781 |
Symbol | |
ID | 3906066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4530305 |
End bp | 4533361 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881108 |
Product | hypothetical protein |
Protein accession | YP_482861 |
Protein GI | 86742461 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.568022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0630741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACCA CCAGTCGCCG CCCGGTGCTG ACCCGCACCA AGGTGCTCGT GCCGGTGCTG CTTGTCATCG TCCTCGCGAT TACGATCATC GCCATCTTCA CCCGGCTATA CACTGACCTG CTGTTCTACC GCTCGATCGA CTTCAGCAGG GTGTTCACCA CTGTCCTCTA CACCCGGATT TTGCTGTTCG TCCTCTTCGG CGTGGTAATG GCGATCATCG TCGGGACCAA CATCGTACTT GCGTACCGGC TACGGCCGCC GCTTCGGCCG TTGTCGACCG AGCAGCAGAA CCTGGAACGC TACCGGTCGG TGATCGAGCC GTATATGCTG CTGATCCTGC TCGCGGTCAC GACGCTGTTC GGCCTGGCCG CGGGCCTGTC CGCCGCCGGC CAGTGGCGGA CGTGGCTGCT GTGGGTGAAC GGCGAGTCTT TCGGGACCCC TGATCTGCAG TTTCACCGGG ATATCAGCTA TTACGCGTTC AGCTATCCCT TCCAGCGGTT CCTGCTCGGC TTCCTGCTGA CCGCGGTGCT GCTGTCGTTG CTCGTCACCC TGCTGACCCA CTACCTGTTC GGCGGAATCC GGATGCAGAC CACGGGGGAA CGGGTGACCC CGGCGGCGAA GGCGCACATC TCGGTGCTAC TCGGGCTGCT CGCCCTGCTT AAGGCGTGGG CGTACTACCT CGACCGGTTC GGCTCGGTGT TCTCCTCGCG CGGGGTGGGG ACCGGTGCGT CCTACACCGA TGTGCACGCC GTGCTGCCGG CGAAGCTGAT CCTGCTGTTC ATCTCGCTGG CCTGCGCGGT GCTGTTCATC TACAACATCT TCCAGCGGGG CTGGACGCTG CCGCTGCTCG GCGCGGGCAT CCTTGTCCTG TCCTCGGTGG TGATCGGCGG GATCTACCCG GCGATCGTCC AGCAGTTCCA GGTCCGGCCG AACGAGGCGA CCCGCGAGGA GCCCTACATC GCGCGCAACA TCGCGGCGAC CCGGGCCGCC TACGGCATCC AGGACGTCGA GCCGCAGGAC TATGCCGCCA GCACCGACGT CACCGCCCAG CAGGTCGCCG CCGACACCGG CACGGTCCCC AACATCCGTC TACTCGACCC CAGCAAACTC TCCCGGACGT TCCAGCAGCT CCAGCAGTTC CGCGGCTACT ATGGTTTCCC GCCGACGCTC GACGTCGACC GGTACACCGT CACAGGGGCG GACAAGAAGT CGACGACCCA GGACTACGTG GTATCGGTGC GGGAGCTGAA CCAGGCCGGG CTGGGAACGC AGCAGCGCAA CTGGATCAAC GAGCACCTGA CCTATACCCA CGGCAAGGGA TTTGTCGCGG CGCCGTCGAA CACCGTCGAC GTGGGGCGCC CCGACTTCGA TCACGGCGTG AGCGGTTTCC CGCAGGACGG TACCTTCGGT ATCAAGGAGA ACCGGGTCTA CTTCGGGGAG ATGTCGCCGT CGTATTCCAT CGTCGGCACC AGGCAGATGG AGATCGATGG GCCGGGGCCG GGAGAAACCC AGGTCACCAC GACCTACCAG GGCGACGGTG GGGTCTCGAT CGGCTCGACC TTCCGTCGGG CGCTGTTCGC GCTGCGGTTC GGGGAGAAGA ACATCCTGCT CTCCGGTGAC ATCACGAAGA ACTCCCGGAT CCTGGACGAG CGAAACCCGC GGGACCGGGT GAGCAAGGCA GCGCCCTGGC TCACCCTCGA CGGAGACCCG TACCCGGCGA TCGTCAATGG CCGGGTGACC TGGATTCTCG ATGGTTACAC AACCTCCGAC GGCTATCCTT ACTCCGCCCG GCGCACCCTG GGTGACGTGA CCGCCGACTC GGTGACCACC CAGAGTGGCA ACCGCACCCG GCAGGCCTCG AACCAGGTCA ACTACATCCG TAACTCGGTG AAGGCGACGG TGGACGCCTA CAACGGCACC GTCACCCTCT ACGCCTGGGA CGAGTCCGAC CCGGTGCTAC GGACCTGGAT GAAGGCGTTC CCGGACACGG TGAAGCCGAA GAGTGACATC CCGCCGTCCC TCCGAGCACA CTTGCGTTAT CCGGAGGACC TGTTCAAGGT CCAGCGCGAT CTGATCGGGC AGTACCACAT CTCCAACCCG CGGGACTTCT ACTCCCAGGA GGACTTCTGG GATGTGTCCG ACTCGCCGGA TGGAAGCGGG CAACCCCAGC CGCCGTTCTA CGTCTACAGC CAGCTTCCCG GGCGCAAAGA CCCGTCCTAC AACCTCACCT CACCGTTGAT CTCCGCCCGG TCCTCGAAGC TCGCGGCGTA CATGGCGGTG TCCAGTGACC CCGCCAACTA CGGCCGGTTC ACCCTGCTCA AACTGCCGGC GGGGAACACG ATCAACGGTC CGGTCCAGGT GCAGAACGCC ATCGAGGGCA ACGGCGACGT GGCCAAACAG CTGTCGCTAT GGCGCAGCGG GGGAGCGACC ACGATCGAGG GCAACCTGCT GACCCTGCCC CTGGCCGGCG GCCTGCTCTA CGTCGAGCCC TACTACGTCC AGGCCAGAGG GAGCACCGGC TACCCGACCC TGCAGGGGGT CGCCACGGCC TTCGGCGACC GCATCGGTTT CGGTGCCTCG CTGGGTGAGG CCCTGGACAA GGTCTTCGGT GCCGGTGCCG GTGCCGCGGC GGCCGGGGCC GGCATCGGGG CCACGACCAC GGACGCCGGT CAGGACGGCA CGCCGGCTCC CAGATCCGGG CAGGGCGGGG CCGGCGTCCC GCCGCCGGGG CAGACCGCGC TGCAGGACGC CGTGGGCGAC GCCGACCGCG CCTACCAGGC GGGGCAGGAC GCCCTGCGCA AGAGCCCGCC GGACTTCACC GCCTACGGGA AGGCCCAGAG CGAGCTGGCT GATGCGCTGG GTCGCCTGCG TACTCTGTCG AGTCCGGCGG CGACACCGCC CGCTGCCACG GCGACACGGG CCGGTGCGTC GGTGCCCGCG TCACCGGTCC CCGCGTCACC GGCAGCCAAA CCACCGGCTC CGTCTCCTTC GGCGACCGTG GCCGGCGGCG ACACTCCCGG TCCACCGGGG GCCCGGGCCG CCCCGGCGCC CGGCTAG
|
Protein sequence | MATTSRRPVL TRTKVLVPVL LVIVLAITII AIFTRLYTDL LFYRSIDFSR VFTTVLYTRI LLFVLFGVVM AIIVGTNIVL AYRLRPPLRP LSTEQQNLER YRSVIEPYML LILLAVTTLF GLAAGLSAAG QWRTWLLWVN GESFGTPDLQ FHRDISYYAF SYPFQRFLLG FLLTAVLLSL LVTLLTHYLF GGIRMQTTGE RVTPAAKAHI SVLLGLLALL KAWAYYLDRF GSVFSSRGVG TGASYTDVHA VLPAKLILLF ISLACAVLFI YNIFQRGWTL PLLGAGILVL SSVVIGGIYP AIVQQFQVRP NEATREEPYI ARNIAATRAA YGIQDVEPQD YAASTDVTAQ QVAADTGTVP NIRLLDPSKL SRTFQQLQQF RGYYGFPPTL DVDRYTVTGA DKKSTTQDYV VSVRELNQAG LGTQQRNWIN EHLTYTHGKG FVAAPSNTVD VGRPDFDHGV SGFPQDGTFG IKENRVYFGE MSPSYSIVGT RQMEIDGPGP GETQVTTTYQ GDGGVSIGST FRRALFALRF GEKNILLSGD ITKNSRILDE RNPRDRVSKA APWLTLDGDP YPAIVNGRVT WILDGYTTSD GYPYSARRTL GDVTADSVTT QSGNRTRQAS NQVNYIRNSV KATVDAYNGT VTLYAWDESD PVLRTWMKAF PDTVKPKSDI PPSLRAHLRY PEDLFKVQRD LIGQYHISNP RDFYSQEDFW DVSDSPDGSG QPQPPFYVYS QLPGRKDPSY NLTSPLISAR SSKLAAYMAV SSDPANYGRF TLLKLPAGNT INGPVQVQNA IEGNGDVAKQ LSLWRSGGAT TIEGNLLTLP LAGGLLYVEP YYVQARGSTG YPTLQGVATA FGDRIGFGAS LGEALDKVFG AGAGAAAAGA GIGATTTDAG QDGTPAPRSG QGGAGVPPPG QTALQDAVGD ADRAYQAGQD ALRKSPPDFT AYGKAQSELA DALGRLRTLS SPAATPPAAT ATRAGASVPA SPVPASPAAK PPAPSPSATV AGGDTPGPPG ARAAPAPG
|
| |