Gene Information |
Locus tag | Franean1_6864 |
Symbol | |
ID | 5675177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8367903 |
End bp | 8369504 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245713 |
Product | hypothetical protein |
Protein accession | YP_001511104 |
Protein GI | 158318596 |
COG category | |
COG ID | |
TIGRFAM ID | |
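The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 8367903
end_bp = 8369504

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1602
print(protein_length)  # 533
```

Under these assumptions the result agrees with the Gene Length (1602 bp) and Protein Length (533 aa) fields.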
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.865283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGTGC CCGGGTCTTC GGGCCCGCTC CCGTCCGCTG CCGGGACGGA CGGGCCCGCC GGGAGCGGTG CGGGCGAGGA CGTGATCGCG CGGACGCTGC CGTACGAGCG CCTCGGCACG CCGGGCTCGC CGATGCGGCC GGACAAGACT CTCCTCAGCT CCGTGGCTCG GACCAGTGCC ACCAGCCAGA GCAGCCCCGC CAACTCGGGC AGCTCAACCA GCACGGACAG CCCCACCCAT TCGGACAGCC CTTTCCGCCA GCGGGCCTCT GCCGGCCTGG GCAGCCCCGT GAGCAAGGGC GGGGGCCCGG TTGGGAGCAC CAGCGAGAAC CCCGTCCCGG ACGGGCCAGG GCCCGGAGCC AGGCCCGAAG CCGAAGCCGG AGCCGAAGCC GCGGACCCGG ACGAAGCTGC CCAGGCGCCA GGGGCCGGAG GCCAGGGGAA GGACAGCCGC CGCGGCGGGA ATCACCGCGG CGGGACGGCT CGTCGAGGTA GGCCGCGCCG ACGACGTCGC ACCGTGCTCG GGGTCGCCGC CGCCGCGATG GTCATCCTGC TCGGCGCGAT CGGCCTGGTG ACGCTCACCG GCGGCCCGGA TTCGCGGCGT TCGGCCGATG CCCCGGCGGC GAGTCAGGCC CCGGATACGG CGAAGGCGCC CGGGGCGCAG CCCGCGCAGC CGGAGCCGGC ACCGACCGGG CCGGCCGCGC CGGGGGTGCG GTGGCCCTCC GGGGCGAACG GGAACCCGCC ACTGGACATC CGCGCCTGGG AGGCGTGGAC GGGGCGGCTG ACGAACGTCG CGGTCGTCTT CACGAAACGT AATGACTGGA ACCAGATCGC CTACGACAAC TGGCCGATGT CGGACTACCC GCCGGGGGTG TACAACGGGC AGCTTTCCAT CGCCCAGCCG CTGTTCCCGC GGTCGGGTGA CGAGCGGACC TGCGCCCGGG GTCACTATGA CGCCTACTGG GCGGCGTTCG GCCAGACCCT GACGCGTAAC GGCCGGCCCG ACGCCATCGT CCGGCTCGGC TGGGAGTTCA ACGGGAACTG GTTCTGGTGG TATCCGCGGG ACACCGCGAC CTGGAAGACC TGTTTCCAGC GGGCGGTCAC CCAGATCCGG TCGACGGCGC CGGCGGTGCG GATCGACTTC AACGTCAGCG CGCACCGTGA CCGGATGCCG AACGGTGACG ACGTGTGGGC GGCCTACCCG GGCGACGAGT TCGTGAGCAT CGTCAGCAGT GACGCCTACG ACTCCTACCC GCCGTCGCGC TCCGCGGAGA CCTTCGACCA GCAGTGCAAC ATTCCGTCGG GGGCGTGCAC CGTGGCCGCG TTCGCGCGGG CGCACGGCAA GCAGTTCGCG GTGCCGGAGT GGGGGCTCGT CCGGGTGGAC GGGAACGGCG GCGGCGACAA TCCGCTCTTC ATCGAGAAGA TGCACGACCT GTTCGACCGG AACCGGGACA TCCTCGCCTA CGAGGCGTAT TTCAGCACCG CCGAGGCGGA CAACGTGCGT TCCTCGCTGA TCAACCCGCC GCTGAACCCG ATGGCCGCGC AGCGGTACCT GGAGCTGTTC GGTGCCGGGG CCGGCGGCAG TGGCAGCCCC GGCGGCCTGT GA
|
Protein sequence | MDVPGSSGPL PSAAGTDGPA GSGAGEDVIA RTLPYERLGT PGSPMRPDKT LLSSVARTSA TSQSSPANSG SSTSTDSPTH SDSPFRQRAS AGLGSPVSKG GGPVGSTSEN PVPDGPGPGA RPEAEAGAEA ADPDEAAQAP GAGGQGKDSR RGGNHRGGTA RRGRPRRRRR TVLGVAAAAM VILLGAIGLV TLTGGPDSRR SADAPAASQA PDTAKAPGAQ PAQPEPAPTG PAAPGVRWPS GANGNPPLDI RAWEAWTGRL TNVAVVFTKR NDWNQIAYDN WPMSDYPPGV YNGQLSIAQP LFPRSGDERT CARGHYDAYW AAFGQTLTRN GRPDAIVRLG WEFNGNWFWW YPRDTATWKT CFQRAVTQIR STAPAVRIDF NVSAHRDRMP NGDDVWAAYP GDEFVSIVSS DAYDSYPPSR SAETFDQQCN IPSGACTVAA FARAHGKQFA VPEWGLVRVD GNGGGDNPLF IEKMHDLFDR NRDILAYEAY FSTAEADNVR SSLINPPLNP MAAQRYLELF GAGAGGSGSP GGL
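The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the codon assignments of NCBI translation table 11, which match the standard code for elongation, and it reads the first codon as methionine, which is why the GTG at the start of this gene corresponds to M in the protein. The helper names (clean, gc_fraction, translate_cds) are hypothetical and not part of any database tool.

```python
# Codon assignments in TCAG order, as used by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the record's sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        residue = CODON_TABLE[dna[index:index + 3]]
        if residue == "*":
            break
        # Assumption: the first codon is a valid start codon and is read as Met.
        protein.append("M" if index == 0 else residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "GTGGACGTGC CCGGGTCTTC GGGCCCGCTC"
print(translate_cds(prefix))  # MDVPGSSGPL
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence above. Applying the same functions to the full gene sequence would be one way to check the GC content and Protein Length fields.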