Gene Information |
Locus tag | Franean1_0633 |
Symbol | |
ID | 5669050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 733839 |
End bp | 735578 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641239560 |
Product | hypothetical protein |
Protein accession | YP_001504998 |
Protein GI | 158312490 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00895594 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.470302 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTCTTCTG GTCGTGCCTG GCGACCCGTC GCCGGCCTCC GCCCGGCCAC CCTCCAGCCG GGCGGAGGCC TCGGCAGCCC CGATCGCAGC CCCGACGCAC CAACCAGCCA CGCCCCGAGC CCGAACCGCC CGGCCCGGGC GGTTCGCCGC CCGCGAAGAT CTGCCCGCGG CGAACCGCCC GCGGAGCCGC GACGCCCCTG GAGGTCCACG ATTCCCGGCA GTTGGCGGCG CCTCCCGTGG CGGCCCTCTC GGCAGTTCGC GTGGCGGTTC CTCAGGCCGG CACGCGGCGG GCGTGCCCGC GCCGGGCTGG GCGTGCTGTT TCTCTGCGCC GCGGCGGCCC TCGTCGCGAC CGGTCGCGGG GCCGCACCCA CCGAGCTCGA GCTCACCGAC GGCGGCGTGT GGCTGGCCAC CACCAGCACA GGCACCCTCA CCCACCTGAG CGGACCAGCC GGGCGGGCCG ACGCCGCGGT CACCGTGCCC GGCGCCGTCG GACGTGACCT GACTGTCGCC CGCACCGGCG CGGCGGTCCT CGTGGCCGAC CCCGGTTCGG GCCAGGTGCA CCTGGTCGAC CCTGCCCGGC TGGCCTCCGT CCGCTCGGCG GATCTGGGGC CCGGGGTGAC GATCGTCACC TCGGCGACGG CGGCCTACGC GGTCGACCCG GCGTCCGGGC GGGTCCGGCG GCTCACCCGC GACGACCTCG CCGGCGCCGG CCCCGTCCTG GAGCTCCCGC CGCCGCTGGG GCGCGCGGCG CTGACCGACG ACGGAACCCT GTGGGTGCCG GTCCGGTCCG CGGGCACAGT CGTCGCCCTG CGGGACGGCG CCGCCGAGCC GCCGCGCCCG GTCGCCCCGC CCGGCAACGC CGTCGACGTA GTCCTCGCGG GCGGGCACCC GCTCGCCGTG GACACCACCG CCGCCACCGT CACCGCCCCC GACACGGGAC GCGTGATCGC CCTGCCGCCC GCCGGCCCGA CCAGCGGACC GCTCCCCGGC CTGCTGGCGC CGCCGCGCAC GGACGGCGGC CCCGTGCCCC TCCTCGACCC GGCCACCCGC CGCCTGTTCC TCGTGGACGT CGATCAGGGC TTGGCGACCA CAGTTACCAC GGTGACCATC CCGGACATCC CCGGATCGGG CCAGCTCGGC ACACCGGTGG TCCATGCCGG TCACGGGTAC GTGCCGGACT CCGCGCTCGG CGTGGTGCTC GACTACGACA TCGCCCGCGG CGGCTTCGGC GATCCGGTGC CGGTCGCCGC GCCCGCCGAC CAGCCGCGGC TCACGGTGGC CGTCGACGGC GACCTGGTGT GGATCAACGA CCTGGCCGGG CCGAACGCCG TCCTCATCGA CGGCCGGGGG CGGACGGCGA TCGCCAAGCA GCCGCCGGAT CTCGCCGGTC TGGCCACCGA GGCGGCCCGC CCGCTCCCGC CGGCGCCGCC GCTGCCGACG GCAGGGCGGC CGTCGGCGCC CGGCCCGGCC CGCGATCCGG GGCCGGCCAC GCCGACCGCA CCAGCCGCGC CGACGGGGAC GGCCGCGCCG CGGACCACCT CCACCGGGCC GCCCACCCCG CCCGGACGGA CCACGCCCGA GCCGACCATC GTGCCGCCAC CCACTCCGCC CCCGCCGACC GGACCGCCAC CGACCGGACC ACCGCGGACG GCACCGCCGG ACGGCACGCC GCCACCGCTC CCCTCCCCGA CCGTCGCGCC CCGGCCGACC ACCACACCGC CCGGGACCGA CCTCGTGTGA
Protein sequence | MSSGRAWRPV AGLRPATLQP GGGLGSPDRS PDAPTSHAPS PNRPARAVRR PRRSARGEPP AEPRRPWRST IPGSWRRLPW RPSRQFAWRF LRPARGGRAR AGLGVLFLCA AAALVATGRG AAPTELELTD GGVWLATTST GTLTHLSGPA GRADAAVTVP GAVGRDLTVA RTGAAVLVAD PGSGQVHLVD PARLASVRSA DLGPGVTIVT SATAAYAVDP ASGRVRRLTR DDLAGAGPVL ELPPPLGRAA LTDDGTLWVP VRSAGTVVAL RDGAAEPPRP VAPPGNAVDV VLAGGHPLAV DTTAATVTAP DTGRVIALPP AGPTSGPLPG LLAPPRTDGG PVPLLDPATR RLFLVDVDQG LATTVTTVTI PDIPGSGQLG TPVVHAGHGY VPDSALGVVL DYDIARGGFG DPVPVAAPAD QPRLTVAVDG DLVWINDLAG PNAVLIDGRG RTAIAKQPPD LAGLATEAAR PLPPAPPLPT AGRPSAPGPA RDPGPATPTA PAAPTGTAAP RTTSTGPPTP PGRTTPEPTI VPPPTPPPPT GPPPTGPPRT APPDGTPPPL PSPTVAPRPT TTPPGTDLV
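The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API; the stop codon set is the one defined for translation table 11.

```python
# Values copied from the Gene Information table in this record.
START_BP = 733839
END_BP = 735578

# Stop codons in NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def ends_with_stop(cds: str) -> bool:
    """True if the last three bases (spaces ignored) form a stop codon."""
    bases = cds.replace(" ", "").upper()
    return bases[-3:] in STOP_CODONS


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Prints the derived gene length (bp) and protein length (aa).
    print(length, protein_length(length))
```

Under these assumptions the derived values match the Gene Length (1740 bp) and Protein Length (579 aa) fields, and the last block of the gene sequence ends in TGA, a table 11 stop codon.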