Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5623 |
Symbol | |
ID | 5673950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6836922 |
End bp | 6839258 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641244476 |
Product | hypothetical protein |
Protein accession | YP_001509880 |
Protein GI | 158317372 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00516948 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCCTT CGATGGAGGA TCCGCGCGTA GTCGACGTCG AGCGGTACCA CGGTCAGTTG CGCTCGGAAC TCGAGATGAT CGCGAACGGA CCAGTACTGG CCGGCCCACG GATCGTCGAG CACATCAATT CCCTGATCAC ACTGATTGAC CTGCACCAAC CGAACGAGTT CGACCTCTGC CTCAGCTGTG ACCGGCTCTG GCCCTGCGCG ACCGTCGTCG CCATCACCGG TGAGCTAGCG GCCACCGAGG ACGAGGTGAC GCCCCGGCCG CTGCCCGAAG CCCCGGCCAC GCGGTCCGGC GACGGCCGCC AGGCGGACAG GCGCCCGGCC GACCCCCGCC ACGCGGCGGC GGGAGGCCCC GCCGACCAGC GCCGGCCGGA CGTTCGCGCG GCCGCGCCAC CACCATTCGA CACGGGCTCC TACTCGACGA CGACGCAGAC CAGCACCGGC CAGTCGACGA TGCCGCACGA ACTGCCACGG GGCACGGACC CGGTCCGGCA GCGCGGCGCC CCCGGCCCGG CGGACGAGCC GATGGCCGCG CCCCGGCCGA CCGGGCCGCA CCGCGTGCCG CCGCCGCCCT CCATGCCACC ACCCTCCATG CCACCGCCGT CGCATCCGTC CGGCATGCCG CCGGGCACGC GCCCGCCGGC CGCCGTTCCG CCGGCCGCGG CCGCGGCCAC CGGCGGGATG CCCACCGGCG GCTCGCGGAC GGGTGGGATG CCCCTGGGTG GCGCGGGCAA CAGCGCGTCC GCCCGCACCC CCGTATTCGG CGACCCGTTG GCCGGGATCC GCCGTCCGGG CGCCGGCATG CCGCCGCGGG TCGGCGGGGG CCCGCCCGGC GAGCAGGCCC CACCCGGCCG GCCGGGGCCG CCCCCCGCCG ACACCGGTCA CCGCTCCGCA CCGCCCGGCC TCCAACCGCC CGGCCTCCAA CCACCTGGCC TCCAGCCGCC GGGGCTCCAA CCACCCGGCT TCCAGCCGCC CGGACTTCAA CCACCTGGGC TCCAAGCAGC CGGGCCGCCG CCGGGCACCG CCGGGCAGCC GGGACGTCCG ATGTCGGCCC CCGGCTTCGA GCGAGCCCGC GAACAGCAGC TTCCCGGTGG CCCGGGCAGC CGGCCCGCGC CCGGGTTCAA CCGCCCGGGC GGGCCGCCCG CGCACCGCCC CGCGGCCGGC CCGGCCGGCG GCCCGCCGAC CGGGCCGATG GAGCGCCCCG GCTACAACGG CCAGCCCGGC TACAACGGGC AACCGGCGTA CAACGGGCAA CCGGCGTACA ACGGGCCGCC CAGCCACAAC GGGCAACCCG GCCACAGCAA CGGGCAGCCT GGCTACGCCG GGCAGCCCGG CCAGGGCCGG CAGCCAGGCC ATCCCGGTTA TCAGGGCCAG GTGGAGCAGT CCGGCCCGAT CGCGCGGCCC GGCCTGTTCG GCCAGCCCGC GGCAACGGGC CCGGGACGGT CCACGGGCCC GGTCTCCACC GGGCTCGGTG ACCGCATGCC CGGACGGGCC CAGCATCCGG GCGATCCGTC CCGCCCCCTG CCCCACCCCG CGCAGTCCCA ACCGGCGCAG CACCAACACG CGGCCGGCGT GTCCCGGTCC GAGCCGTCCC GGGCTCCGGC CCGTCCTGAC GGTGTCGAGG CCCCGGCGGC TCAGGCCGAC GGTTTCTCGG TGCCGACGTC CCGGCAGGCG CCGCCGATCT CCGGCCCGGT GGAGATGCTG CCCGCCAGCA TCGCCGCCGC CCGGGCCGCG GAGCTGGCGC GCGGGCAGGC GGCCGCCGCG GCCGGACCAC CGCCGCCGCG GCATCCCGAC ATCGTCCGCG GCCCCGAACG TGGAACCCGG CCGCCCGTCG GACGGCTCAC CTCCGACCCG GCGCGCACGC GCGGCCGGCA CGCGGGCCCG GACCAGCCGC GTCCCCAGGG CTCCCCGTGG AGCAGCCAGG CCGCCGAGCA GCGCCGTCCC ATCTCCGTCG ACGACTCCTG GGCCGGTGTC GGGCGCAGCC CGCGTGGCCA GAACGGCGAC GGCGGCCAGG GCGGTCAAGG CGGCCAGCCC GGGCACGGTG GCCCGAACGG CCAGAACGGG CACGGCGGTC AGAACGGGTA TGGCGGCCAG AACGGCCAGC ATCAGCCGAA CGGCCAGCGG GGGCCCGGCG GCCCGGGACG TCCCGACACC GACGTCGGCG CGGTCGGCGC GAACCGTTCC AGCGGCCCGG CCGGCCCGGG CCCGGCGGAC CGTGGCCCCG GCCGCGACCA GGCCTCGGAC CCGTCGCTGA GCCCTGAGGT GGAGGCCGTC ACGAGGGCCT GGCTCGCGCG CAAGGATTCG GTGCTCGACG GCATCGACGT CATCTGA
|
Protein sequence | MLPSMEDPRV VDVERYHGQL RSELEMIANG PVLAGPRIVE HINSLITLID LHQPNEFDLC LSCDRLWPCA TVVAITGELA ATEDEVTPRP LPEAPATRSG DGRQADRRPA DPRHAAAGGP ADQRRPDVRA AAPPPFDTGS YSTTTQTSTG QSTMPHELPR GTDPVRQRGA PGPADEPMAA PRPTGPHRVP PPPSMPPPSM PPPSHPSGMP PGTRPPAAVP PAAAAATGGM PTGGSRTGGM PLGGAGNSAS ARTPVFGDPL AGIRRPGAGM PPRVGGGPPG EQAPPGRPGP PPADTGHRSA PPGLQPPGLQ PPGLQPPGLQ PPGFQPPGLQ PPGLQAAGPP PGTAGQPGRP MSAPGFERAR EQQLPGGPGS RPAPGFNRPG GPPAHRPAAG PAGGPPTGPM ERPGYNGQPG YNGQPAYNGQ PAYNGPPSHN GQPGHSNGQP GYAGQPGQGR QPGHPGYQGQ VEQSGPIARP GLFGQPAATG PGRSTGPVST GLGDRMPGRA QHPGDPSRPL PHPAQSQPAQ HQHAAGVSRS EPSRAPARPD GVEAPAAQAD GFSVPTSRQA PPISGPVEML PASIAAARAA ELARGQAAAA AGPPPPRHPD IVRGPERGTR PPVGRLTSDP ARTRGRHAGP DQPRPQGSPW SSQAAEQRRP ISVDDSWAGV GRSPRGQNGD GGQGGQGGQP GHGGPNGQNG HGGQNGYGGQ NGQHQPNGQR GPGGPGRPDT DVGAVGANRS SGPAGPGPAD RGPGRDQASD PSLSPEVEAV TRAWLARKDS VLDGIDVI
|
| |