Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0258 |
Symbol | |
ID | 5668683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 311560 |
End bp | 314049 |
Gene Length | 2490 bp |
Protein Length | 829 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239188 |
Product | hypothetical protein |
Protein accession | YP_001504631 |
Protein GI | 158312123 |
COG category | [S] Function unknown |
COG ID | [COG1289] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.828361 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACAGC TGTGGTGGCG GGGAGCGGGT CGTCCGCGGC TGTTCGGAGC ATTCGACAGC GACCCCGGGC ATTTCTGCCT CACCTCGGCG GCTCGCGTCG CGATCGTCGC CCCCGCCCTG CTCGCCCTCG TCTCCACGCT TGTCGGCGAT GTGCGGATGA GCCTGTTCGC CTGGTTCGGC GCCTACGCGC TCCTCGAGTT CGTCGACTTC GACGGGCCCC GACCGGCCCG GCTCACCGCC TACCTCACGC TCGCCACCAC CGGCGCCGGG ATGGTCGTCC TCGGCACGCT CGGCTCGCGC TCGCCCTGGC TGGCGGCGTC GATGACCGCG GTTGTGGCCT TCGTTGTGAT GTTCTCCGGC GTGCTGAACG CCAAGGTCGC CGTCGCCGGC CGATCCGCGT TGCTCGCCTT CGTGCTGCCT GTGATGACAC CCGGCCCGAT CTCGGCGATA CCGGAACGGC TGGTCGGCTG GGGGCTGGCC TCGGTCGCCT CGATCACCGC CGCGATGCTG CTCTGGCCGC GCCGCCCGCC GGACCGCCTG CGCGCGGAGA CGGCGGACGT CTGCCACGCG CTCGCGACGG CCGTGAGCTG GCAGCCCCAG GTACCGGACC CGCCGGCAGC GGTGTCGCGC GGTCCGGCGG CCGACGTGTC GCGGCGTCCA GCGCCCGATC TCTGGCCGGC GCTGCGGCGG CTGCGGCGGC AGTTCGTCGC GACGGCCCAC CGCCCGACCG GGGTCGGTGG CCGTGCGGCG GCGCTCGGCC ACCTGGTGGT GGACGTCAAC TGGCTGGTGC CGTTCGCCCT TCCGTGGCCG GAGCGGGACC GGACCGCCCG CGCCTGCTTT CCCGCGGAGG CCGCCGAGCT GCACGCCGCC GTCACGGCGA CGCTGCGCGC GGCGGCCACC CGCATCGAAC CGTCCGACCA CGGTCGGCCC AGGCCCAGGC CCAGGCCCAG GGCCGATGGC GGGGGTGGTG GGATGCGGGT CGGGGACGAT CGGCTCGGGA TCGCGCGACT CGAACGGTCC GAGCGGGCGA TGCGGTCGGC GTTGCTGCGG CAGCTGCGGG AGCCGGCGCC CGCCTGCCCA CCGGACCTCG CCGCGGCCGA GGCGTTCCGC CTGCGCCGGC TGGCCCGGGG GGCCAGGGAA CTGGCGCTGA ACGTGCTGAG GGTGACCGGC CCCCTCCCGC CCCGCCCGCC GGCGAGTCCC GTCCGGCGCG CCGTGGACGC GTTCCATCAC GCGCGCCGGC TGGTGCGCGA ACGGGGAACC GCCGCGACCG ATCTCGCCGC CGGCTACGCC AGCCCGCGGT CGGTGTGGTT CCGCAACAGC GTGCGCGGCT CCCTCGGCCT CACCACGGCG GTGATCATCG CCCAGGCCGC CGGGCTCCAG CACGGCTTCT GGGTCGTGCT GGGGACCTTG TCCGTCCTGC GCTCGAACGC CATGGCCACC GGTTCCGCAG CGGTACGCGC GCTCGCCGGC ACCGGAGTCG GCATCGTCGT GGGTGGCCTT TTCGTGGTCG CCGTCGGCAC CCACACCGCC GTCCTGTGGG CGGTTCTCCC GCTCGCCGTG CTGCTCGGGA GCTACTCCCG CCGCAGGTCG GGGTTCGTAC TGGGCCAGGC GGGCTTCACG GTCTCCGTGC TCATGCTGTT CAACATCGTC GAACCCGCCG GCTGGCGGGT GGGCATAGTA CGGATCCAGG ACGTCATGAT CGGGTTCGGG GTGAGCATCG TCGTCGGTGC CCTGCTGTGG CCCCGCGGCG CGGTCGCCGT GATCCGGACC CGCGCCGAAT CCGCCTACCG AAGTGCGGTG ACGTTCCTCG ATCTCGTCGT CCCGCACGCG CCGGGTGCCC CCGAGCATCC GGCCGTGGCA CCGGCCGCCC GCGAGGCGAT CCGGGCCGGC CGCCTCCTCG ACGACGCCGT ACGCCAGTTC CTCGCCGAGC AGCCGCCGGG CCGCTTCGAC GTCGACGCGC TGATGACGAT CGTCGCCGGC GCGCTGCGTA TCCGGCGGAC GGCGCAGCTC CTGTGGAACG GGGACGTCCC CTGGCCGCCC GATCTGACGC CCGATCTCCC ACGCGCGACC GGCGCCGGCC GCGCCGAGGC CACCGGCTTC GCCGTCGCCC AAGGCATCCT CATCGAGGAC ATGCGGGACC TCTGCCGTTG GTACACCGCC TACGCCACGG CCCTCGGCGC CGCGCGGCGG CCGCCCGAGC CCGAAGCCGG CTCCGGCCGG GCCGCCACAG CCGGGCTGAT CATGATCCAC AGGGCTGCCC GCGCGCACCG CTGCCCCGAG ATCCTCGCCG GTGCCGCGCT GACCTCCCGG GCCGCCTACC TGGACATCCT GCGCGACCTG CAGCCCCGGC TGACGGCCGC GGCCACGGCG CTCGACCAGA CGTCCGACGG CGGCCGGGAC CGCCCGCGCA AGCCGGTCGA TCAGGCGCCG GGGCCCCAGG GGTCGCGGCC GGCCCCGGAA CGCTCGCGAG CCTTCATCCG AGCCTCGTAG
|
Protein sequence | MRQLWWRGAG RPRLFGAFDS DPGHFCLTSA ARVAIVAPAL LALVSTLVGD VRMSLFAWFG AYALLEFVDF DGPRPARLTA YLTLATTGAG MVVLGTLGSR SPWLAASMTA VVAFVVMFSG VLNAKVAVAG RSALLAFVLP VMTPGPISAI PERLVGWGLA SVASITAAML LWPRRPPDRL RAETADVCHA LATAVSWQPQ VPDPPAAVSR GPAADVSRRP APDLWPALRR LRRQFVATAH RPTGVGGRAA ALGHLVVDVN WLVPFALPWP ERDRTARACF PAEAAELHAA VTATLRAAAT RIEPSDHGRP RPRPRPRADG GGGGMRVGDD RLGIARLERS ERAMRSALLR QLREPAPACP PDLAAAEAFR LRRLARGARE LALNVLRVTG PLPPRPPASP VRRAVDAFHH ARRLVRERGT AATDLAAGYA SPRSVWFRNS VRGSLGLTTA VIIAQAAGLQ HGFWVVLGTL SVLRSNAMAT GSAAVRALAG TGVGIVVGGL FVVAVGTHTA VLWAVLPLAV LLGSYSRRRS GFVLGQAGFT VSVLMLFNIV EPAGWRVGIV RIQDVMIGFG VSIVVGALLW PRGAVAVIRT RAESAYRSAV TFLDLVVPHA PGAPEHPAVA PAAREAIRAG RLLDDAVRQF LAEQPPGRFD VDALMTIVAG ALRIRRTAQL LWNGDVPWPP DLTPDLPRAT GAGRAEATGF AVAQGILIED MRDLCRWYTA YATALGAARR PPEPEAGSGR AATAGLIMIH RAARAHRCPE ILAGAALTSR AAYLDILRDL QPRLTAAATA LDQTSDGGRD RPRKPVDQAP GPQGSRPAPE RSRAFIRAS
|
| |