Gene Information |
Locus tag | Franean1_0523 |
Symbol | |
ID | 5668942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 609387 |
End bp | 610772 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641239452 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001504890 |
Protein GI | 158312382 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
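The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustrative consistency check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 609387
end_bp = 610772
gene_length_bp = 1386
protein_length_aa = 461

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 610772 - 609387 + 1 = 1386 bp, and 1386 / 3 - 1 = 461 aa.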
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGCGCT ACTGCACAGG CCAGTCCGTG CCAGTAGAGT TCGGCACCAT CGAACGCGTC GCGCGAGTAT GCGGCGCCGA CGGGGACGAA ATCGCACGCC TGTTCAGACT CTGGGAAACT GCGGCGTCGA TCTCGACGAG CGCCGGCATC GAACTTCCCA CCGCAGGCGG CGACCATGAG CACACCAATC CTGTTGAAGG CGTCGAGTAC TCAACCTCTC CGGTCTCTCT GGTGCCACCA GCACTCGCAG CCGCGGACGA CTCCGCACCA AGACCAGCCA CACAGAATGA GCACAGCCGG CTTCGCGGCG CACGCAGAGC CGGCTGGAGA TGGAACCGCC GGAGTATATC CGCCACGGCC ATCATCGTGG TGTTGTTCTC CAGCGCGGTC GGTACCGTAC TCGTTCTCGC CATGATTCTC GCGCGTGAAC CTGGCGTCCG GCCGCTCGGG CGCCCGTTGG CCGACCAGGC CGGCTGGGCG TTGTCCACCG CGTTCTCCCC CGACGGGAAA GTAATGGCTA GCAGTAGCAG GAAAGGCGGA GTGTGGTTGT GGAACATGGC CGATCCCGCC ACGCCCGTCC GAATCGATCC TGCGCTGACC GGCCCACGCG ACGGGGTGAC ATCACTGGCG TTCTCGCCAG ATGGGAGTCT TTTAGCCGGT GGCAGCTGGG ACGGGTCCAT ATGGTTGTGG GACATAACCG ACAGCGGGGC TTCCAAGCCG GCCGGCCGTG CGCTGACCGA CGACTCGGGA CCGATATGGT CGGTAGCATT CTCCGCGGAT GGCCGCACGC TCGCATCCGG CAGCGACGAT ACGACGGTGC GACTTTGGGA CATGACCAAC CGCGCCAGGC CGTGGCAATT CGTGCGGCTG AGCAGCGATA TGGAGTTCGT GACATCGGTC GCGTTCTCCG CGGACAACCG CCTCCTAGTC GCCGCCGGCT TCAGTAGGAC CATCGCGATC TGGGATATGG CCGACCGTGG GGCCCCTAAA CGGCTGGCCC AATCCCTGTC AACGCCCGCC ACTACGTACG TGGCCGCCTT CTCCCCCAAT GGACGGCTCC TTGCCACCGG GAGCACCGAT GGCTTGGTGC GACTTTGGGA CCTCGCCGTT CCAGAGGACC CCCATCCGAT CGGGAGACCG CTCACCGGGC ATACCAACCG CGTCTGGTCA CTCGCATTCT CTCCGGATGG CGGCACCCTC GCCAGCAGCG GGTTCGACAA CTCCGTGAGA CTGTGGGACG TGACCGACCT GTCCAACCCG GAGCCCATCG GCGCGCCACT CACCGGCTAC CAGGGCTGGG TTCTCTCGGT GCGCTTCTCC CCGAACGGCC GCGTGCTGGC AAGCACCAGC AGCGACAGCA CCATCCGCCT ATGGTCGCTA CCCTGA
|
Protein sequence | MQRYCTGQSV PVEFGTIERV ARVCGADGDE IARLFRLWET AASISTSAGI ELPTAGGDHE HTNPVEGVEY STSPVSLVPP ALAAADDSAP RPATQNEHSR LRGARRAGWR WNRRSISATA IIVVLFSSAV GTVLVLAMIL AREPGVRPLG RPLADQAGWA LSTAFSPDGK VMASSSRKGG VWLWNMADPA TPVRIDPALT GPRDGVTSLA FSPDGSLLAG GSWDGSIWLW DITDSGASKP AGRALTDDSG PIWSVAFSAD GRTLASGSDD TTVRLWDMTN RARPWQFVRL SSDMEFVTSV AFSADNRLLV AAGFSRTIAI WDMADRGAPK RLAQSLSTPA TTYVAAFSPN GRLLATGSTD GLVRLWDLAV PEDPHPIGRP LTGHTNRVWS LAFSPDGGTL ASSGFDNSVR LWDVTDLSNP EPIGAPLTGY QGWVLSVRFS PNGRVLASTS SDSTIRLWSL P
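The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below shows one way to translate a CDS and compute its GC fraction. It is a minimal illustration, not the method used to produce this record. The function names are hypothetical, the start-codon set is a deliberately small subset of the table 11 initiators, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of the initiation codons allowed in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognized start codon as Met."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein).rstrip("*")  # drop a terminal stop, if present


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGCAGCGCT ACTGCACAGG CCAGTCCGTG"
print(translate_cds(prefix))  # MQRYCTGQSV
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the protein sequence. Note that the second GTG in the prefix is internal and is read as V, not M.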