Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2761 |
Symbol | |
ID | 5671150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3268666 |
End bp | 3272019 |
Gene Length | 3354 bp |
Protein Length | 1117 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241670 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001507090 |
Protein GI | 158314582 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.645221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.818272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGTCAA GTGCCCGTGC GGGCAGAGGA AGTATCGCGG GGTCGTCCGG CGACGGCGGG GTGGAATTCA GGCGGGCCGT CGCTGCCTAT GCGGTCGTGG CTGGGCTCTC CGACACTCCG CTGCGTGGTC TGGGCATCTC CGAGGTCGAC GCGCGGGTGA CCGCGGTGGT GCTGGAGACC GAGGATGCGG TCGACGATCT GCGGGTTGAC TTCGCGACCG GTGGGACGGC GTACATCCAG GCGAAGAGGA CCTTACGGGG AGGCACCGTC CTCGCGAAAG CGGTCGCGCA GTGGGCGCGC GCGGGCGAGG GGCCACTGGA CCCGGCCCGC CATCGGTTGG TGATCGCAGC GGGTGAGCTG TCGGACTCGA TGAAGGCGCT GCGCCGGGCT CTCGACGCGC GGCGGTTGTC GGTCCCAGGA ACTCCGAGCC CGCAGGAAGC GGAACAACTT CACCGCGTCG AGGAGCTGTT GTCGCCGCTC AACCCTGCGG CACGAGACAT CGTGCTTCAG TGCGCGGTCG TCTGGCATCT TGACGTCGAG GAGACGGACC GCGCCGACGC GCAGAACGCC TTGCATCAGC TACGCGGGCT CGTCGCGGGC GGAGACGCCC CGGCGGCAGG CAACGCGTGG TCGGCGCTGC TTGATGCTGC TGGGTATGCG GCTCGCCGCC GCGGTGGCCA TAATCTCGAC GGTTGGCGAA CCGCGATGCG CGATCGCGGC ATCGGTCTCG TCGAGCACAG CAACGCACCT GAACACTCTG GGAGCCAGTC CAGCCTCGCC CGGCCATCCA GCCATCCGAT TGGGGACCAC GACCTGCGGG GCCGGCTCAC GCTGCAACCG CCGGTCCGCG CGCCACGACT GAGGGGAAGC CTGGCGTCGG CGACCCGGCC TGGCAGGACG AGGTGGCTAT GGGGGGCCGC CCTGGCGGCC GTTCTGATAA TGGGGGCGGG TACCGCGATC TATTCGGTGC ATGACCGGCA GGCTTCCGCG ACTGACGAGC GGAACCTGGC TGACTCCCGC CAGTTGGCAG CCGACGCGGA GAGCACGCGG GACAGCGACC CGAACCTGGC ACGCCAACTG GCTGTGCTGG CATGGCGCAC CGCGCCGACC ATGGAGGCAT GGCGAGCACT GTTCGCGGTC GCCGATCTGC CGACCGATGT CGACGTGGAC GATCAGGTCC TGGCGGTGGG CCGCGTCGGT GAGCGGTTCG TCGCCGTCAC CGCGCTGTCC ACGAAGGCCC AGCTGTGGGA TGTCACCCGG GTCAGCCGCA AGGCCGAACC ACTGGCCGTG CTGCCGGGCC AAGGTCGGGT CACCGCGGCG GTCTTCAGCC CGGATGGTCG ACGGCTCGCG ACCATGGCAG TCGACGGCAG CGTCCTGGTC TGGAACGTCG AGAGTGCCAC CGCCGGACCG CCCGGGAACA ATCCGGTGGA GCTCACAGCA CTAGCGTCCC TGCCCGGGAA CGACTCCAAC TCGCCAGCAA TGGCCTTCAA CTCCGACGGC ACCCTGCTCG CGGTCCCCGG CCCCGAAGGG TCGGTGGAGG TCTGGAACAC CCGGCTGCCG GCCGGGAGCG CCTCCGCTCA GAACAGCACA GGCCCCGACC TCGAGGCCCT GCCGCTCCAA ATCCCCGGCG TCGTCAGCGG CATCGACGAT CCCGGCGTGT CCGCGCTGGC GTTCACCCCC GACAGCGGTC GGCTGGCCGT TGGAACGTCC GGGGGAACCT TCGCTGTCTT CGCTCTGGGG CGGTCGGCGG AACCACTGAT GCGGCGGACC AACTATGCCG GCGCCATGAA CGCCGACGCG GTCGAAGGCA CTGCGGGACA GGCGACGCCA CCGGACACGA TGGAAACCAG CCACCTCCCG CCGCAGCGGC AACCCGTCAC GCGACTGCTG TTCCGGCCGG ACGACGGCGC GACCCTCGCC GTGGCTGGAG GCGACGGCGT GGTGCGCCTG TGGGACGTCG CCGGGACCGA TCCCACGCTG GCATCGCAGA ACGCCCCCGC GAACCTGCCC ACGAAGCCGA CCGCGGAACT CGGCGACGGC GCGGCCGCCG TTGACGCCAT CGCCTACAGC GCCGACGGCC TCCACCTGGC CGTTGGCTTC GGGGCCAGCG GTGGGACCGT CAACGTATGG GACGTCACCG ACGAGAGCAG CCCGGCCCAA GGCGCCCCCA TCGAGCGGCT CAACGACTCG CCGACCTCCC TGACCTTCGA CCGGGCCGGT CAGACCCTCG CGGTCGGGAC CAAGGACGGT GCTGTGCGGC TGTCGACCGT TGCGTCTCCC GGGATGCCTC AGCCCAGACT CACGGTGCCG GACATCCTCG GGGACCATGT GGCGGTGAAC GGCGCGGGAA CGACCGTCGC TGTATCGCTC GCTGTCGACG GCGTGGACGG CGCGATCGAG ATCTGGGATC TCCGCGCCGG ATCGAGCGAG CCGACCGTCA CGTTCCCCGG CCGCGACGCG ACGGTCTACT CGATCGCGAT CAGCCCGGAC GGCGCCCTGC TCGCCGCCGG ATACGACGAC GGGGTCGTCA GGGTGTGGGA CCTGCGCGCT GCCCTCTCGA CACGCGCCGC CGCATCACCT GTGGCCGACC TCGCCGAGCA CGGCGCGATC GTGCAGGGCC TCGCCTTCAC CAGCGACGGC AGACTGCTCG CGTCCGGAGA CCAGGACGGG ACGATCAAAC TCTTCCGACT CGACGGCACC CATCCCCCGA AGCGCATCGC CGAGGCGGCA TCCACCCACG GCTCGGTCCA GGGAATCACC TTCAGCGCCG ACAACGGAAC ATTGGTGACT GGCGGTGATA GCGGTGTGGG CGTCTACGAC GTCCCGGACG TCAGCACCGG CGGGCCGCTG ACCACGGTCA CCGACATCTC GGACGCAACC ACTGGGAACC TCTCCGGAGG TGTCGCGCTC AGCCCCGACG GACGCACACT CGCCGTCGGC GGCCACGGCA CCGTCACGAC CTGGGACATC ACCGGCCCGA CGCCACAGCA GCAGGGCACA TCCCTCCCTG TCGGAAGCAA GGAGGCGGCG GTGAGCACGG TTGACTTCAG CCCGGACGGC CGCCTGCTGA CGGCAGCCGG CAACGACGGC GACATCGCAA TCTGGGACAC CAGCCGCCCC GGCCAGCCGA AGCCGCTCAC CAACCTCCCC GGCCACGACA ACGGAACCCG CCTCGCGGCG TTCACCGACG ACAGCCGAAC ACTCGTGACT CTCGGCTCCG ACTACCTCGC GAAACAATGG GACGTCGACC CCGAAACACT CGTCGGCCGC GCCTGCAACG GAGTCTCCCA ACCGATGACC AAGACAGACT GGCGCACCTT CGTCAACAAC CGCGAGTACG CCGCGCCGTG CTGA
|
Protein sequence | MPSSARAGRG SIAGSSGDGG VEFRRAVAAY AVVAGLSDTP LRGLGISEVD ARVTAVVLET EDAVDDLRVD FATGGTAYIQ AKRTLRGGTV LAKAVAQWAR AGEGPLDPAR HRLVIAAGEL SDSMKALRRA LDARRLSVPG TPSPQEAEQL HRVEELLSPL NPAARDIVLQ CAVVWHLDVE ETDRADAQNA LHQLRGLVAG GDAPAAGNAW SALLDAAGYA ARRRGGHNLD GWRTAMRDRG IGLVEHSNAP EHSGSQSSLA RPSSHPIGDH DLRGRLTLQP PVRAPRLRGS LASATRPGRT RWLWGAALAA VLIMGAGTAI YSVHDRQASA TDERNLADSR QLAADAESTR DSDPNLARQL AVLAWRTAPT MEAWRALFAV ADLPTDVDVD DQVLAVGRVG ERFVAVTALS TKAQLWDVTR VSRKAEPLAV LPGQGRVTAA VFSPDGRRLA TMAVDGSVLV WNVESATAGP PGNNPVELTA LASLPGNDSN SPAMAFNSDG TLLAVPGPEG SVEVWNTRLP AGSASAQNST GPDLEALPLQ IPGVVSGIDD PGVSALAFTP DSGRLAVGTS GGTFAVFALG RSAEPLMRRT NYAGAMNADA VEGTAGQATP PDTMETSHLP PQRQPVTRLL FRPDDGATLA VAGGDGVVRL WDVAGTDPTL ASQNAPANLP TKPTAELGDG AAAVDAIAYS ADGLHLAVGF GASGGTVNVW DVTDESSPAQ GAPIERLNDS PTSLTFDRAG QTLAVGTKDG AVRLSTVASP GMPQPRLTVP DILGDHVAVN GAGTTVAVSL AVDGVDGAIE IWDLRAGSSE PTVTFPGRDA TVYSIAISPD GALLAAGYDD GVVRVWDLRA ALSTRAAASP VADLAEHGAI VQGLAFTSDG RLLASGDQDG TIKLFRLDGT HPPKRIAEAA STHGSVQGIT FSADNGTLVT GGDSGVGVYD VPDVSTGGPL TTVTDISDAT TGNLSGGVAL SPDGRTLAVG GHGTVTTWDI TGPTPQQQGT SLPVGSKEAA VSTVDFSPDG RLLTAAGNDG DIAIWDTSRP GQPKPLTNLP GHDNGTRLAA FTDDSRTLVT LGSDYLAKQW DVDPETLVGR ACNGVSQPMT KTDWRTFVNN REYAAPC
|
| |