Gene Information |
Locus tag | Franean1_6781 |
Symbol | |
ID | 5675094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8253492 |
End bp | 8257184 |
Gene Length | 3693 bp |
Protein Length | 1230 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641245630 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001511021 |
Protein GI | 158318513 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
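The length fields above are consistent with the coordinates if Start bp and End bp are read as a 1-based, inclusive interval: 8257184 - 8253492 + 1 = 3693 bp, and 3693 / 3 = 1231 codons, the last of which is the stop codon, leaving 1230 aa. A minimal Python sketch of that check follows. The function names are illustrative and not part of any database API, and the code assumes an unspliced CDS with a single terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(8253492, 8257184)
print(length, protein_length_aa(length))
```

The strand does not affect this arithmetic: a gene on the minus strand spans the same interval on the replicon.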
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.57679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGGG TGACCAGCGG TGTGCGGGAG CGCTCCGGGC CCATCGCCGT GGTAGGCGCC TCCGGAACTG GCAAGTCATC ACTGCTGCAT GCGGGACTGC TGCCCGCGCT GGCCAGTGGC GAGCCGGGCG TTCCTGCTGC GGCTTCCTGG CCTTACCTGG TGATAACCCC GGGTGCCAAT CCGATCGGTA ATCTCGCGGC CAGAATCTCT ATGTCGGTCG GAGTTTCTTC CCATGAGGCT GAACAGATTC TCCGGTCTTC GCCAGCAGGC CTGGAGCGAC TGGTTGATCA GCTTCTCGTG GACTCGGGAG GTCAACCGCC TCGGCAAGCC CGCCTCCTGC TCATAGTAGA CCAGTTTGAA GAACTTTTTC TGCTTTGTGA CGACGAGGAT GATCGTAAAC TATTCATCAG GGCGATCACA GCTGGCAGTC CGCAAGCTTG CGAGAATGAG GAACCAGTTG AGCTACCGAT GTCGGATCGT GATGAATGTT TATTCACTAC GATACTCGGC ATTCGCGCCG ACGTCTTCGG TGCCTGTTCA CGGTATCCTG AGCTGGCCGA GTCGCTGCGG CGTCCACTTT TGGTGGGCCC GATGTCTGAG GAAGACATCC GACGGACTCT CGAAGGTCCC GCCCAAGTGG CCGGCCTGCG CTGGGAGTCG GGGCTAATCG AACACATCCT CGCTGAGTTT CTGGAACGAG GTCGGTCCGA GGCTGGGACC TTGCCACTCT TGTCGCACGC AATGCGGCAA ACCTGGATCC ATAGTGACCG TAGCACGCTA CTCTTCGCCG ACTACCATGA GTCAGGCGGA GTGCGACGCG CGATCGCCAC GACTGCAGAA TCCTGCTATG CCGGAATGGG CGATCTCGAA CAGGATGTTG CTCGCAATAT TTTCCTGCGA ATGGTGCGGC TTCAGTCGAC CGACGAGCCC ACCAGACGAC GGCTCCCACT GACGGATCTC CCGCCGACAG GGGCTCACCA GCGCGTTCTT GGACGGCTTG TGCAAGACCG GTTGGTGAGC GTCGATCGCG GCTCCTGCGA AATTGTGCAC GAGGCGCTGC TGCGGGAATG GCCTAGGCTC CGCGACTGGA TCAGTCGTGA CCAGGAGCGT CTTCGCGCGC GAGATGAGCT GGTCGGGCAC GCGGAGGCAT GGGAGCGGTC GGGCCGCGAC GATTCACGGC TTTACCGAGG GGCCCAGCTG GGACGCATCC GCGAACGCCT CGCGACACCG AGCTCGAGCG AGGAGGATCT GCCCGAGCCT GGGCCGTCGT TCCTCGCCGC GTCGGTCGCT CACCAGGCCG AGCTCGACGC CGAAGCGGAA GCACGGGTGC GGGGGGACCG CCGGACCAGC CGACGACTGC GCCAACTCAC GGCAGGCCTA GCCATTCTGA CGGTCCTCGC GGTCGCCGGC GCGTCTGCAG CCATTGTGCA GCGCGGGAAC GCCGTCGCCG AGCGCGATGC CGCCAGCCAG GCAAGCATCC TCTCGACCGC GGACGGGTTG GTGGCCCGCG CCGATGCTGC CCGCGATGTC GATCCGCTGC TGTCTCTGCG GCTGGGTATC GCAGCACGAG CGGTCGCAGA CAATACACGG ACCCAGTCCG GCCTCCTTGA GACGATTACC GATTCCCCTT ACATCGCGTC GAGCTTCGCT ATCTCTGACC CGTTCCTGGA AGTGGCCGTC AGCCCGGACT CCCATCTTGT TGCCGTCGGC GGAGTTAGTG GGAAAATCGA ACTCTGGAAC ATCGTCGACC CGCGGCGACC GAAGAAGCAG TACGGGCCGT TCGGGCTCGG CGATGGAGTA 
GCCATCACTC AGATCACCTT CGACCCGCAG TCGTCCGCGC TGGCTGTGGT GACGATCGAC AAGGTGACGC TCTGGGACAT CAGCAATCCG ACGGCGCCCC GGCCGTTCGG TCAAGTCCAG GAAGATGTCC TTAGTTTTGT CAAATTCAGC CCCTCTGGTG AGATTCTGGT CGTCGCATCC CCGGGGATGA CATCATTGTG GGATGTGACC GTGCGTGACC AGCCGGAGCG GATAGGTGTG CCGGTTGTTG GCAATGGTGG AGCTGTATGG GCTGTGACCT TCTCGCCAGA CGGACGACTC CTCGCTACCG GCGATAATAT CGGCTATGTG CTCGTTATGG ACGTCAGCGA TCCGAGATCT GTGGCGACGC TGGGGCGCTT CCCTTCTCCG CAGGGGACGG TGACCAGTCT CGCTTTCGTC GGGGACGACG AACACCTGGC CGTGGGAGCA GGAAAATACG ACGGAAATAT CATCATGATG GACATCGATA ATCCTGGCGC CCCGATCGTT ATAGAAAATA ATCCGTTACG CCATTTCACC AAACGGATCG TGGGTCTTGA CACCGCCAAG GGATCAGATT TCCTTTTTGC TGCCAGCGAG AACGGAGAAC TGGTTGCCTG GAATGTCTCC GACTCGGAGG TACCGTACCC GGTCCAGTCG TCGTTCGCCG GGCACACCCA GAGTATCTCC GGCCTGGACT TGGCCGAGGA TGGGAGGATC CTCGCGACGG CCGGTGTTGA TGGGACAACG ATCTTGTGGG ACATCGCCCA ACCGACCGAG CCCCGACAAG TCGCAGGAAT CACGACCGAG CCCGAAGTTG GCTACAGAGC GCTCGCGACC AACCGCAGCG GTACGCTCCT GGCTGGCGGA CGCAGCGACG GCTCGACCCT CTTGTGGGAT ATCACCCATT TGGATAAGCC GCGTCAAATC ACGACCATCA ACGCGGGATC GGAAGTCAAT AGCCTTGCAC TTACCGGTGA TGGCGGCATG CTGGCGGTTG GACATGTCAA CGGCGATGTC GCGCTATGGA ACATCCGTGA CGCGGAACAC CCTGAACTCA TCACCACCAA ACGCTCCGGG CTCGGGCTAG TCACGGCGGT AGAGTTCAAC AAACCCGGCG ACCTGCTCGC TATCGGTGCA GTGTCTGCCA CCGGCAGCCG CGTTGATGAC GCGAGCGTGA CACTGTGGAG GGTCAACGAG CGGGAGCCAT CCGCTATCGA GCTCTACCGG GGTTCGGACG CCCCGCAGTC ACTCTCCTTC TCTCCAGATT CAGATGCAAT TCTGGTCGGC TTTGAAGGAG GAACCTCGCG ACTGTGGCAG CTGCCAGCGA ACTCCACAAC TATGGTCACC GAGCTTCCCG GGCACAGTTT CGAAGTAGCT GCTACTGACT TCGACGATCG CAGCAAGCTG GCCGCCACCG GGAACGGCAA TTCTAACGGC AAGGTTCTGC TCTGGGACGC CACAGATCTG CAGGCTGTTC ATAGGGTCGG TGATCCATTG ATCGACCCAG ACCAGGAGGT AATAACCGAG ATCGAGTTCA GCCCGGGCGA GTCGATGTTG GCGGTCGGAG GCACCCGCCA GTACGACATG GGCCAAGCTG TCAGCTCGGT CCGGCTCTGG AACCTCGATG AGGCATCCAG GCCACGCTCT CTCGGAACAA TTCACGTGAG TCCCACCAAC AACCTCTCTG ATCTCGCTAT CGTGCAGCGC GACGGAGCAA TCGTCACCGC CACGGCCTTT CCCGAGGGAG GGATCCAAAT CTGGGACGCT AAGGTACTTC GCACTGTGCG AGAAGATGCG ATCAAGATCA 
GCTGTCAGCG GGCAGAACGC GGTCTTAACG AAGCGGAATG GGCCCAGTAC GTACCCGGCC TCCCCTACCG CGCCACCTGC TGA
|
Protein sequence | MERVTSGVRE RSGPIAVVGA SGTGKSSLLH AGLLPALASG EPGVPAAASW PYLVITPGAN PIGNLAARIS MSVGVSSHEA EQILRSSPAG LERLVDQLLV DSGGQPPRQA RLLLIVDQFE ELFLLCDDED DRKLFIRAIT AGSPQACENE EPVELPMSDR DECLFTTILG IRADVFGACS RYPELAESLR RPLLVGPMSE EDIRRTLEGP AQVAGLRWES GLIEHILAEF LERGRSEAGT LPLLSHAMRQ TWIHSDRSTL LFADYHESGG VRRAIATTAE SCYAGMGDLE QDVARNIFLR MVRLQSTDEP TRRRLPLTDL PPTGAHQRVL GRLVQDRLVS VDRGSCEIVH EALLREWPRL RDWISRDQER LRARDELVGH AEAWERSGRD DSRLYRGAQL GRIRERLATP SSSEEDLPEP GPSFLAASVA HQAELDAEAE ARVRGDRRTS RRLRQLTAGL AILTVLAVAG ASAAIVQRGN AVAERDAASQ ASILSTADGL VARADAARDV DPLLSLRLGI AARAVADNTR TQSGLLETIT DSPYIASSFA ISDPFLEVAV SPDSHLVAVG GVSGKIELWN IVDPRRPKKQ YGPFGLGDGV AITQITFDPQ SSALAVVTID KVTLWDISNP TAPRPFGQVQ EDVLSFVKFS PSGEILVVAS PGMTSLWDVT VRDQPERIGV PVVGNGGAVW AVTFSPDGRL LATGDNIGYV LVMDVSDPRS VATLGRFPSP QGTVTSLAFV GDDEHLAVGA GKYDGNIIMM DIDNPGAPIV IENNPLRHFT KRIVGLDTAK GSDFLFAASE NGELVAWNVS DSEVPYPVQS SFAGHTQSIS GLDLAEDGRI LATAGVDGTT ILWDIAQPTE PRQVAGITTE PEVGYRALAT NRSGTLLAGG RSDGSTLLWD ITHLDKPRQI TTINAGSEVN SLALTGDGGM LAVGHVNGDV ALWNIRDAEH PELITTKRSG LGLVTAVEFN KPGDLLAIGA VSATGSRVDD ASVTLWRVNE REPSAIELYR GSDAPQSLSF SPDSDAILVG FEGGTSRLWQ LPANSTTMVT ELPGHSFEVA ATDFDDRSKL AATGNGNSNG KVLLWDATDL QAVHRVGDPL IDPDQEVITE IEFSPGESML AVGGTRQYDM GQAVSSVRLW NLDEASRPRS LGTIHVSPTN NLSDLAIVQR DGAIVTATAF PEGGIQIWDA KVLRTVREDA IKISCQRAER GLNEAEWAQY VPGLPYRATC
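The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below is one hedged way to strip the spacing, compute a GC fraction, and reverse-complement a DNA string (relevant because the gene lies on the "-" strand). The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence. The code does not claim to reproduce the 62% GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Sequence of the opposite strand, read 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


# First two displayed blocks of the gene sequence, used as a small example.
example = clean_sequence("ATGGAGCGGG TGACCAGCGG")
print(len(example), gc_fraction(example), reverse_complement(example[:10]))
```

GC fraction is the same for a sequence and its reverse complement, so the strand of the displayed sequence does not change the result.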