Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6056 |
Symbol | |
ID | 5674377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7368859 |
End bp | 7372749 |
Gene Length | 3891 bp |
Protein Length | 1296 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641244904 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_001510306 |
Protein GI | 158317798 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.376172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGGACG TCAACTTCTT CGACGAGCTG CGCATCGGCC TGGCCACCGC GGACGACATC CGCCAGTGGT CGTTCGGTGA GGTCAAGAAG CCGGAGACGA TCAACTACCG CACCCTCAAG CCGGAAAAGG ACGGGCTCTT CTGCGAGAAG ATCTTCGGTC CGACCCGGGA CTGGGAGTGC TACTGCGGGA AGTACAAGCG GGTCCGGTTC AAGGGCATCA TCTGTGAGCG CTGTGGCGTC GAGGTCACGC GCGCCAAGGT GCGCCGCGAG CGGATGGGCC ACATCGAGCT CGCCGCCCCC GTCACCCACA TCTGGTACTT CAAGGGCGTG CCGAGCCGTC TCGGCTACCT GCTCGATCTG GCGCCGAAGG ACCTCGAGAA GGTCATCTAC TTCGCCGCCT ACATGATCAC AAAGGTCGAC ATCGACTCCC GGCACCGCGA CCTGCCGACC CTCGAGGCCC GCATCGGTGT CGAGAAGCAG CAGCTCGAGG ACAAGAAGAA CGCCGACGTC GAGACCCGCC AGCGCAAGCT GGAGGAGGAC CTCGCGCAGC TCGAGGCCGA AGGCGCCAAG GGCGACGCCC GCCGCAAGGT GCGCGAGTCG GCCGAGCGCG AGATGCGCCA GATCCGTGAC CGCTCCCAGC GCAAGATCGA CGACCTCGAC CGGGTCTTCG ATCGGTTCAA GAACATGAAG GTCCAGGACC TCGAGCCGGA TGAGCTGCTC TTCCGCGAGC TGCGCGACCG GTTCGGCCAG TACTTCGAGG GCGGCATGGG CGCCGAGGCG CTGCAGCACC GGCTCGCCGA CTTCGATCTG GCCGCCGAGG CGGAGTCGCT GCGCGAGACC ATCCGCAGCG GCAAGGGGCA GAAGAAGGCC CGCGCGCTCA AGCGGCTCAA GGTCGTGTCG GCGTTCCTGA ACACCCGCAA CTCCCCGATG GGGATGGTGC TCGACTGCGT TCCGGTGATC CCGCCGGACC TGCGGCCGAT GGTCCAGCTC GACGGTGGCC GGTTCGCCAC CTCCGACCTG AACGACCTGT ACCGCCGGGT GATCAACCGG AACAACCGGC TCAAGCGGCT GCTCGACCTC GGCGCGCCCG AGATCATCGT CAACAACGAG AAGCGGATGC TGCAGGAGGC CGTCGACGCG TTGTTCGACA ACGGCCGCCG CGGCCGGCCC GTCACCGGGC CCGGCAACCG TCCGCTCAAG TCGCTGTCCG ACATGCTCAA GGGCAAGCAG GGCCGGTTCC GCCAGAACCT GCTCGGCAAG CGCGTCGACT ACTCGGGCCG TTCGGTCATC GTGGTCGGCC CGCAGCTCAA GCTGCACCAG TGCGGCCTGC CCAAGCAGAT GGCGCTCGAG CTGTTCAAGC CGTTCGTGAT GAAGCGCCTG GTCGACCTCA ACCACGCGCA GAACATCAAG TCGGCCAAGC GCATGGTAGA GCGGGCCCGC CCGGTCGTGT GGGACGTCCT CGAAGAGGTC ATCACCGAGC ACCCCGTGCT GCTCAACCGG GCGCCGACCC TGCACCGCCT CGGCATCCAG GCCTTCGAGC CGCAGCTGGT CGAGGGCAAG GCCATCCAGA TCCACCCGCT GGTCTGCACC GCGTTCAACG CGGACTTCGA CGGCGACCAG ATGGCGGTGC ACCTGCCGCT GTCCGCCGAG GCGCAGGCCG AGGCCCGGAT CCTGATGCTG TCGAGCAACA ACATCCTGTC GCCGGCGTCG GGCCGTCCGC TGGCCATGCC CAGCCTCGAC ATGGTCACCG GGGTGTTCCA CCTGTCCCGG GTGTCCGAGG GTGCGATCGG CGAGGGCCGG TTCTTCTCCA GTGTCGCCGA GGCCCAGATG GCCTTCGACG CGCGGGAGAT CCACCTGCAG GCCCGCATCC AGGTGCGGCT GCGGGAGAGC ACGCCCCCCG CCGAGTGGGC GCCGCCCGCC GACTGGCTGC CGGGTGACCC GTTCACCCTG GAGACCACGT TCGGGCGCTG CCTGCTCAAC GAGGCGCTGC CGGAGGGCTA CCCCTTCATC AACGCCCAGC TGAACAAGAA GGCGCAGGCG GCGATCGTCA ACGACCTCGC CGAGCGGTAC CCGAAGATCC AGGTCGCGGC GACGCTGGAC GCCCTCAAGA GCGCCGGTTT CTACTGGGCC ACCCGGTCCG GTGTGACGGT CGCGATCGAG GACGTCGTGG CTCCGCCGAA CAAGGCCCAG ATCCTCGACG AGTACGAGCA GCGGGCCGAG CGGGTCGAGA AGCAGTTCGG CCGCGGTTTC CTCTCCGACG AGGAGCGCCG CAGCGAGCTG GTGCAGATCT GGACCGAGGC GACGAACAAG ATCGCCGAGG CCATGGAGGC GAACTTCCCG GAGACCAACC CGGTCTACAC GCTGGTCAAC TCCGGGGCCG CCGGAAACAT GATGCAGATC CGGCAGCTCG CCGGCATGCG CGGACTGGTC TCCAACCCCA AGGGTGAGAT CATCCCGCGG CCGATCAAGG CGAACTTCCG CGAGGGCCTC ACCGTCGTCG AGTACTTCAT CTCCACGCAC GGTGCCCGTA AGGGTCTCGC CGACACCGCC CTGCGGACCG CCGACTCGGG CTACCTGACC CGTCGTCTGG TGGACGTCAG CCAGGACGTC ATCGTCCGCG AGGAGGACTG CGGCACCGAG CGCGGCATCC TGACCCGGAT CGCCCGCAAG GGACCGGACG GGGTGCTGGT GCGCGACCGC TACGCGGAGA CCTCCGCGTA CGCCCGCTCG CTCGCCTCCG ACGCCGTGGA CGCCCAGGGT GAGGTCGTCG TCCCGGCCGG CGCCGACGCC GGCGACGTGG TCATCGGCCA GATCATCGAG GCCGGGATCG AGTCGGTGCG GGTGCGCTCG GCGCTCACCT GCGAGTCGCG GATGGGTGTC TGCGCGCACT GCTACGGCCG CTCGCTGGCC ACCGGCAAGC TGGTGGACGT CGGTGAGGCC GTCGGCATCG TCGCCGCCCA GTCCATCGGT GAGCCGGGAA CCCAGCTCAC GATGCGTACC TTCCACAGCG GTGGCGTCGC CGGTGACGAC ATCACCCAGG GTCTGCCGCG TGTCGTCGAG CTGTTCGAGG CCCGTAGCCC CAAGGGCAAG GCCCCGATCA GCGAGGTGAC CGGCCGGGTC AAGATCGAGG AGACGGAGAA GACCTTCAAG GTCGTCATCG TCCCCGACGA CGGCAGCGAG GAGATCGCCT ACCCGGTCTC CCGCCGCTCC CGGCTGCGGG TCCGCGAGGG CGAGCGGGTC GAGGTCGGCG CCCAGCTGAT CGACGGCGCC GTCGACCCGC ACGAGGTGCT GCGCATCCTC GGCCCGCGCC AGGTCCAGCT CCACCTGGTC GACCAGGTCC AGGAGGTGTA CCGGTCGCAG GGTGTGTCGA TCCACGACAA GCACATCGAG ATCATCATCC GCCAGATGCT CAAGCGGGTG AACGTGCTCG AGTCGGGGGA GACCACACTG CTGCCGGGTG AGCTCGTCGA GCGCGCCCGC TTCGAGGGCG AGAACCGGCG CGTGGTGGAG ATCGGCGGCC AGCCGGCCTC GGCCCGCCCG GTGCTCATGG GCATCACCAA GGCCTCGCTG GCCACCGAGT CGTGGCTGTC GGCGGCGTCC TTCCAGGAGA CCACCCGGGT GCTCACCGAC GCGGCGATCA ACGCCCGCTC GGACTCGCTG GTCGGCCTCA AGGAGAACGT CATCATCGGA AAGCTCATCC CGGCGGGTAC CGGCATCTCC CGGTACCGCA ACATCCGGGT GGAGCCGACC GACGAGGCGC GCGCGGCGAT GTACTCGGTC AGCGGCTACG AGGACGGCGC CTCGGTCGAG TACGGCGCCT TCGGTGCCGG ATCCGGCCAG GCCGTCCCGC TGGACGAGTT CGACTACCGC AGCTCGGGCG ACTACCGCTG A
|
Protein sequence | MLDVNFFDEL RIGLATADDI RQWSFGEVKK PETINYRTLK PEKDGLFCEK IFGPTRDWEC YCGKYKRVRF KGIICERCGV EVTRAKVRRE RMGHIELAAP VTHIWYFKGV PSRLGYLLDL APKDLEKVIY FAAYMITKVD IDSRHRDLPT LEARIGVEKQ QLEDKKNADV ETRQRKLEED LAQLEAEGAK GDARRKVRES AEREMRQIRD RSQRKIDDLD RVFDRFKNMK VQDLEPDELL FRELRDRFGQ YFEGGMGAEA LQHRLADFDL AAEAESLRET IRSGKGQKKA RALKRLKVVS AFLNTRNSPM GMVLDCVPVI PPDLRPMVQL DGGRFATSDL NDLYRRVINR NNRLKRLLDL GAPEIIVNNE KRMLQEAVDA LFDNGRRGRP VTGPGNRPLK SLSDMLKGKQ GRFRQNLLGK RVDYSGRSVI VVGPQLKLHQ CGLPKQMALE LFKPFVMKRL VDLNHAQNIK SAKRMVERAR PVVWDVLEEV ITEHPVLLNR APTLHRLGIQ AFEPQLVEGK AIQIHPLVCT AFNADFDGDQ MAVHLPLSAE AQAEARILML SSNNILSPAS GRPLAMPSLD MVTGVFHLSR VSEGAIGEGR FFSSVAEAQM AFDAREIHLQ ARIQVRLRES TPPAEWAPPA DWLPGDPFTL ETTFGRCLLN EALPEGYPFI NAQLNKKAQA AIVNDLAERY PKIQVAATLD ALKSAGFYWA TRSGVTVAIE DVVAPPNKAQ ILDEYEQRAE RVEKQFGRGF LSDEERRSEL VQIWTEATNK IAEAMEANFP ETNPVYTLVN SGAAGNMMQI RQLAGMRGLV SNPKGEIIPR PIKANFREGL TVVEYFISTH GARKGLADTA LRTADSGYLT RRLVDVSQDV IVREEDCGTE RGILTRIARK GPDGVLVRDR YAETSAYARS LASDAVDAQG EVVVPAGADA GDVVIGQIIE AGIESVRVRS ALTCESRMGV CAHCYGRSLA TGKLVDVGEA VGIVAAQSIG EPGTQLTMRT FHSGGVAGDD ITQGLPRVVE LFEARSPKGK APISEVTGRV KIEETEKTFK VVIVPDDGSE EIAYPVSRRS RLRVREGERV EVGAQLIDGA VDPHEVLRIL GPRQVQLHLV DQVQEVYRSQ GVSIHDKHIE IIIRQMLKRV NVLESGETTL LPGELVERAR FEGENRRVVE IGGQPASARP VLMGITKASL ATESWLSAAS FQETTRVLTD AAINARSDSL VGLKENVIIG KLIPAGTGIS RYRNIRVEPT DEARAAMYSV SGYEDGASVE YGAFGAGSGQ AVPLDEFDYR SSGDYR
|
| |