Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6769 |
Symbol | |
ID | 5675082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8232858 |
End bp | 8236907 |
Gene Length | 4050 bp |
Protein Length | 1349 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245618 |
Product | hypothetical protein |
Protein accession | YP_001511009 |
Protein GI | 158318501 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGCAG ACGCGGTGGA ACGGTTCCTG GAAGACGCCG ATGCCGAGGA GCGGGAGTGG GCCGCGCCGG CCGGCGCACC GCGCGAGCTC GTCACCAAGC GGTTCCTGTT CGGAAACGGT GACGCGGCGC TTGAGGTAGC GGTCGCGGTC AGCCCTGCGG GTCCGCCGAG GATTCCAGAC CTGCGCCTGC TGTGGAAGCT GCGGCAGGGC AGCCGGGCGT CACCGGTCCT GCTGGTTGCG CTCTACGATG AGGGCACCGC GACGAAAGCG GCCACCTGCG GGATCGTCGA GAACCCGCTC ATGTCACTGG ACCCCGGCCA ACTCGACCGG GTGTGCGCGG CAGCGTTGCG GGAGCCCAAC CGGCACGACG CCCAGCGCAA GCTCGAGCGG CTGCTGGCCG CGATCAGTGA CGGGCAGGGA ATTTCAGGGC TGGACAACCG CGGCCTGTTT GCGGAGCACC AACTCCGCAA TGGTGTCCCG GAAGGTAAGT CGTGGGCTGA GGCCGGTCAG GTGGCCCGGC CGCTGCTCGG CCTGCGGGGT ATGGCGCTGA TCCGGGCGTT GGGTTACGGC ACGACAGAGC GCGGTTCCAC CGCGACGCTG TTGACGCATC AGGGGACGTC GCGGGCGATC GCGGTGCTCC TTGAAGCTGG GGAGATGTTC GACCGGCCGT CACCGCGGTT CGGCGCGGTC TCGCCGGTGT TACACGCGAT CTCGGTGGCC GCACGGGAGA ACCTGCCCTG GGTCATCGTG CTGCGTGGCA GCCAGATCCG GCTGCACCCG GTCAACCCGA CCATCGGCGT CGGACGCAAG AGTCAGGGCG AGACCTTCAC CGAGCTCGAT CTGGACGTGC TCTCTGCCGA GCACGCCGCC TTCCTGTACC TGCTGTTCTC GCCGACAGCG CTCGCCTCCG AGGGCTCCGT CACCGAGATC CTTCGCGACT CGGCGAACTT CGCCGCCGAT CTCGGCAGCC GCCTACGCGA GCGGGTCTAC ACGGAGGTCG TGCCCGGCCT CGCCGTCGCA ATCGCCCGGC AGATGCGCGC GACCAGCGAG GCTGACCTTC AGGAGGCCTA CCACCGCACC CTGATGGTCC TGTTCCGGCT GCTGTTCGTC GCATACGCCG AGGACCGGGC GCTGCTTCCG TACGGCCGGA ACCGTAGCTA CGACCGGCAC GCTCTTAAGA CGCTCGCGAA GATGTTCGCC GACGAGCCCA AGCTGGTCTT CGACCGCGAG GCGACCTCGC TCTGGGACGA CATGCAGGCC GTTTGGAAAG CCGTCGACAG CGGCAACACC GGCTGGGACG TCCCGGCCTA CAACGGCGGT CTGTTCACCC ACGATCCGCG CACCAGCCCG TCCGGTGCCG CGCTGCGCAA GACGCAGCTG ACCGACGCCG AGTTCGGCCC CGCCCTACGT GCTCTGCTCG TCGACACGGG ACCGGACGGC ACGCAAGGCC CGGTTGACTT CCGTTCCCTG TCCGTCCGCG AGTTCGGTAC CCTTTACGAG GGCTTGCTTG AGTCGTCGCT GTCGATCGCG TCGTCGCCGC TCACGATCAA CAGCGAGGGA AGCTACGTCC CAGCCGGTGA ACGGGATCTC ATAAGGGTCA AGGCCGGTGA GGTCTACTCC CACAATCGTT CGGGTAAGCG CAAGGCTACG GGCTCGTATT TCACCAAGTC GTTCGCAGTC GAGCATCTGC TGGACACCGC TCTCGAGCCT GCGATCAAGG AGCACCTCCA CCGAGTCAGG ACGCTCCTGG ACGCTGGCGA CGACGCGGCG GCCGGCGAGA GTTTCTTCGA CTTCCGGGTC GCGGACCTGG CGATGGGATC GGGCCATTTC CTGATCGCGG CGATTGACCG GATCGAGACG CGGTTCACCG CCTTCCTCAG CGACAGGCCG ATCGCCGCGG TTGGAGATGA GCTGGCGCGG TTGGCGCAGG CAGCCCGTGA TGCGCTGGGA ACCTCGGCCG TGGAGGTCGA AGTCGAGACG TCGTCGTTGT TGCGGCGGCA GATCGCCAGG CGTTGCATCT ACGGACTGGA CTTGAACCTG ATGGCAGTCG AGCTGGCCAG GCTGGCTGTC TGGATCCACA CCTTCGTTCC CGGACTGCCG ATGTCGTCGC TCGACCACGG GCTGATTGTT GGCAACAGCC TCACGGGCAT CGGCACGATC GACGAGGTGG TCGCCACCCT CGACCCGAGC GCCGAGGGTG TGGGGGTCAA GAGCTTCTAT GAGGACGCGA TTCGGGAGAC GCTACATGAG GCGGGCGAGC AGCTGGCGCG GGTCGCGCGG ACATTGGAGG CCACCAAGCA GGAGGTCCAA GAGGCCGCTC TCGCTTACGC CGCGGCGCTG AAGGCGGCGG CCGACGCCCA GGCTCTGTTC GACACGATGA TCGCGGTCCG GCTGGGAATT GGAGTGTCCC TGGTAACGCC GGAGCAGGCG ATCGAGATGG GCCACACCGA CCAGGTCCGC GACGCGGTAG CGAAGCTCGG GGTCGCGCAT TTCCCGCTGC GCTTCCCGGA GGTGTTCCTG CGGGAGCGGC CGGGGTTCGA CGTGCTGATC GGCAACCCGC CGTGGGAGAA GGTCAAGGTC GAGCAGGACC AGTGGTGGGG CCTTCGGTTC CCCGGCCTCG CTTCGATGCC GCAGAAGCAG CGAGACGCCG CGATCGCCAA GTACCGGCGC GAGCGGCCGG ATCTCCAGGC CGAGTACGAG ACCGAGGTCA CGGCCGTGCA GGACCTGAAG GATCTGCTGG CAAAGGGGCC GTTCCCCGGG CTGCGGGCCG CGACCGACAC CGAACTGTCA CTTGCCTTCG CTTGGCGATT CTGGCAGGTA CTGCGGCCCG GCGGACGGGT CGGCGTCGTC CTGCCGCGCA CGATCCTCGG TGGCAAAGGC GGCACTAAAC TCCGGCACAC AGTCCTTGAG AACGGCGCGT TTGCGACCGT CACGATGCTG GTCAACAGCC GCCGGTGGGT ATTCGACGAG GTGCACCCGC AGTACACAAT CGGGCTTACG ACGCTGGTAA AGGGCCGAGC CCATCCTCGT ACCGTGGAGC TGCGTGGCCC GTTCTTCTCG CTGACCGAGT TCGACGGTGC TGCTCACCGG ACCGGGCATG TCATCGAGGC CGGTGCCTTC GTGTCCTGGT CCGAGGACGC CGCGTTCCCA GTGCTGCCGA CGCCGGCCAG CCTCGATGTC TTCCTCACGC TGCGCTCCCA CCCGCGGCTC GACACTCCGG GCGACTGGCA CTTCGTGCCG CTCCGCGAGC TTCACTCGAC CGGCAACAAG GAGCTGTTCG ACTTCGACCT GGACCACCCG CGCGGTGACA TGCCAGTGCT GACCGTGCGG TCGTTCCACC TGTGGGATCC CTACTTCGGT CTGCCGTACG CGTACGCGAA GTCCACCGAC GTCTACGCCT ACTTGCAACC ACGGCGCCGG AAGCAGGTCA GGAAGTCCGA CAGCGCCATG TTCGGGATGC CCACCGCCTG GGCTGCGGAC CCCGAGACTC TGCCGGCCAA ACACCCGCGC ATAGTCTTCC GGGACACCGC CCGAGCCGTG GACTCTCGGA CGGCAATCGT CGCGCTCGTG CCCGGTGGGA CCACCACCGT TCACTCGGCC CCATACCTCC TCCGCCGTGC CGGGACCGCT GCCGATGAGG CTTACCTGCT GGGTGTCCTG TCCAGCCGGA TCCTGGACTG GTACTCCCGC CGCTACGTCG AGTTGCACTT CACCGCGGGG ATCATGTCGT CGCTGCCGAT CCCGAGGCCC GCCGCCGACG ATCCGCTACG CCGCCGCGCC GTCGAGATAG CGGGCAGGCT CGCGGCGGTG GATCACCGCT ACGCCGCCTG GGCCAGAGAG GTCGGTGTAC CAGTCGGGTC CGTACGTACG GCCAGCGAAA AGGTCGACCT CATCGCCGAA CTAGACGCGG TTGTCGCGCT GCTCTACGGC CTCGGCCGGG CGGCCGTCGA GCACATCTTC GAGACGTTCC ACCGGGGTTG GGGCCACGTG GACGCCCTCG CCGCGACGCT CCACCACTTC GACTCCTGGG CTACGAGGAA GGCCACGTGA
|
Protein sequence | MLADAVERFL EDADAEEREW AAPAGAPREL VTKRFLFGNG DAALEVAVAV SPAGPPRIPD LRLLWKLRQG SRASPVLLVA LYDEGTATKA ATCGIVENPL MSLDPGQLDR VCAAALREPN RHDAQRKLER LLAAISDGQG ISGLDNRGLF AEHQLRNGVP EGKSWAEAGQ VARPLLGLRG MALIRALGYG TTERGSTATL LTHQGTSRAI AVLLEAGEMF DRPSPRFGAV SPVLHAISVA ARENLPWVIV LRGSQIRLHP VNPTIGVGRK SQGETFTELD LDVLSAEHAA FLYLLFSPTA LASEGSVTEI LRDSANFAAD LGSRLRERVY TEVVPGLAVA IARQMRATSE ADLQEAYHRT LMVLFRLLFV AYAEDRALLP YGRNRSYDRH ALKTLAKMFA DEPKLVFDRE ATSLWDDMQA VWKAVDSGNT GWDVPAYNGG LFTHDPRTSP SGAALRKTQL TDAEFGPALR ALLVDTGPDG TQGPVDFRSL SVREFGTLYE GLLESSLSIA SSPLTINSEG SYVPAGERDL IRVKAGEVYS HNRSGKRKAT GSYFTKSFAV EHLLDTALEP AIKEHLHRVR TLLDAGDDAA AGESFFDFRV ADLAMGSGHF LIAAIDRIET RFTAFLSDRP IAAVGDELAR LAQAARDALG TSAVEVEVET SSLLRRQIAR RCIYGLDLNL MAVELARLAV WIHTFVPGLP MSSLDHGLIV GNSLTGIGTI DEVVATLDPS AEGVGVKSFY EDAIRETLHE AGEQLARVAR TLEATKQEVQ EAALAYAAAL KAAADAQALF DTMIAVRLGI GVSLVTPEQA IEMGHTDQVR DAVAKLGVAH FPLRFPEVFL RERPGFDVLI GNPPWEKVKV EQDQWWGLRF PGLASMPQKQ RDAAIAKYRR ERPDLQAEYE TEVTAVQDLK DLLAKGPFPG LRAATDTELS LAFAWRFWQV LRPGGRVGVV LPRTILGGKG GTKLRHTVLE NGAFATVTML VNSRRWVFDE VHPQYTIGLT TLVKGRAHPR TVELRGPFFS LTEFDGAAHR TGHVIEAGAF VSWSEDAAFP VLPTPASLDV FLTLRSHPRL DTPGDWHFVP LRELHSTGNK ELFDFDLDHP RGDMPVLTVR SFHLWDPYFG LPYAYAKSTD VYAYLQPRRR KQVRKSDSAM FGMPTAWAAD PETLPAKHPR IVFRDTARAV DSRTAIVALV PGGTTTVHSA PYLLRRAGTA ADEAYLLGVL SSRILDWYSR RYVELHFTAG IMSSLPIPRP AADDPLRRRA VEIAGRLAAV DHRYAAWARE VGVPVGSVRT ASEKVDLIAE LDAVVALLYG LGRAAVEHIF ETFHRGWGHV DALAATLHHF DSWATRKAT
|
| |