Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5240 |
Symbol | |
ID | 5673574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6296933 |
End bp | 6300094 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244094 |
Product | hypothetical protein |
Protein accession | YP_001509504 |
Protein GI | 158316996 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.170249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTT ACCAGCCCAT GCTGTTCGTC GGACTGGGCG GAACAGGCTG CCTGGTCGGC GCCGAGCTCG AGCGCCGGCT GCGGCACGAG CTGTGCGGCC CGGACGGCCG GGCGCTGAAC CTGCGGGTGC CCGGCAAGAA CTATCTGCCC TACCAGCTGC CGTCCTGCGT GCAGTTCGTC TACGCCGACC TCAACGAGTC CGAGCTCACC AGGCTGCGCA CCCGGGTCGT CCCCTCGGAG GAGCACGCGA ACGCGGCGGC GCCCACCCAG CACGTCACGC ACGGCCTCAT CCCGACCGTC GACACCTACC CGGACGTCGC CCGCAGCCTG CGGGTGAACG CGCGGGAGCT GGTCCGGGGC TGGCTGCCGC CGGCCGAGGG CGAGCCCCGG GTGGCGCCGC TGATGCGGGG GGCGGGCCAG CTGCCCACCG TCGGGCGGGC GGCACTGTTC GAGACGTTCC GGTCCGGGGT GGCGCCGGCC CGCCGTCCAC TGTCCGACGC AGTCGGCAAG ATCGCGACCT CCGGCCAGGA GCTGGCGGCG CTCGGCGGCC GGCTCGGGCA GACCTGTGAC GTCTTCGTCG CCTTCTCCGT CGCCGGCGGC ACCGGCGCCG GCATCTTCTA CGACTACCTG CACCTGATCG GCGACGCGTT GCAGCGCTCG AAGGTCCGGG CGCAGATCTA CCCGCTGGTG CTGATGCCCT CGGCGTTCGA CGAGGGGGTC GGCGGCGGCC GGCGGGCCCG GCTGAACGCC GGCCGCGCGC TGCTCGACCT GTTCCGCCTC ATCGACGACC AGAACGGCCA CGACGCGGGG ACGGAGCTGA CCGCGCGCGG CACCTCCGGC ACGCTCGGCG TCCAGTACCC GGACGGCGAG GGCCAGATCA ACCTGTTGCC CTCGACCATC CAGACCGGAT TCCTGTTCAA CCGGGCCGAC GGCGTGGAAC GCGACGACCT GCACCGCTCG GTCGTGTCGC TGATCCTCTC CCTGGTCGGC ACCGGGCAGG AACACGACGA CGGCGACGCC ATTGCCGACC GGATCTACCA GTCGTTCGCC GACGACTTCA TCAACCGCGG CGTCGAGCGG GAGGTGCCGG CGTCCTCCGG CATCGGCCGG CGGGGCGTGT CGACGAGCCT GGTCGCGTCC ATGACGATCC CGGTCGAGGA GCTGGCCGAC CTGGTCGCCT CCCGGCTGCT GGCCCGCGGG GCGTACGACC TCGCCGCGCC CGCCCCGGGC ACGGCCGCCG ACGACGCCGG CATGATCAGG CGGTTCGCCT CCGCCGCGAA CATCGGCCCG ATGGTCACCC GGCAGCCCTT CCAGTTCACC GAGCCGGCAC CGGCCCGCGG CGCGGCGGAG ATCCTCTCCG CGCTGCGGGC CCGGCTGCAG GCCATGGAGG GCAACCTGGC GGGCCTGGGC ACGATGGTCC GCCAGCAGGT GCCGGCGATC GCCGGCGCCT TCGACCCGGC GGCCGGCCTC GACGACCTGC TCGGCGAGGT CGACCTGCTG CACGCCCGCC GGATCGTCGA GGGCCGCGCG GGGGAGAGTG ACCCGGTGAG CCGGGGCGGG GTGGTCGGCT TCCTGGCCAA CCGCCGCACC GAGCCGCCCG CGCCCGCCGG GATCTCGGTG AACCCACCGG CGCCGCTGGC GATCCGCGAC CGGCGCCTCG GCGCCAAGGT GCGCTGGGGC GACCGGGAGG TGGTGGAGAC CATCCACCGC CAGGACACCT GGTACAGCTG GCGTACCCGG TGCGTCTGGA ACGCCGCCTG GGACGAGCAG CAGGTGCGCT GGGAGAAGGC CCTGCGGCGG TTCCGCGCGC AGCTGAGGCT CGCCGCCGAG GCCTTCGACG AACACAGCCG CTCCGAGCCC GGCGAGTTCG CCCGGCGCAC GGCGGACCTG TTCCGGCCGC GGGTCGGGGT GACCTACCTG CTGCCCCCGC ACGGCGACAT GGACGACTTC TACCAGAGGG CCCTGCAACG GCTCACGTCC GCGCTGGGCC TGCGGGGAGC CGCCTCCGAG GGCGACGTGA TGCACCGCCT GCTCGGCCCG GACGGCTGGC GCCGGGTGTG GGAGGCGACC GTCACCAGGG GCGGGGAGGC CGCCGTCGCC GTCGCCCGGG AGCGGCTGCA GGAGGCGGTG AAGCGGCTGT TCCAGGAGTC CGACGGGCTC GACGGCGAGC CGCTGCTGCC GACGATGGCG AGCCTGCTCG CCGCGACGGT GCGCCGGGAC GGCCCCGCCG CGGTCGGGGA GGACGACATC CGCCAGTTCC AGACGAAGAT CCACGGTCTG GTGCCGGCCA GCTTCTCGCC CCAGGGCTCC GGCAATCTGA AGATCCTCGT CTCCTACCCG GCCGGGGCCC GGGACACCCA GATCGAGGGC TACCTCGCCC GCGCGATCCG GCTGCCGAAC GAGAGCGGCA TCTCGATGGA GTTCCGCCCG ATCAACGCGG ACTCGGTCGC GGTCGTCCTG CTGCGCACCT CGATGAGCAT CACCGAGGTG CCCGAGCTGC GGGAGATCCT GCACCACTGG GCCGACGCGC TGCGCAACGA GCAGTCGCAG GACTTCCTCA AGTGGCGCCA GCGGCTCGGC TTCGACTACG GCTGGCTGGC CACCACCGAG GAGGACCGGG TCCGCATCCT GCACCGGCTG CTCTGCGCGA TGTGGAACGG GCAGGTCCAG GCGCTGGCCG GCGGGACGGA GTCCCCGTAC TCGATCCGGG TCTCGCTCGG TGACCCCAAC GCCGACACCG ACAGCGACGG CGTGAGCATG ACCCTGGCGC TCTCCCCGTT CGAGCCGGCC TCGTCGTGGG GCAGCATGCT CCGCGCCTAC GAGGACTGGT CGCTCGCGGA CGACGAGCGG GTGCGCCGGG ACTTCTCCGA ACAGCTGATG CGGGTGGTGC CGATCGGGGT GACCGGCCGG GCCGCCAACC CGCACCCGGT GTTCCGGGCG TTCGTCGACA ACGCGGACAA GCAGGCCGAC CTGCTCGCCG AGATGCTCAT GAAGCTGCCG CCGGGCAGCC GGGGCTGGGC CGAGCAGCAA CACGCCTTCT GGGCGCACAC CGTGCCGGCC GCGCTGGACA TGGGGTTCTC CAACGTCTCC ACCCCGGTGC GGGCGAACCT GCGCCAGCTC TACGAGATGG TCGGCACACG CGGGAAGGAC CTCGGCAGGT GA
|
Protein sequence | MNIYQPMLFV GLGGTGCLVG AELERRLRHE LCGPDGRALN LRVPGKNYLP YQLPSCVQFV YADLNESELT RLRTRVVPSE EHANAAAPTQ HVTHGLIPTV DTYPDVARSL RVNARELVRG WLPPAEGEPR VAPLMRGAGQ LPTVGRAALF ETFRSGVAPA RRPLSDAVGK IATSGQELAA LGGRLGQTCD VFVAFSVAGG TGAGIFYDYL HLIGDALQRS KVRAQIYPLV LMPSAFDEGV GGGRRARLNA GRALLDLFRL IDDQNGHDAG TELTARGTSG TLGVQYPDGE GQINLLPSTI QTGFLFNRAD GVERDDLHRS VVSLILSLVG TGQEHDDGDA IADRIYQSFA DDFINRGVER EVPASSGIGR RGVSTSLVAS MTIPVEELAD LVASRLLARG AYDLAAPAPG TAADDAGMIR RFASAANIGP MVTRQPFQFT EPAPARGAAE ILSALRARLQ AMEGNLAGLG TMVRQQVPAI AGAFDPAAGL DDLLGEVDLL HARRIVEGRA GESDPVSRGG VVGFLANRRT EPPAPAGISV NPPAPLAIRD RRLGAKVRWG DREVVETIHR QDTWYSWRTR CVWNAAWDEQ QVRWEKALRR FRAQLRLAAE AFDEHSRSEP GEFARRTADL FRPRVGVTYL LPPHGDMDDF YQRALQRLTS ALGLRGAASE GDVMHRLLGP DGWRRVWEAT VTRGGEAAVA VARERLQEAV KRLFQESDGL DGEPLLPTMA SLLAATVRRD GPAAVGEDDI RQFQTKIHGL VPASFSPQGS GNLKILVSYP AGARDTQIEG YLARAIRLPN ESGISMEFRP INADSVAVVL LRTSMSITEV PELREILHHW ADALRNEQSQ DFLKWRQRLG FDYGWLATTE EDRVRILHRL LCAMWNGQVQ ALAGGTESPY SIRVSLGDPN ADTDSDGVSM TLALSPFEPA SSWGSMLRAY EDWSLADDER VRRDFSEQLM RVVPIGVTGR AANPHPVFRA FVDNADKQAD LLAEMLMKLP PGSRGWAEQQ HAFWAHTVPA ALDMGFSNVS TPVRANLRQL YEMVGTRGKD LGR
|
| |