Gene Information |
Locus tag | Franean1_6380 |
Symbol | |
ID | 5674696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7743024 |
End bp | 7746077 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245229 |
Product | hypothetical protein |
Protein accession | YP_001510624 |
Protein GI | 158318116 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
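The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Franean1_6380.
length = gene_length(7743024, 7746077)
print(length)                  # 3054
print(protein_length(length))  # 1017
```

The strand field ("-") does not affect these lengths. It only indicates that the coding sequence is read from the reverse complement of the replicon.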
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.726246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGCA CGGTGGTCAC CTTCTACTCG TACAAGGGTG GCGTCGGGCG TTCTCTCGCA CTCGCCTGGG TAGCCTGGAT CCTCGCCTCG GCCGGGAAAA AGGTCCTCGT CGTCGACTGG GATCTGGAGG CCCCCGGCCT GGAGAACTAC TTCTGGCCGG CCGCGGCCGT CAAGGGGATG CGGACCAGGC CCGGTCTCGT GGATCTCGTC ATCGACTACC GCGACCGCCT CGACGCGGAC GCACTGGCAC GGTACCTGGA ACGTCATTCA TGGCAGCCGC TCGCCGCCCA CCCGGACTGG TCCGGGGAGG ACTGGGCGGA CCCGTCGACC GCCGAGCAGG CACGCGCGGA GCTGGCCGAT CCCGACCTGC GGGAGTGGGT CCGGGAGCGA ATCACCAGGG TGGATCTGGA TGCGGTCCGC GCCGAGCTCA CCAGGGATTC GCCCGGCGGC GAGGCTGATT CTGACCCTAT GAAGCAGCGG TTGAGGGACT GTACCAGCTG GGCCTCGTTC GTCGAGGAGC TGGACATCGA CGACGTTTTC CAGCCCGGTG GCTCGGTCGA CCTCATGCGA TCCGGCCGCC TCGACGCTGA ATACGCCGCA AAGCTGGCGC TCCTGGACTG GAACGAGCTG TACACGAAGT TCGCCGGGCA CCGCTTCTTC CGGACCCTCA AGTCCAGATG GACGGACTAC TACGATTTCG TCCTGGTGGA TTCGCGGACG GGCATGGGGG ATGTGTCGGC CACCTGCACG CAGGTTCTCC CCGATGTCAT GGTCGCCTGC TTCGGTATGA ACGAGCAGGG CATCCTCAAC ACCGCACGGG TCGCCGCCCA GGCCCGTGCG GCGGGCGCCC GTGAGCGGCG TGAACTGAGA ATCCTGCCGT TGCCGTCACG GATCGAGACC TCCGCGCGGG CGGCGGCGGG AAAGGCAAAG GACCGCTACC AGCGGGTGTT CGAACTGCCG TGGGAGGAGT TCGAGGAGGT CTCCACGACG CTGCCGCGTC AGCCGTTGAT AGGGCCCAGA ATCCTGGGTG GCACGCAGAA GTACTGGGAG CAGGTGGCGT TGCCGCACAC CCCGCGGTAC GCGTTCCAGG AGGGTCCGCC GCACGGAATC GACCGGGATC GTCTGCAGCG GGCCTGCGAA TTTCTCGCCC AGAACATCGG GCGGGATGAC ACAATCCGAC TGAACCCGCC GTCGCGGACG GCATGGGCGA AGATGGTGGC GGGCTTCGAC CGTCCCGCGC GCAAGCTACA GTACGACGAG TTCGTGATCA GCTACGCCGA AGGCGGTCGG GATCCGCTCT GGGCGGCCTG GGTGTACGAA CAGCTGTGCC GGATCTCCGC GCGGGTACGG CAGCACAACG CGAGCCGGGA CGGGACGGGC TTCCTCGACG ACCGGCTCCG GGACGCCGGC GGCGATCCGG GTACGGACAT CGACGGCGGG AAGCTCTGTG TGCTGATCCT GCTCAGCCCC AGATATGTCG AATCCGACTT CGGCCGGGCC ACCTGGCGGT GGGCCGCCAG CCGCTACACC AGGGCGGACC CGGCCGACCG TGGCTCCAGC CGATGGAGCC TCCTGCCCGT TCTCATCTCC TCCCACGCCC CGGAGACGGC ACCGTTCGCC GAGATCCAAC CGGTGAGCCT GGTCAAACTC GACGAGGAGA CAGCCAGAAC GACGCTGCTG GCGGAGCTGG CCGACACGTG GGACGAGCGC GAGTCCGCGG TCCTGACGCC CGTGGAGTTT CCCGGGAGCC CGGAGGATCT CGTCCGCCGT TACCAGGAGC GGGCTGACGA GCTCGAAAGG 
GCGAACGCTG TGACGGCGGA ACGGGTGGCG ACCCTGCGCC AGCTCGCCCA GGAGAAGCTG GAGCATCACG GTACGACCTC GACCTCCCAG GCCGTCCAGC ACCTGAGGGA GGCGAACCGG CTCGCCCGTG ACAGGCGGGA CAGCCTCGTG GCCGCTCTCA CCGGGTTCGA GCTCGCCTAC GCCGAGTTTT CGGAAGAGCA ACTACCACAG CGGCGGCGTA CCGGCCGGCC CCTACAGACC GCCGTGCGCG CCGTCGAGGA GACCATGCGG CTGGTGGACT CCGACACCAT CGCCGATCTC TCCGACGCCA CCGACATCCG TGGCGCCATT GACCCCAACA ATTCCCGCGG CGGAATCCAC GCCAGGAGCA CCCTGACGTC TTCCAGGCTC TACCGCAATG ATCTGCGCCC CCTTCGCCGG GCCATAGACG AGGTGGACCG GCTTGGGCTG ATCGAACCCA TGTCCCCGTA CGGGCAGCGG CTGCGGCTGG CGAAGGCGGT TGTGCTCCGT CTCCTCGACG ACGTCCAGGA GGCACGGCCA CTCCTGGTCG AGGTCCACCA GAACGCCGGC CGGGACCGCG CTCTCCGCGC CCGGTGCGAC CTGGAGATCG GGATGGGGTC GTTCGCACGC GGCAGAACGT TCTGGGACGA CGCGTACCGC AAACTGCTAC GGCTGGTCCA GCAGCCAGAT GCCGAAGCCA GGATCAGGTT CCTGGCGCTT TCCACCCTAG GCGCGATGTC GAAGCCCGAA GAGGCCAAGG GGCATTTCGA GCGGGCGCTC GAGGCGGTAG CTGGGGAACG CCACCTCGCC GGACTCCAGA TCGCCGCCCT CGTGAATCTC GGCCGGAACG CGGCTGCCCG CGGCCTGCAT CCCACGGAGG AGGGCGGACC GCTGAAGCAC TTCGAGGAGG CGGTCGACGC GGCCAGGGGC GAGTGGCCAC TCGTCTCGCT TCCGCTCGCC GAGGCATACT GGCAGACCAG CCGTCCGGTG AAATCACCTC ATACATCGAA GGAAATTCTG GCCCGTCTGC TGCGAGCGAT CTGGATCTAC ACTGTTCTCG GCCTGCAGGA CGAGGTCCAT GAATGCCGAA AGATCATCAG GAGTCGCGCG GCGAAGAACA TGGGCTGGGA TTCCTTCAAG GAAATAAACG ACGGCGAACC GACCTCCACG TTCACCCCCG CGAAGATCAG CGACTTCGAC GTGATCATGG CGATCCTCGA CGTCCGAGTA CCCGACACCA CCGCGCTCCG CTGA
|
Protein sequence | MPGTVVTFYS YKGGVGRSLA LAWVAWILAS AGKKVLVVDW DLEAPGLENY FWPAAAVKGM RTRPGLVDLV IDYRDRLDAD ALARYLERHS WQPLAAHPDW SGEDWADPST AEQARAELAD PDLREWVRER ITRVDLDAVR AELTRDSPGG EADSDPMKQR LRDCTSWASF VEELDIDDVF QPGGSVDLMR SGRLDAEYAA KLALLDWNEL YTKFAGHRFF RTLKSRWTDY YDFVLVDSRT GMGDVSATCT QVLPDVMVAC FGMNEQGILN TARVAAQARA AGARERRELR ILPLPSRIET SARAAAGKAK DRYQRVFELP WEEFEEVSTT LPRQPLIGPR ILGGTQKYWE QVALPHTPRY AFQEGPPHGI DRDRLQRACE FLAQNIGRDD TIRLNPPSRT AWAKMVAGFD RPARKLQYDE FVISYAEGGR DPLWAAWVYE QLCRISARVR QHNASRDGTG FLDDRLRDAG GDPGTDIDGG KLCVLILLSP RYVESDFGRA TWRWAASRYT RADPADRGSS RWSLLPVLIS SHAPETAPFA EIQPVSLVKL DEETARTTLL AELADTWDER ESAVLTPVEF PGSPEDLVRR YQERADELER ANAVTAERVA TLRQLAQEKL EHHGTTSTSQ AVQHLREANR LARDRRDSLV AALTGFELAY AEFSEEQLPQ RRRTGRPLQT AVRAVEETMR LVDSDTIADL SDATDIRGAI DPNNSRGGIH ARSTLTSSRL YRNDLRPLRR AIDEVDRLGL IEPMSPYGQR LRLAKAVVLR LLDDVQEARP LLVEVHQNAG RDRALRARCD LEIGMGSFAR GRTFWDDAYR KLLRLVQQPD AEARIRFLAL STLGAMSKPE EAKGHFERAL EAVAGERHLA GLQIAALVNL GRNAAARGLH PTEEGGPLKH FEEAVDAARG EWPLVSLPLA EAYWQTSRPV KSPHTSKEIL ARLLRAIWIY TVLGLQDEVH ECRKIIRSRA AKNMGWDSFK EINDGEPTST FTPAKISDFD VIMAILDVRV PDTTALR
|
| |
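The gene and protein sequences above are shown in space-separated blocks of ten. As a minimal sketch, the snippet below strips that spacing, computes the GC fraction, and translates codons using the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the rounding used for the reported 67% GC content is not known from the source.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


# First three blocks of the gene sequence in this record.
print(translate("ATGCCCGGCA CGGTGGTCAC CTTCTACTCG"))  # MPGTVVTFYS
```

Passing the full gene sequence to `translate` and dropping the trailing `*` is one way to compare it against the listed protein sequence.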