Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0045 |
Symbol | |
ID | 5668471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 56774 |
End bp | 60883 |
Gene Length | 4110 bp |
Protein Length | 1369 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641238974 |
Product | hypothetical protein |
Protein accession | YP_001504419 |
Protein GI | 158311911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCGCG GCAAGCTCAA CCCCGACCCG GTGCGGCCGC TCGACGGCTA CCGCGTCACC GCCAACGTCG ACGTCCCGAT GCTCGGCGAC GACGGTCAGA CCACCACCAC CGTCGCCGTC GGCGCCGACG TCACGTCGGG CGGGGATGTC GCCATCGAGC TGCCGGAGCG GCGCACGGGC GACGTCCGTC TCGTGGCTAC GGCACCCGAC GGTAGTCGCG TCGGGGCGAC GACCATTGCC GCCGCCAACG TCGACCAGCC GTTCACGCTG GACGTCACCA CCGTCGATCC GGTGACCGTG GTGCCGTCGG ATGTGCCCGG CCTCGGCCGA CGAGAACGCC TCGTCGGCCG GGTGCTCGAC CCGCAGGGCG CCGGCGCTCC GGCCAAGCTG CTCGTCGTCG TCTGGGGTGT GCCCCACGGC GCCCAGCCGA CCGACGCGGT GCCGTTGCTG GTGGCCGAGA CCGCCGCCGG CGGGTACTTC GGCGACGACC TGCCCCCTGA CGAGCTCGAC GACGCGTGGG CCAAGGTGGC TGGCGGCGCG CCCGTACCGA TCCTGCTGGA CGGCGGCCGC CTGCCGCGCA CCGTTGTCAT CGTCACCCCG GTGAAGGCGT CGCACGATAA CGACGACGAC GGCTGCGGAT GCCACGCCGC GCCGCCGCGC AGCCCGGACC AGGCCGACCT GGCGGCCAAC CCGGAGGCGT TCGCTGCCGA CCCCGGCCGT TGCGTCGACC TCACGGTGCC CAACCGGGCG CTGGAGGAGG TGACGTTCCA CGCCGTGGTC CGCACCACCC AGCCGGAGAT CCGCGGCCTG CAGATGCCCA ACGCCGACGA CGTGCCCTCG CCGGTCGTCG ACCGGCTGGT CCAGCTCGTG GCCAGCCGGC CGATCGAGGC GATCGCGCGG CTCGAGTCGC CGAGCACGTA CACGACGAGC GTCGACGCGG CGATCAAGCC GGCGTCGAGC CCGCAGCTCG CCGCGCTCGC CGTGCGCGCC GCGGTGCCGG AGGTGGACGA ACCGACCGTG CGCCGATCGT TGAACCTGCG CGCCGACGTC GTCGGCGAGC TGATGCGTGA CGGCGCCCAC CTGCGCGTCG ACGACCTGCT CCACGCCGAG CGGGTGAGCA CGATCCGCAC CGTGCGCGAC ATCGTCCAGG CCGTCGGCGG CGCACCGGCC GGGCGGGTGG AGCTCGGCGG CGAGACGCAG GTGGACTGGG ACGACACCCC GACGCTGTAC CAGGCCACCA CCATCGCCCA TGGTCACCTG TTGACCTTCA AGCAGGTGTG GCGGGCCGAC GGCTACTCGC TCGGTGACCT GCTCTACTCG CTGCCGCTCG CGCCCGGCCA GAAGAAGCTC ATCGCCGTGG TCGACTGGGA CCGCCGTGAG GTGGAGGTTC GCCAGGCGCA GCGGTCGGAG ACCGAGGTGC TCGATGCCTC GCTGGCTCAT GACCGAGACA TCTCCGAGGT CATCAGCAGC TCGCTGCGCC AGTCGATGCG CGGCTCGTCC AGCGCCGACA CCGAATCGAT CGCCGGCGGG ATCGCCGGGT TCATCGGCCC CGTCGTGTTC GGCTTCGGTG GCGGCTCGTC CAGTGCCGGC TCGACCGCCA GCCAGCGCTC GTCGCGCGAC GTCGCCGGGA CGATGCTCAA CCAGGCACGC GACCGGACGT TGCAGGCGGC CAGCGCAGTG CGCAGCCAGC GGGCCACGGT CGTGCAGACG GCGCGCCAAG GAGAGTCGCT GCGCGTCCAG ACCGACGTGG TGGCCAACCA CAACCACTGC CACGCGATGA CCGTCGAGTA CTTCGAAGTG CTCCGTCACC TGCAGGTCAC CCAGGAGATC GCCCACGTCC AGGAGTGCCT GTTCGTCCCG TTCGAGATCA CCCCGTTCAA CGGCGCCAAG GCGCTGCGCC ACCGCGACGT GCTGGTCCGC CACGTGCCCG GCGCGCTGCG TCGCGGCTTC GACGCGCTGG AGCGGATGCG CACGAACTGG TCCGATGCCG ACTTCCCGCT GGCGCGCTAC GCCGACGAGC CGCTGACCTA CCTGGACGGC GAGCTGCAGA TGAGCTTCAG CATCCCGCGG CCAGCCGACG GTGGCACTGA CAAGGATCAG TTCGACTCGT CGAAGTGGTC GGTCTACGAG GCCAGCGGGC TGCTGGCCCA ACCGGCGCAG ACGCTGTGGA ACGCCTACCT CGGCACTGCC GTGGTGGCCA ACCGTGACGC GATCTTCGCT GCGTCGATCG CCCCGAACAT CGCCCAGCGG ATCGTCGACC GGCTGGCGCT GCGGCTCGTG CTCGACAACG GCTCGCCGGT CGGCGTCGGC ATCGACGCCA CGCTCGTGTC CCGCTATCGC CCGGACGCGC CGCTGCTCGT CAGCCTGCGC CCGACCTCGA CCACCCCGGC GGTGGTGCGC GCTCAGATCG AGCGCGCCGT CATCTCGCTG GGCGCGTTGT TGCCGCCGGA GGCCAAGCTC GTCGTCCAGC AGGCCGGCCT GCGCTACCGC ACCGCCCACC TCGAGCACGA CCTCGTCCCC ACGCAGCGCA CGTACAACGA TCTCGGCCCG GGCGACGAGG TGGAGCTCGT CGTCGGCCTG GACCGTCAGG AGAAGCGCAA CCCGCGGGCC GAGGACCCGC GCCTGGCCGA CGCCCTCGTG GATCACCTCA ACGAGCACAT CGAGCACTAC CACCAGGCGA TCTGGCGGGA GATGGACCCG AACCGGCGGT ACCTGCTACT CGACGGCTTC GTCGCCCCGA ACGCCGGCGG CCGCAGCGTG GCCAGTGTCG TGGAGAACCG GCTGATCGGC ATCGTCGGCA ACTGCCTGGT GATGCCGGTC GTGCCGGGCG TCAAGCTCGA CCGGTCCTAC GAGCACTCAA AGCGCACCCA GACCGACCTG CTCGACCTGT ATGCCACCGA CCCCGCGCCG CCGATGCGCA TCTCTCTGCC GACCAAGGGT GTGTTCGCTG AGGCGGTGCT CGGAGCCTGC AACAGCTGCG AGCTGAAGGA CGACACCCGC TTCTGGCGGT GGGAGGAGTC GCCGATCCCG GACGAGCCGA CCCCGATCGA GCAGGTGTCC ACGGCAACAC GGCGCAGCAC GGTGCCGAAC CTCACTCCCG ACGCGTTCCC CCAGGCGCTG GTGCGCTTTC AGGACGTGCC CACGGCTCCG GCACCGTCCG CGCTCGCCGC CGCGCTGGAG TTGATCGGTA CGCCGAACCT GTTCTCCGAC ATCACTGGCC TGGCGCTGAA CCAGGCGAAC TCGGCCGCGG CGTTGGAGGA GGCGTCGAAG ACGGCGACGT TCTTTGCCCA GCAGGGCGCG GCGCTCGCCC AGCAGCGCTT CCTCGCCTCG GACACCCAGC GCCAGCTGAA GCGGATCAAG GATGCTCGCG ACCAGAAGCT GATCACGCCG GAGCAGGCCA ATGAGCGGGC GATGAGCCTG CTGCAGGGCG CGACGGGTGA GCGCCGGCCG GAGGGCCAGG TGACCACCGC CAACCCCGCC GTGCAGAAGG TGCTCGACCG GGTCAGCCAA TCGTCCAACG CCTCCATGAC GCTGCGCCGC CCCGACGGCA CCGTCTCGGT GAAGAAGGGC GACGACGCCG GCGGTCGCAT CGACGTCACC GCCGATCCGC CCGTCGGGCT GATCAAGCAA CCGACCAACC TGGTGTGCTG GGCGGTTGCC GGGGCGATGA TGGCGAACTG GCGCGACCGC CGCTCGGCGA CGATTCAGAC GGTGCTCGAC GAGCTCGGCG GCACGTGGCG GGCGAAGTTC GACGCCAACC AGGCCCTAGC GGTGGCCGAA ACCGTCGGGT TCACCAAAGC GCTCGGTCTC GTCGCCGAAG GACCGGCGTC GTACCTGCCG ATCGGCATCG CTCGGCTGGT CGAGACCTAC GGCCCGCTGT GGGTGATCTC CGACGACACC GTGGAGAACA ACCAGCTCGT CCACGTGCGC ATCGTCACGG GGATAAGCGG CGGCGGCACG GCCGAGACGA CGATGGTCAA CATCGTCGAC CCGGACCTCG ACCAGCCGAC GACGGAGACG TTCGCCGAGT TCGCCCGCCG GTTGGAAGCC AACGACGCCA TCGACGTGGG CCTCGGGATC ATGCGCTTCC CGACCGTGCA GACCACTTGA
|
Protein sequence | MMRGKLNPDP VRPLDGYRVT ANVDVPMLGD DGQTTTTVAV GADVTSGGDV AIELPERRTG DVRLVATAPD GSRVGATTIA AANVDQPFTL DVTTVDPVTV VPSDVPGLGR RERLVGRVLD PQGAGAPAKL LVVVWGVPHG AQPTDAVPLL VAETAAGGYF GDDLPPDELD DAWAKVAGGA PVPILLDGGR LPRTVVIVTP VKASHDNDDD GCGCHAAPPR SPDQADLAAN PEAFAADPGR CVDLTVPNRA LEEVTFHAVV RTTQPEIRGL QMPNADDVPS PVVDRLVQLV ASRPIEAIAR LESPSTYTTS VDAAIKPASS PQLAALAVRA AVPEVDEPTV RRSLNLRADV VGELMRDGAH LRVDDLLHAE RVSTIRTVRD IVQAVGGAPA GRVELGGETQ VDWDDTPTLY QATTIAHGHL LTFKQVWRAD GYSLGDLLYS LPLAPGQKKL IAVVDWDRRE VEVRQAQRSE TEVLDASLAH DRDISEVISS SLRQSMRGSS SADTESIAGG IAGFIGPVVF GFGGGSSSAG STASQRSSRD VAGTMLNQAR DRTLQAASAV RSQRATVVQT ARQGESLRVQ TDVVANHNHC HAMTVEYFEV LRHLQVTQEI AHVQECLFVP FEITPFNGAK ALRHRDVLVR HVPGALRRGF DALERMRTNW SDADFPLARY ADEPLTYLDG ELQMSFSIPR PADGGTDKDQ FDSSKWSVYE ASGLLAQPAQ TLWNAYLGTA VVANRDAIFA ASIAPNIAQR IVDRLALRLV LDNGSPVGVG IDATLVSRYR PDAPLLVSLR PTSTTPAVVR AQIERAVISL GALLPPEAKL VVQQAGLRYR TAHLEHDLVP TQRTYNDLGP GDEVELVVGL DRQEKRNPRA EDPRLADALV DHLNEHIEHY HQAIWREMDP NRRYLLLDGF VAPNAGGRSV ASVVENRLIG IVGNCLVMPV VPGVKLDRSY EHSKRTQTDL LDLYATDPAP PMRISLPTKG VFAEAVLGAC NSCELKDDTR FWRWEESPIP DEPTPIEQVS TATRRSTVPN LTPDAFPQAL VRFQDVPTAP APSALAAALE LIGTPNLFSD ITGLALNQAN SAAALEEASK TATFFAQQGA ALAQQRFLAS DTQRQLKRIK DARDQKLITP EQANERAMSL LQGATGERRP EGQVTTANPA VQKVLDRVSQ SSNASMTLRR PDGTVSVKKG DDAGGRIDVT ADPPVGLIKQ PTNLVCWAVA GAMMANWRDR RSATIQTVLD ELGGTWRAKF DANQALAVAE TVGFTKALGL VAEGPASYLP IGIARLVETY GPLWVISDDT VENNQLVHVR IVTGISGGGT AETTMVNIVD PDLDQPTTET FAEFARRLEA NDAIDVGLGI MRFPTVQTT
|
| |