Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1060 |
Symbol | |
ID | 5669474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1246309 |
End bp | 1250343 |
Gene Length | 4035 bp |
Protein Length | 1344 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239989 |
Product | hypothetical protein |
Protein accession | YP_001505422 |
Protein GI | 158312914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.362047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGG CGTTCGACGG CAGCGGCTAC CGGCGACGGG TCCTGGCGCC GCTGCGTGCC CGGACCCCGG TGGACACGGC CGACCCCTAC CTGGTCGCGG ACCTCGATCC GCTGCTGGAA CACACCGACT CCGAGGTCGC CGCGCAGCTC GCCCGGGTGA TGGCGTTCCT GCAGCGTGAG CGCAACTCGG CGAAGTACGC GGCGCTGGCC ACCGAGCTGG TGCGCCGCCG CGGCGAGTGG GAGGCGCCGC TGCTCGACGG CGACGCCCGG GCCCGCCTGC GGCACACCGT CCTCGATGCC CGGCGCAACG GCGACGCCGA GCGGCTCGCC AAGGTCGACG GCTACCTCGT CACACTGCGT GATCGTTTCG GTGGCATCCC GGCCTCCAGG GTCGCCGGCC TGCGCCGCCT CGCGGCCGCG GCCGGGGTGA CCGGGGCCGA GTTCGACGCC CGGCTGGGCC GCGAGGTCAT CATCGCGGAC GGCGGCGGCG CCGGAGTCGA GGCGCTCGCC CCCGAGGTCC GCAGCCAGAT CCGGCAGCGG CTGGAGGACC TGCGCGTCCT GCGCGGCGGA GACCGGGCCG GCACCGCCTC CCTGTGGGAC TTCCTCGGTC TGCCGCCCGA CGCCGGGCCG GAGCGCATCC GCTCCGCGTG GGAGGCGGTG GCGGCCGGCA ACGCCCGACG CCCGCACGAC CGCGAGAAGA CGCTGACGGC CGACCTGCTG GCCATGGTGC GCTCCCGGCT GGTCGAGGGT GACCCGGCGG CGTACACCGC GGGTCTGCTC GCCGATGTCG CCGACGAGCT CCGCCCGATC GTCGAGGAGC ACGTCGTGCT CGACGGCGAG CTGACGGCAG TCGCCTACGA GGGGCTCGTC CGGGCCGCCC TGGCCGCCGG CCGCGGGCTG GGCGCCGAGC AGGCGAAGAC GGTCATCCTG GGCATCGCCC GCAACCTGGG CGCCGCGGTC AGCACCGGCG GCGCCGTCGA CTACGTGCTG TGCCCCGGCT GTGGACGTCC CGAGCCGGTC GGCGGCTCGC GGACCTGCCG GTACTGCGAC GCGGGGCTCT ACACCACCTG CCCCGGGTGC GCCTCGCTCA CCGAGGCCGC CGCCGTGACC TGCCGGCGCT GCGGGTACAG CCTGCGCCAG GTCCGGGCCG CCGGCGACGC GCTCGCCGCC GTGCGCCAGG CGCTGGAGGC CGGCCGCCCC CGCGAGGCGA GCGACGGCCT CGCCCGGGTG CGCGCGGCCG TCGCGGCCGC CGGCTCGGGG GCCGGCAACG GGACGGCGGA GGCCGCCGAC GAGCTGGAGT CGCTCGTCCG GGCCGGGCTC GGCGCCGCCG AGGCGGGCTG GCGGGCACTG GCGGAGGAAC GCTCCACCCT GCGCTCCGAC GCCGCGGTGG AACGCGCCCG CTGGCTGGTC GCCCGCGCGG CGGACGTCCC CGGGCCGGAC GGCCGCCCGC CGGCCGAGGT GCTCGGCGAG CTGACCGCGC AACAGGCCGT CATCCGGCGC CGGGTCGAGG CCGCCCGCGG CCTGCCGCCC GAGCAGCAGG AGGCCGCGCT GGTCGCCGTG CTCGCCACCG CCGTCGACAG CGCGGACGCG CTGCGCGCTC TCGCTGCTCT GCCGCTGCAG CCACCGACCG ATCTGACCTC CGTGCTGGCC GACGACGCGG TCCTGCTGCG CTGGCGGCCG TCCGCCTCGG CCGGCCCGGT CACCTACCGG GTGGAGCGGG TCGCCGTCGA TCCCGGCTCC GGGCAGCTCA CCCGGCGCGG CCTGGGCACC ACCAGCTCCA CCGAGCTCGC CGACGCGGGC GCCCCACCGT GGACGCCGGT CCGGCACGAG GTGACCGCGC TCTCCGGCGA GCGGCGCTCC TGGCCGGTCA GCACCGCGCC GGTCATCGCC GTGCGGGACG TCGCCGACCT ACGGGCCGAG GCGACCCCGA CCGGGGTCCG CCTCACCTGG CGGCCGAGCG GGCCGTCCGA CACCGTGACG ATCGAGCGCA CGGTCGATCC CGACTCGTCG GTCTCCGCCC CGCCGCGCCG GGCCCGGGTG ACCGGCGGGA GCTTCCTCGA CTCCGACGTG CTGCCCGGCG TGGGCTACCG CTACCGCGCG TTCGTCGAGT ACACCGACGT CGACGGCAGC GCCGCCCGCA CCTCGGGGTC GCGGGCCGAA TTCGGCCTGC TCACCCGGCC GCGACCGGTC ACCGACCTGG TCGTCGGCGC CGAGGACGGG CAGGTCGCCC TGCGGTGGAC GCCGCGTTCC GGGGCCGAGG TGCGGGTCTA CGCGACGGCC GTCCCGCCGT CCGGCGGCGC CGTCGGCGCT CTCCACCCGG GTGGTCCCGG TGCGGACGGG TCGGGCTCCG GTGCGAACGG GGCCGGCGGG TATGGCGCGG GTGCGTACGG TGCCGGTGGG CAGGGTGCCG GTGGGCAGGG TGCCGGTGGG CACGGCACCG GGGTGCTTGG CGCCGGCGCG GTGTCGGTCG GGGCGGGGGA GTTCGGGCGG GGGCCCGAGC CGAGCTCCGG CCCGCTGGCG CTGCTGGGCG GCGAGGGCGC CGAGGTCCCG CTCGCCGCGC TGACACCACC GCTGCGCCTG GTCGGCGCGA GCCGGCAGGG TCACCTGCGG GACGCAGCGG TTCCGCTTGC TCCCGGCACC GGCGAGCTGA TCTACACCCC GGTCACGGTC GTCGGTGGTC TTGGTGTGCT CGGCCGCTCG GCTCCGCACC GCTTCCCGGT GGTGACGCAC GACGTCTCCG AGTTCACCGC CGGCATCGCC GACTTCCCGA CCGGCATCGC CGACTTCCCG ACCGGCATCG CCGACTTCCC GACCGGCAAC GGTGGTCACG GTCCCGGCAA CGGTGGTCAC GATCCCGCCA GCGCCGGGCT CGGTCCCGCG CAGTCCGCCC CGCCCGGCCA CGGGGGTACC GCTCCCGGCC ATGGCGGTGG CCCGCCGGCC GGCGGGCCAC CGCTGCCCGC GGTGTCCGCG GGCGCCCGGC CGGTCATGAT CGACGTCATG CCGTCCCCGG AGACCGTCCG GGCGCCCGGG GGCGCTCCGC TCGTAGCTCC CGTACCTGTC GGTCCGTCGT CCCCAGGTGG TCCGTCGTCC CCGCTCGGTC CGTCCTCTCC GATCGGTCCG CCTGTATCGG GTGCCCCGCC GCTTCCGGGT GCCCCGCCGT TTTCGGGTGC CCCGCCGCTT TCGGGTGCGC AGCCCGCTCC CGGCGCGTCG CCGGTGCCGG GCCCTCCGCC GGCGACAGGT GCCCTCCCGA TTCCCGGTGG TCCGGCGGGG CCCGGCGGAC CGAGCGCCGG AGGCGGCGCG CCGGGGACTG GCGGGGTCCC TGTGGGCCCG CCGACGGTGG GAATGCCGCT GCCCGTGGCG TCGGACGAGT CCGGGGCGGA CCAGCGTCCG GAACACGAGG AGTGGTCAGC CGCCGTCCCG CGGCCCGCGG CTGACCCGGT TGACATGCCG GACCCGATGG GCGCCGCCGC ACCCGGGTCG GGACCGGCGG CCGCCACGTC CGGGGCGCAC CCCGGTATGG GGACGGGTCC AGCGACGCCC GACGTCGGCG GGCACTCCGT GGCGCCGGCC GTTCCCGGCG CACCCGGCCC CGGCACGCTC GCCGCGCCGC CCCCCGCGCC CGCCCAGACC CAGGGCCACG CCCTGGTTCC CGGGCCGCCC GCGCCGCTCG GTTCTCCGGC CCAGCTCCAG CCGCCGGTGC CGGTGCCACC GCCGGCGGAG CTGACGTCCG TCACCTATTC GGTGTCGAAG GCGGGGTGGC GCCGGCGGAC CCTGCGCGTC CAGGTGCGGG CGACCGGGCC GGCCCCCAGG CTGGTGCTGT TGGCGCGGCC CGGCGAGGAG CCGCCCGGGT CGCCGGCCGA GGGACAGGTG CTGGCCGAGT TGCAGCCGGC TCCGAGCTCG GGGTCGTGGA CCATGGAGGT CACCCTCGAG GGGGCGCAGC TCCCGTGGGG GGTCCGGCTC CTGCCGGTGG TCACACCGGG TGCTCCGGCG GTGTGGATCG ACCATCCTGA GGACCCGATG CTCGTCGTCC GCTGA
|
Protein sequence | MSAAFDGSGY RRRVLAPLRA RTPVDTADPY LVADLDPLLE HTDSEVAAQL ARVMAFLQRE RNSAKYAALA TELVRRRGEW EAPLLDGDAR ARLRHTVLDA RRNGDAERLA KVDGYLVTLR DRFGGIPASR VAGLRRLAAA AGVTGAEFDA RLGREVIIAD GGGAGVEALA PEVRSQIRQR LEDLRVLRGG DRAGTASLWD FLGLPPDAGP ERIRSAWEAV AAGNARRPHD REKTLTADLL AMVRSRLVEG DPAAYTAGLL ADVADELRPI VEEHVVLDGE LTAVAYEGLV RAALAAGRGL GAEQAKTVIL GIARNLGAAV STGGAVDYVL CPGCGRPEPV GGSRTCRYCD AGLYTTCPGC ASLTEAAAVT CRRCGYSLRQ VRAAGDALAA VRQALEAGRP REASDGLARV RAAVAAAGSG AGNGTAEAAD ELESLVRAGL GAAEAGWRAL AEERSTLRSD AAVERARWLV ARAADVPGPD GRPPAEVLGE LTAQQAVIRR RVEAARGLPP EQQEAALVAV LATAVDSADA LRALAALPLQ PPTDLTSVLA DDAVLLRWRP SASAGPVTYR VERVAVDPGS GQLTRRGLGT TSSTELADAG APPWTPVRHE VTALSGERRS WPVSTAPVIA VRDVADLRAE ATPTGVRLTW RPSGPSDTVT IERTVDPDSS VSAPPRRARV TGGSFLDSDV LPGVGYRYRA FVEYTDVDGS AARTSGSRAE FGLLTRPRPV TDLVVGAEDG QVALRWTPRS GAEVRVYATA VPPSGGAVGA LHPGGPGADG SGSGANGAGG YGAGAYGAGG QGAGGQGAGG HGTGVLGAGA VSVGAGEFGR GPEPSSGPLA LLGGEGAEVP LAALTPPLRL VGASRQGHLR DAAVPLAPGT GELIYTPVTV VGGLGVLGRS APHRFPVVTH DVSEFTAGIA DFPTGIADFP TGIADFPTGN GGHGPGNGGH DPASAGLGPA QSAPPGHGGT APGHGGGPPA GGPPLPAVSA GARPVMIDVM PSPETVRAPG GAPLVAPVPV GPSSPGGPSS PLGPSSPIGP PVSGAPPLPG APPFSGAPPL SGAQPAPGAS PVPGPPPATG ALPIPGGPAG PGGPSAGGGA PGTGGVPVGP PTVGMPLPVA SDESGADQRP EHEEWSAAVP RPAADPVDMP DPMGAAAPGS GPAAATSGAH PGMGTGPATP DVGGHSVAPA VPGAPGPGTL AAPPPAPAQT QGHALVPGPP APLGSPAQLQ PPVPVPPPAE LTSVTYSVSK AGWRRRTLRV QVRATGPAPR LVLLARPGEE PPGSPAEGQV LAELQPAPSS GSWTMEVTLE GAQLPWGVRL LPVVTPGAPA VWIDHPEDPM LVVR
|
| |