Gene Information |
Locus tag | Franean1_1746 |
Symbol | |
ID | 5675686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2092698 |
End bp | 2095937 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240667 |
Product | hypothetical protein |
Protein accession | YP_001506090 |
Protein GI | 158313582 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000886792 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGAACG ACGAGTCGCC GAGGTCCCAA CCGCCGTCCA GGGCTGGCCG TGCTGCCACC GGCGGTGGCG GCGCGCGGCG CACCGAGGGC CGCCGTGACG CGCCGGGTGC CCGGCGTGGG CCGTCCTCGT CCGGCTCGTC CGGGCGGGAT GCCGGTGGCG AGGCCCGGCG CTCCGGTCGG GTCACCAGAG GCGATGCGGG CCGGACGGGG TTCGTCCGGC GTGCTGACGG TGGGCGAGGC CCCGCGCCAG CTGGCGCCTC GGCGGCGGGG ACGGGGGCGG GTGGTCGTGG TGCCCCCGCG TGGCAGGGCA CCGGGCGTGC CGGCGGTCCG TCCCGGCAGG GAGGTCCCGG CCAGGGCGCC GGACGTCCGA AGGCCGCCTG GCGTCCCGCT GGCACAGGCC AGGGGCGTGG TGGCGCTGGC GGTTCCGGTG GACCCGGCGG CGAGCGGTGG GGTCGTCCCG CCTCGTCGCG TGAGGGTGGT CCGCGCGAAG CGAGCGGCGG GCGGCCTGCT GGAGACCGTT GGTCCGGCGG TGCGGGCCGG CCGGGAGCGC ACGGTCGGCC GGAAGGGCAG GGCCGGTCCG AGGGGGGCCG GCCGGCCCGC CGCTGGGAGG GCTCCGGCCG GAGCGCGCCG GCCTCGGATT CGCGGGGCCG GCAGTGGTCG GGGAGCCCGT CCCGCGAGGC GGGAGACCGC CCCAGACAGA GTGACAGGCC GCCCTATGAC GCGGCCCGCG GCACCACATC CCGGCACGAC GGCCCTCGTC ACGACGGGTC CCGCCATGAC GGCCCTCGTC ACGACGGGCC TCGGCATGAC GGGCCCCGCC ACGGCGGAGG TCGTGACGAG AGTGGCCGCG GTGATGCTCG GCGGGATGGC GGTGGCTGGG CCGGCGGCGG CTCTCGAGCC GGTTCGTCAC CCGACCGCGG TGGGCAGCGC AGCTACGGAG ACGGCGGTGG ACGACCGTCC GGTGGACGAC CGACGTCGGG CCAGACGGCG CGGGGCTCCT CCGGGCGTCC GTACGGTGCC GGGCGGCCGG TCGGCGGTGG GCGTGACGGT TCCGCGGGCC CCGGGAACGG CGGCTCGGGC GAGAACCGGT TCGAGCGGCG CTCACGGGCG CAGGGCTCGG GCCAGTGGCG ACCCTCCTCG GGGGACGCGG CGACACGGGG CGCGGCAGGT CCGCGTAGTG GGATTCGACG TGACGGCCCG GGCAGGTCCG ACGGGACGGC GGCCGGCCGC GGCCCGGGTG CCGACCGGCG TCCCCCGTCG GCCGGGTCGC GGCCCTGGCA AAGCGGTGCG ACCGGCGATG CCCGGACCTC CAACGGCTCC GGGTACGACC GTGGTCACCG GTCTGATCGC GCCTCCGGAC CCGATCGTGG TTACCGGCCC GATCGTGGCG CCGGGTCCGA CCGTGGTTAC CGGTCTGATC GTGGCGCCGG GTCCGATCGC GGTGACAGCC GTCGGAGTGG CCCGCCCAGC GGGGGCTCGT CCGCACGGCC CTACACGCGG CCGACGCACC GTGAGCAGCC CGCGTCCGGC ACGGACCGGG TAGATCGGGC GGAGCGCACG GACCGACCGG ACGCTCCGGG CACCGGTGCG GATCGGGCCC GTCCCGACCA GACCCGCTCC TATGAGGCCC GTGCGAACCA GGGCCGCCCG GACGAGGGCC GTGCGGAGCA GGGGGGCCCG GACCGGGGCC GCCCGGACGT GGGCCGCTCC GGTCAGGGCC GTGCGGACAC GGGACGTCCG TATCAGGAAC GGCCGGATCG GGCGGCTCGG CCGGATCGAT CCGGTGGGAG CCGGTTCGAC 
GCGAACCGTG CCGGCGCAGG GGGCTCCTAT GCGGGTCGCC GTGGTCCGGA CGCCCGCTCG GGATCGAGCG GTGGACCTCG GGCCGGCCGT GGTGGGGTGA GTGGCGGGCC CAGCCGTTCG GGCCCCGGTA CGCGTCCGGC CGGCGGAGCC GGGGGGCGAT CCTCCGCGGG GTCGGCCGGC CGGCCGAACA GGGACGACGT CACGGCGCGC CGCAGGCCGC CCGCACCTGC GCTGCCGGAC GAGGCCAAGG CCGAGCTGCT TGATCGCGAC ATTCGCCGTG ACCTGCGCAG CGTGCCCGCG CCGCTGGCCG AGACGATCGC CCGGCACCTC GTCGCGACCG CGCTGCTCGT CGACACCGAC CCGGTGCAGG CGCTGGCCCA CGCGCGTGCC GCGGCGGCCC GGCTGCCCAG GATGGCCGCG GTGCGGGAGG CCGTGGGAGT CGCGGCCTAC CACGCCGGCG AGTTCGCCTC CGCGCTGCTG GAGCTGCGCG CGGCCCGCCG CATCGACGGC TCGTCCCACA ACCTCCCGCT GATGGCTGAC GCCGAACGTG GCCTCGGTCG GCCCGAACGC GCGATCGACT ACCTCTCGGA CCCGGGTGTC GCGGCGCTGG ACGCCGCCGG CCGGGCCGAG CTCCTGATCG TCGTGTCCGG TGCCCGGCGC GACATGGGTC AGCCGGAGGC GGCCGCGGTG CTGCTGCGCG ACGAGGTGAC CGCCAGAACG GAGCCGAAGC CGTGGACGCC CCGTCTCTGG TACGCCTACG CCGAGGCGCT GCTGGCGGCC GGTCGCACGA TGGAGGCGCT GCGCTGGTTC ACGGCCACGG CCGGCATCGA CGAGGACACG ACCGACGCGG CCGAGCGCGT CTACGAGCTC ACCATCGATG ACGAGACCGA GATCGACAAC GGGGACGGCG GCGCCGAGGA CAACGGGCCG GAGAGCAATC GGCTCGAGGA CGAGCTGCTC GGTGACGAGC GACTCGAAGA CGACGGGCCC GCCGCCGAAA CCCACGACGG CGCGACCGTA GACGAACCGG ACCCTGACGG CCTGATCGCC TCCGGCACAG GTGCGGCCGT TGACGCCGTG GCAGATGACG CCGCGCTGGC CGACGCCGAC GATGCCGGTG ACGCGGCCGA CGATGCCGGT GACGCGGACG CGGAAGCTGA GGCCGCCGAC CATGGCGACA CGGGTGGCGC CAGTGACACC GGGGCAGCGG TCGATGTGAC CGTCGACGGC GAGCCGGCGG CACCGGTCGA GCAGCCGGCG CCGGCGGAGC CCGCCGCGGC CGTGGAGCCC ACCGAGGTCG GCTTCTCCGC GGCCGAGGAC ACGAGCGCGG TGCCTGCCGC GGAGGTGACC GCCGACACCC CGCCGATTCC AGAGATCATT TTCTCGGACG CCCCGGGCGG TGCGGACCAG GCCGGAGCGC ACCGGACATC CGAGGACTGA
|
Protein sequence | MPNDESPRSQ PPSRAGRAAT GGGGARRTEG RRDAPGARRG PSSSGSSGRD AGGEARRSGR VTRGDAGRTG FVRRADGGRG PAPAGASAAG TGAGGRGAPA WQGTGRAGGP SRQGGPGQGA GRPKAAWRPA GTGQGRGGAG GSGGPGGERW GRPASSREGG PREASGGRPA GDRWSGGAGR PGAHGRPEGQ GRSEGGRPAR RWEGSGRSAP ASDSRGRQWS GSPSREAGDR PRQSDRPPYD AARGTTSRHD GPRHDGSRHD GPRHDGPRHD GPRHGGGRDE SGRGDARRDG GGWAGGGSRA GSSPDRGGQR SYGDGGGRPS GGRPTSGQTA RGSSGRPYGA GRPVGGGRDG SAGPGNGGSG ENRFERRSRA QGSGQWRPSS GDAATRGAAG PRSGIRRDGP GRSDGTAAGR GPGADRRPPS AGSRPWQSGA TGDARTSNGS GYDRGHRSDR ASGPDRGYRP DRGAGSDRGY RSDRGAGSDR GDSRRSGPPS GGSSARPYTR PTHREQPASG TDRVDRAERT DRPDAPGTGA DRARPDQTRS YEARANQGRP DEGRAEQGGP DRGRPDVGRS GQGRADTGRP YQERPDRAAR PDRSGGSRFD ANRAGAGGSY AGRRGPDARS GSSGGPRAGR GGVSGGPSRS GPGTRPAGGA GGRSSAGSAG RPNRDDVTAR RRPPAPALPD EAKAELLDRD IRRDLRSVPA PLAETIARHL VATALLVDTD PVQALAHARA AAARLPRMAA VREAVGVAAY HAGEFASALL ELRAARRIDG SSHNLPLMAD AERGLGRPER AIDYLSDPGV AALDAAGRAE LLIVVSGARR DMGQPEAAAV LLRDEVTART EPKPWTPRLW YAYAEALLAA GRTMEALRWF TATAGIDEDT TDAAERVYEL TIDDETEIDN GDGGAEDNGP ESNRLEDELL GDERLEDDGP AAETHDGATV DEPDPDGLIA SGTGAAVDAV ADDAALADAD DAGDAADDAG DADAEAEAAD HGDTGGASDT GAAVDVTVDG EPAAPVEQPA PAEPAAAVEP TEVGFSAAED TSAVPAAEVT ADTPPIPEII FSDAPGGADQ AGAHRTSED
|
| |
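The coordinate fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: total codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above (locus tag Franean1_1746).
start_bp, end_bp = 2092698, 2095937
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 3240 1079
```

Under these assumptions the computed values agree with the Gene Length (3240 bp) and Protein Length (1079 aa) fields listed in the record.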