Gene Information |
Locus tag | Franean1_7172 |
Symbol | |
ID | 5675473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8753255 |
End bp | 8755342 |
Gene Length | 2088 bp |
Protein Length | 695 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641246009 |
Product | hypothetical protein |
Protein accession | YP_001511397 |
Protein GI | 158318889 |
COG category | |
COG ID | |
TIGRFAM ID | |
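
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(8753255, 8755342)
print(length)                     # 2088
print(protein_length_aa(length))  # 695
```

Both results match the Gene Length (2088 bp) and Protein Length (695 aa) rows.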
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.417277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCAAG CGAGGCAGAT TCCGACACGT CAACCGCTCT ATGTCGTCTC CGACGGCTCC CGTCGTATGG ATTCCGCCCG GCTGACGGCC GCCAATCTGC TGTTGGGCCA CCTGGTCGAC GGAGTCACGT CAACACCCGA GCTGGCGGCA GCCGTTCCGC TGTGCGTCCA GGAGTTCGAC CAGGAATCGA CGACGCTGAT GCGGCTAGCC CCGTTGGATC CGGCCGTGCC GGTCCGCGAG CTGACGGCCA GCCTCGCGGA GCCATCCCTG ACAGCGGCGT TCACCGGGCT GCGGGCGACA GCCACCGGTG ACCTCGCCCA GCTGTTGGCC GACGACTTCG AGGTGGGTCA GCCCATCGTG GTCGTCATCA CCAGCGGCCA CTCCCGCGAC ACCGAGGCCG ACCGGCTGCG GGCCTTCACC GAACTGGTGA CGACGAACCT GTGGCCGGAC CAGACCGCCG ACTGGGAACC GTCGGTGTAC CTGTTCGCCC TCGACGCTCC GGACTCGGCG ATGCTCGATC GGTTCGCCAG CTCCGGTGCC CGTTGCCGGG TCCGCCCGGT GGCCGTCGGC ACCGACCCCG CACCCGATGT CGCCGACGTG ATCAGGGATC TGGAGCTGCG TGTCCAGGGC GGGGCGCAGG GGATGAAGGT GGCACGCTGG GTGCTGTCGG TCGCCCAGCT CGGCTCCACC GCGCCGACCG CTCGCGTCGA CGGCGTGGAG ATTCACGAGA TCCCTGAGCT GACTCTGCCA TGGGTCACGA AGCCGCTGTG GTACCGCCGC TTCCTCGACA TCACGGACGC GGACGCCCAT CCGTTCGACC CGCTGGTGGA CCTCGTCGCC CTGTCCGACG AACCGCCTCC CCCGGGCTCC CCGGGGCAGC TCGCCGACCA GGCCTGCTGG CCGGTGCGAG TCGTGGTCGC GGACGACTCG CCGACCGCCG TCGTCGGGCT GGTCGCTCCG AGGCCGCCGG AGCGGTTCGT CGACGACGAC GGAGCACAGC GCCGGGACCG CACCGCCGCC CAGCTTCACC TGGATGCCGC GGCGCTGCCG CTGGACCCCG GAAAGGTACC GGCCGCCACC GACAGCCTCA TCCGGGCCCA GCTGTGCGAG CGGCTGGCCG CCGCCGTCGC CACCGCCCAC CACTACGGCG TGCGGCTCGG CCGGCAGACG CTCGAGAGCG CGGTCTACGC CCTCGACCCG GAGCCCGACG TGCTCCTCGT CGACTGCGAC ACCGCGAAGC TCGATCCGTC CGATCAGGCC AGTCCACAGG AGGACCTGAC CTGGCTGGCG CGGTTCGTCG AACGCTGCGT GGACGACCAG CAGCTGCCGC CGGTCGCCCT CGGCGAGGAG GCGCCGCCGG TGGTCCTGGA CGCCACCGGC TGGAAGATGA TCGCCGACGC GAAGTCCAAG GTGGGACTGG CCGTGCCGTC CGCCAGTCGC TGGCAGCGCT ACCTGGCCGA CCGGGTGCTG GAGCTGCGTG GGCCGCCCAC CGTCACCGCC GTGCGGGTGA GCCCGGTCCT CGTCCCTCGC GGCGAGAAGG TCACGGTCCG CTGGCGAAGT CGGTACGCCG AGTCGATGAT CGTGATCAGT CCGGACGGCA AGCAGATCCA GGTGCCGGCG AAGCAGCTCG CGGACGGCGC CGCGCGCATG ACCGTGACCG CCGCCGGGCC GGTCCGGTTC CGGGCGGTCA ACCAGGTCGG CACGACCGAG CTGGCCAGCG ACTGGATTCA CGTCTTCGAC CTCCCGACGG GCGCGGATGT CGACTATCCG AAGATCTCGA ACCTGCCGGC GATCTGGCTC 
GACGGCATGA TCATGAACAC CTGGGCCTTC GACGACACGA ATTTGGCCGC GATGCTGCCG GCGATACCCG GCGGTGCCGG TAACAGCGAG GGCAGGGGGC GCGCCGGCCA CGTCGGCCCC GTAGGCCGGT CGGGTCGGGT CGGCCGGGCC GGCGGGCGTG GCGGCGACCC CGCGGCCTCG GTGCCGGGGC GCGCCGAGTT CCCGATCGAC CCGACGACCT GGTTCGCCAA CCCGCCGGAG ATTCCCCGAC GCGGCCGCGC GAGGAGATGG AAACTGCCAT GGACGTGA
|
Protein sequence | MSQARQIPTR QPLYVVSDGS RRMDSARLTA ANLLLGHLVD GVTSTPELAA AVPLCVQEFD QESTTLMRLA PLDPAVPVRE LTASLAEPSL TAAFTGLRAT ATGDLAQLLA DDFEVGQPIV VVITSGHSRD TEADRLRAFT ELVTTNLWPD QTADWEPSVY LFALDAPDSA MLDRFASSGA RCRVRPVAVG TDPAPDVADV IRDLELRVQG GAQGMKVARW VLSVAQLGST APTARVDGVE IHEIPELTLP WVTKPLWYRR FLDITDADAH PFDPLVDLVA LSDEPPPPGS PGQLADQACW PVRVVVADDS PTAVVGLVAP RPPERFVDDD GAQRRDRTAA QLHLDAAALP LDPGKVPAAT DSLIRAQLCE RLAAAVATAH HYGVRLGRQT LESAVYALDP EPDVLLVDCD TAKLDPSDQA SPQEDLTWLA RFVERCVDDQ QLPPVALGEE APPVVLDATG WKMIADAKSK VGLAVPSASR WQRYLADRVL ELRGPPTVTA VRVSPVLVPR GEKVTVRWRS RYAESMIVIS PDGKQIQVPA KQLADGAARM TVTAAGPVRF RAVNQVGTTE LASDWIHVFD LPTGADVDYP KISNLPAIWL DGMIMNTWAF DDTNLAAMLP AIPGGAGNSE GRGRAGHVGP VGRSGRVGRA GGRGGDPAAS VPGRAEFPID PTTWFANPPE IPRRGRARRW KLPWT
|
| |
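
The relationship between the gene sequence, the protein sequence, and the Translation table (11) and GC content rows can be illustrated with a short, dependency-free sketch. It assumes the NCBI bacterial genetic code (table 11), which uses the standard amino acid assignments but accepts alternative start codons such as GTG, each of which is translated as methionine at the first position. The helper names `translate_cds` and `gc_percent` are hypothetical, and this is not presented as the method the database itself used.

```python
from itertools import product

# Standard amino acid assignments in NCBI base order T, C, A, G.
# Table 11 shares these; it differs from table 1 only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11 (assumption: NCBI definition).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid start codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGAGTCAAG CGAGGCAGAT TCCGACACGT"
print(translate_cds(prefix))  # MSQARQIPTR
```

The output matches the first block of the protein sequence, which shows the GTG start codon being read as M. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content row.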