Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6672 |
Symbol | |
ID | 5674987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8102830 |
End bp | 8105535 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245523 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001510915 |
Protein GI | 158318407 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.514873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.542159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGTCT CGGAGACGGT GTCTGTCCAG GCGCGGATCG CCGCGGAGCT GGGCGTGCGC GAGGGGCAGG TGGCCTCGGC GATCGACCTG CTCGACGGCG GTGCGACAGT GCCGTTCATC GCCCGGTACC GCAAGGAGGT GACCGGCGCC CTCGACGACG CGCAGCTGCG AACCCTGGAG GAGCGGCTGC GGTACCTGCG GGAGCTGGCG GAGCGGCGGG CGGCGATCCT GGAGTCCATC CGGAGCCAGG GCAAGCTGGA CGACGCCCTC GAGGCGCAGA TCATGGCCGC CGACACCAAG GCCCGCCTCG AGGACATCTA CCTCCCCTAC AAGCCGAAGC GCCGGACGAA GGCGCAGATC GCGCGTGAGG CCGGGCTGGA GCCGCTCGCG GACGCGCTGC TGGCCGACCG CTCGCTCGAC CCCCGGGCGG AGGCGGAGCG CTACGTCGAC GCCGAGAAGG GCGTCGCGGA CGCCACGGCC GCGCTCGAGG GCGCCCGCGC CATCCTGGTG GAGCGCTTCG CGGAGGACGC CGACCTGATC GGTGCCCTGC GCGAGCAGAT GTGGTCCCGT GGTTATCTCG TCAGCCGGGT CCGCGAGGGC AAGGAGTCCG ACGGCGCGAA GTTCGCCGAC TACTTCGACT TCGCCGAACC GTTCACGAAG CTGCCGTCCC ACCGGATCCT CGCGATGTTC CGCGGTGAGA AGGAGGAGAT CCTCGACCTC ACCCTGGAGC CGGACGCGCC CGCCGACCCG GCCGAGCCCC CGGCCCCCGG CCCCACCGAC TACGAGCGGC GCATCGCCGA GCGTTTCGAC ATCGCCGACG CCGGCCGGCC GGCCGACCGC TGGCTGCTCG ACGCGGTGCG CTGGGCCTGG CGGACGCGGG TCCTGGTCCA TCTCGGCGTC GACCTCCGCA TGCGGCTGTG GACGTCCGCC GAGGACACCG CGGTGCGCGT GTTCGCGGCG AACCTGCGTG ACCTGCTGCT CGCCGCCCCC GCCGGGCAGC GGCGCACCAT GGGCCTCGAC CCGGGGTTCC GTACCGGCGT CAAGGTGGCG GTCGTCGACG CGACCGGGAA GGTCGTCGCC ACCGACACGA TCTACCCGCA CGTCCCGGCC CGCCGCTGGG ACGACTCGGT GGCCTCGCTG GCGCGGCTGT CCGCCGAGCA CGGCGTCGAG CTGATCGCCA TCGGCAACGG GACGGCCTCC CGGGAGACCG ACAAGCTGGC CGACGACCTG ATCCGCCGGC ATCCCGAGCT GAAGCTGACC AAGGTGATGG TGTCGGAGGC CGGTGCCTCC GTGTACTCCG CCTCCGCCTA CGCGTCCCAG GAACTGCCTT CGATGGACGT CTCCCTGCGC GGCGCGGTGT CCATCGCGCG CCGCCTGCAG GACCCGCTCG CCGAGCTCGT CAAGATCGAC CCCAAGTCGA TCGGGGTCGG GCAGTACCAG CACGACCTGG CCGAGGCCAG GATGTCGTCC TCCCTGGACG CCGTCGTCGA GGACTGTGTG AACGCGGTCG GGGTGGACGT CAACACCGCC TCGGCGCCGT TGCTCTCCCG GGTCTCCGGC ATCAGCGGCG GGCTGGCCGA CAACATCGTC CGCCACCGTG ACAGCAACGG CCCGTTCCGG TCGCGGACGG GGCTGCTGGA CGTGGCCCGG CTGGGCCCGA AGGCGTTCGA GCAGTGCGCC GGCTTCCTGC GCATCACCGG CGGCGACGAC CCGCTGGACA GCTCCAGCGT GCACCCGGAG TCCTACCCGG TAGTCCGGCG GATCCTGACG GTGACCGGTG GCGACCTGCG CGCGCTGATC GGCGACACGA AGACACTGCG CTCGCTCAAG CCCACCGAGT TCGTCGACGA CACGGTCGGC CTGCCGACGG TGACCGACAT CCTCGCCGAG CTGGAGAAGC CCGGCCGCGA CCCGCGGCCG GCGTTCCGGA CGGCCGAGTT CACCGAGGGC GTCGAAACCC TCGCCGACCT GGTGCCGGGC ATGATCCTCG AGGGCGTGGT GACCAACGTC GCCGCGTTCG GCGCCTTCGT CGACATCGGC GTGCACCAGG ACGGCCTCGT CCACGTCTCG GCGATGTCGA AGAACTTCGT CAGCGACCCC CGTGAGGTCG CCAAGCCCGG GGACATCGTG CGGGTGAAGG TCCTCGACGT CGACATCCCG CGCAAGCGGA TCTCGCTCAC GCTGCGCCTC GACGACGAGC CGGGCGCGGC GGACGCCGGC GGCGGGCAGA GCGGTCAGGG TGGCCAGGGC GGTGAGCGGC GCGCGGGCCG CGGCGGCCGC GGCGGGCAGA ACCGCGGTGG CGCCGGTGGC GCCGGTGGCG CCGGCCGGAG CCAGGAGCGC GACGGCGGGC AGCCCGCCGA CCAGCAGGGC GGCCGGCAGG CGGCAGCGGC CGGCGGCGGG CAGCCGGCCG GCGGCCAGCC CGGAGGCGGG CGTGGCGGTC CGGGGCAGCG GGGCGGCCCC GGCCAGCGCG GTGGTCCGGG CCAGCGCGGT GGCGGTGGCG GTGCGGGTCA GCGCGGCGGT GCCGGTGGCG CGGGTGGCGG TCAGCGCGGT GGCGGCGGTG GCCAGCGCGG CGCCGGTCAC CGCGGCGGGC GGACCGACGG GGCCATCGCC GACGCGCTGC GCCGCGCCGG TCTGGTGACC GGCGACGAGG TCACCCTCGG CGGAAAGCAG GACGACAGCC GGGACAGCCG CCGCGGGCGC CGATAG
|
Protein sequence | MSVSETVSVQ ARIAAELGVR EGQVASAIDL LDGGATVPFI ARYRKEVTGA LDDAQLRTLE ERLRYLRELA ERRAAILESI RSQGKLDDAL EAQIMAADTK ARLEDIYLPY KPKRRTKAQI AREAGLEPLA DALLADRSLD PRAEAERYVD AEKGVADATA ALEGARAILV ERFAEDADLI GALREQMWSR GYLVSRVREG KESDGAKFAD YFDFAEPFTK LPSHRILAMF RGEKEEILDL TLEPDAPADP AEPPAPGPTD YERRIAERFD IADAGRPADR WLLDAVRWAW RTRVLVHLGV DLRMRLWTSA EDTAVRVFAA NLRDLLLAAP AGQRRTMGLD PGFRTGVKVA VVDATGKVVA TDTIYPHVPA RRWDDSVASL ARLSAEHGVE LIAIGNGTAS RETDKLADDL IRRHPELKLT KVMVSEAGAS VYSASAYASQ ELPSMDVSLR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDLAEARMSS SLDAVVEDCV NAVGVDVNTA SAPLLSRVSG ISGGLADNIV RHRDSNGPFR SRTGLLDVAR LGPKAFEQCA GFLRITGGDD PLDSSSVHPE SYPVVRRILT VTGGDLRALI GDTKTLRSLK PTEFVDDTVG LPTVTDILAE LEKPGRDPRP AFRTAEFTEG VETLADLVPG MILEGVVTNV AAFGAFVDIG VHQDGLVHVS AMSKNFVSDP REVAKPGDIV RVKVLDVDIP RKRISLTLRL DDEPGAADAG GGQSGQGGQG GERRAGRGGR GGQNRGGAGG AGGAGRSQER DGGQPADQQG GRQAAAAGGG QPAGGQPGGG RGGPGQRGGP GQRGGPGQRG GGGGAGQRGG AGGAGGGQRG GGGGQRGAGH RGGRTDGAIA DALRRAGLVT GDEVTLGGKQ DDSRDSRRGR R
|
| |