Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1248 |
Symbol | |
ID | 5669661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1503306 |
End bp | 1504844 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240180 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001505608 |
Protein GI | 158313100 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.375848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0192244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGGGAC GTCCGGCGCA GGGCCGCCCG GCGCGGGACC GACCGGCCGC CGGCGAGGAG AACTGGCCGG CGGGAGCCTG GCCGCGCCAC GAGCCCCGCT CGGCTCCCGC GCCGATCCCG CCCACCCGCC GGCTGCCTCC GCCGGGCGCC ACCGACGGCG GGCGGACCTG GCCCGCCCCG AACGACGGGC CCGGCGGGTA CCGCGCGCCG GCCGGCCCGG TGCCCGGGTA CGGCGGCCCG CGCGGGCCGT ACGACACCCC GGGCGAGGAC CTGCCCGCGG AGGAGCCACA CCGCCCGGTG AGCGGTGTGC GCCGGACGGT GACCCTGGTG GCCGCCATCG TCTCGGTGGC CGTCCTGGTC GTCGCCACGA GCGGCTGGGC CGTGCTGCGC CACTACGACG GCAAGGTGAA CCACATCGAG CTCACGTTCT CCGACTCGGC CGCGCGGCCC TCCGCCGCCG GCGGAGGCAC CCAGAACATC CTGCTGGTGG GCTCGGACAC CCGCGCGGGC ACCGGGGGCG AGTTCGGCCA GACCGAGGGG CAGCGCTCGG ACACCACGAT CCTCGCCCAC CTCGACGCCG ACGGCTCGAC GACCCTGGTG TCCTTCCCCC GCGACCTGTG GGTGCAGATC CCGGGCTACA CCGGCTCCGA CGGCACCCAG CACGACGCGC AGAAGTCCAA GCTCAACGCG GCGTTCGCCT ACGGCGGGCC GTCCCTGCTG GTCCGGACCA TCGAGACGCT CACCAACATC CGGATCGACC ACTACCTCGA GATCGACTTC CTGGGCTTCC AGGCGATGAC GGACGCGCTC GGCGGCGTCA CCGTCTGCGT GAAGGAGCTG ACGCCCGAGC TCAAGGCGCA GGGCTTCGAC AACCTCAACG ACCGGTACTC CGGCTGGCAC GGCCAGGTCG GCAACAACAC GCTCACCGGT GAGCAGGCGC TGGCCTTCGT CCGGCAGCGT TACGGCCTGC CCGGCAGCGA CCTCGACCGC ATCCACCGCC AGCAGCAGTT CCTCGGCGCG GTGTTCCGCG AGGTCGCCTC CACCGGGACC CTGCTCAACC CGCGCAAGCT GCTCGACGTC GTGGACGCGG CCACCTCCGC GCTGACCCTG GACGACCACA CCTCGCTCAC TGATCTGCGC CTGCTCGCCG TCCGGATGCA GGGCATCAGC ACCGGTGGGG TCACCTTCGC GACCGTCCCG GCCACGCCGT CACAGGCCGG CGGGCAGTCC GTGCTCCTGG CGAAGACCGA CGAGCTGACG ACGCTGCTCG CCGGGATCGG CGGCTCCCCG CCGCAGGCCG CCGGGCCCCC CGCGCTCGGC CCCGCCGGCT CGCCGTCCGG CTTGACGGCG GCCTCGGCGG CCTCGGCGGC CTCCGTGGGT GCTACGGCCG CGTCCGGTGC CGTGCCGGCG TCCGGTACCG GCCACGGTTC GGTGGTCACC GCGGACCTGC GCGCCACCGG AGGGCGGCCC GCGGGCGGCG CGGTGACCCT GGCCCAGGCC ACCCCCGAGC CGTCCGGCGG GGTGGGCTGC ACCTACTGA
|
Protein sequence | MQGRPAQGRP ARDRPAAGEE NWPAGAWPRH EPRSAPAPIP PTRRLPPPGA TDGGRTWPAP NDGPGGYRAP AGPVPGYGGP RGPYDTPGED LPAEEPHRPV SGVRRTVTLV AAIVSVAVLV VATSGWAVLR HYDGKVNHIE LTFSDSAARP SAAGGGTQNI LLVGSDTRAG TGGEFGQTEG QRSDTTILAH LDADGSTTLV SFPRDLWVQI PGYTGSDGTQ HDAQKSKLNA AFAYGGPSLL VRTIETLTNI RIDHYLEIDF LGFQAMTDAL GGVTVCVKEL TPELKAQGFD NLNDRYSGWH GQVGNNTLTG EQALAFVRQR YGLPGSDLDR IHRQQQFLGA VFREVASTGT LLNPRKLLDV VDAATSALTL DDHTSLTDLR LLAVRMQGIS TGGVTFATVP ATPSQAGGQS VLLAKTDELT TLLAGIGGSP PQAAGPPALG PAGSPSGLTA ASAASAASVG ATAASGAVPA SGTGHGSVVT ADLRATGGRP AGGAVTLAQA TPEPSGGVGC TY
|
| |