Gene Information |
Locus tag | Franean1_6624 |
Symbol | |
ID | 5674939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8056687 |
End bp | 8057916 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245475 |
Product | RNA polymerase sigma 70 family subunit |
Protein accession | YP_001510867 |
Protein GI | 158318359 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
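The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(8056687, 8057916)
print(length)                              # 1230
print(expected_protein_length_aa(length))  # 409
```

Under these assumptions both results agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.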
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCAG TTGAACCCCT CGGCGGGGGC CAGTGGCGCG AGCTCGCGCG ACAGGCCCTC GCCCGGCTGC TGCGCAGCCA TGGCAGCGCC CAGTTCGACC TGTGCGAAGA CGCCGTCCAG GAGGCGCTCC TGCAGGCGTA CCAACAGTGG CCGGCGCGGT TTCCGGACGA CCCGTTGGGC TGGCTGATCG CTACCGCCCG CCGCCGGTAT GCCGACCGTG CCCGCACTGA CGCCCGGCGC CGACACCGCG AGGCGCGCGT CGCGTCGTTG CAGGCTCCGG TGATGCCGGA GGCCGTTCAC CGGGACGATT CGCTGCTGGT CCTCCAGCTG TGTTGCCACC CCGACCTGCC CCGCTCCGGA CAGGTAGCGC TGACCCTCCG GGCGGTCGCC GGGCTGACCA CCGCCCAGAT CGCCAACGTC TACCAGCTCC CCGAGCGCAC CATCGCCCAA CGGATCACCC GGGCCAAACG CCGCGTCAGC GAACTCGGCC GGCCGCTGCC GCCGCCCGGA CACGCGGGCG AGGGCGTCAC TGCCGTACTC GACGTGCTCT ACGTGATGTT CGCCGAGGCG CACCACACGA CCGCCGGAGC GCCTCCCCGC GACGCGGGTC TCGCCGCCGA GGCGATTCGC CTGGCTCGGC TCCTTCGTCG CAGCGTCCCT GAGTCCACCG AGACCACCGG GCTTCTCGCG CTGATGCTGC TCACCGAGGC CCGCCACCCA GCCCGGGTGG CGCAGGACGG GCACCTGACG TCACTCGACG AGCAGGATCG GTCGCTGTGG GACCAGGAGC TGATCAACGA AGGGATCGCG CTCGTCGAGC AAGCCGCCCG GGGTGCCGAA CCCAGCCCTT ACCTGCTCCA GGCCTGCATC GCGGCTCTGC ATGCCGAAGC CCCCGACATC GCGACAACCG ACTGGGACGA GATCCTCGCG CTCTACCGGC TTCTTGAGAT CGTTACCGGC CACCAGAACC CCACGGTCAC CCTCAACAGC ATCGTCGCCC AGGCCATGGT CGACGGCATC GATGTCGCCC TCGCCCGGAT CGACGCGCTG GAAGCCGACC ACCCTGGCCT TCCCCGCATC GACTCCGTAC GCGCCCACCT CCTGGAGCGG GCCGGCAGGA CGGACGACGC GGCCGGGGCC TACCGGCGAG CCATCGTCGG CACCGTCAGC CTCGCCGAGC AACGCCACCT CAGGCGACGT CTACGCCGCC TGTCTGGCGC GCACGTGTGA
|
Protein sequence | MIPVEPLGGG QWRELARQAL ARLLRSHGSA QFDLCEDAVQ EALLQAYQQW PARFPDDPLG WLIATARRRY ADRARTDARR RHREARVASL QAPVMPEAVH RDDSLLVLQL CCHPDLPRSG QVALTLRAVA GLTTAQIANV YQLPERTIAQ RITRAKRRVS ELGRPLPPPG HAGEGVTAVL DVLYVMFAEA HHTTAGAPPR DAGLAAEAIR LARLLRRSVP ESTETTGLLA LMLLTEARHP ARVAQDGHLT SLDEQDRSLW DQELINEGIA LVEQAARGAE PSPYLLQACI AALHAEAPDI ATTDWDEILA LYRLLEIVTG HQNPTVTLNS IVAQAMVDGI DVALARIDAL EADHPGLPRI DSVRAHLLER AGRTDDAAGA YRRAIVGTVS LAEQRHLRRR LRRLSGAHV
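The relationship between the gene sequence, the GC content field, and the protein sequence can be explored with a short script. This is a minimal sketch and not the database's own pipeline: the helper names are hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only the first 30 bases of the gene sequence are used here; pasting the full sequence in the same space-grouped form should work the same way.

```python
# Codon table built from the NCBI layout (bases ordered T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the grouping spaces used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
prefix = "ATGATCCCAG TTGAACCCCT CGGCGGGGGC"
print(translate(prefix))             # MIPVEPLGGG
print(round(gc_percent(prefix), 1))  # 66.7
```

The translated prefix matches the first ten residues of the protein sequence listed in the record.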