Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1237 |
Symbol | |
ID | 5669650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1484052 |
End bp | 1485323 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240169 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001505597 |
Protein GI | 158313089 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.002863 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCTGC CTGCCCTGGA ACTAGCCGAG CGCACCGACG AGTCACGCCC GCGCCCGCGG CGTACCCGTC GCTCCGCTTC TCCTGTCCGT AACACTCCAA GCCGCACGCT GGCCGCGGTC CCCGACGAGC TCGACGAGCT CGACGTCTCC GCCATCGCGG AACTCATCGC GCGTGGCCGC GAGACCGGCG AGCTCAGCCG TTCCGAGCTC CGTGAGGCTC TCGAGGCCGC CGACATCGGC GTCGAGCTGC TGCCCGCGCT GATCTCCCGC CTGGGTGCCG CCGGGATCGA CCTCGTCGAA GAAGAAGAGG AGCGCGTCAC CCCCGGCGCT CCGGCCGCCG GCCGCACGGT CGCCGACCAC GCCGGCACCG CCGACCTCGT CCGCATGTAC TTGCGGGAGA TCGGCAAGGT CCCGCTGCTC AACGCCGCTC AGGAGGTCGA GCTCTCCAAG CGCGTCGAGG CGGGCCTGTT CGCCGAGCAC AAGCTCGACA CCGACCAGGA CCTGGCCGAC GACCTGCGCC GGGATCTCGG CGTGCTGGTC ACCGACGGCC AGGCTGCCAA GCAGCAGCTG GTCTCGGCCA ACCTGCGCCT GGTGGTGTCG GTCGCCAAGA AGTACAGCGG CCGGGGTATG ACGCTGCTGG ACCTCGTCCA GGAGGGAAAC CTGGGCCTGA TCCGCGCGGT CGAGAAGTTC GACTACGCGA AGGGCTACAA GTTCTCCACC TACGCCACCT GGTGGATCCG TCAGGCGATC GGCCGCGCGC TGGCCGACCA GGCCCGCACG ATCCGTATCC CGGTGCACGT GGTCGAGCAG ATCAACAAGA TCACCCGGCT GCAGCGCCAG CTTGTCTCGA CCCTCGGCCG CGAGCCGACG GACGAGGAGC TCGCGCTCGA GCTGGACATG CCGATCGAGC AGGTCGTCGA ACTGCGCCGC TACGCGCAGG ACACGGTCAG CCTGGAGACG TCCGTCGGTG ACGACGGCGA CTCCGTGCTC GGCGACTTCA TCGAGGACTC GGACGCGACG TCGCCCGCCG ACGCCGCCTC CTACGGCGCC ATGCAGGACG AGATCGACAA CGTCCTCGGT GCGCTGAACC CTCGTGAGCG CGAGGTCATG CGGCTGCGTT TCGGGCTCGC CGACGGGAAG CAGCACACCC TCGCCGAGGT CGGCAACCGG CTCGGGCTCA CCCGTGAGCG CATCCGCCAG ATCGAGCGGG ACACGCTGCG GGAGCTGCGC AAGCCGGCCG TGGCCGGAAG GCTGCGCGAG TTCCTCGACT GA
|
Protein sequence | MTLPALELAE RTDESRPRPR RTRRSASPVR NTPSRTLAAV PDELDELDVS AIAELIARGR ETGELSRSEL REALEAADIG VELLPALISR LGAAGIDLVE EEEERVTPGA PAAGRTVADH AGTADLVRMY LREIGKVPLL NAAQEVELSK RVEAGLFAEH KLDTDQDLAD DLRRDLGVLV TDGQAAKQQL VSANLRLVVS VAKKYSGRGM TLLDLVQEGN LGLIRAVEKF DYAKGYKFST YATWWIRQAI GRALADQART IRIPVHVVEQ INKITRLQRQ LVSTLGREPT DEELALELDM PIEQVVELRR YAQDTVSLET SVGDDGDSVL GDFIEDSDAT SPADAASYGA MQDEIDNVLG ALNPREREVM RLRFGLADGK QHTLAEVGNR LGLTRERIRQ IERDTLRELR KPAVAGRLRE FLD
|
| |