Gene Information |
Locus tag | Franean1_5199 |
Symbol | |
ID | 5673533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6239938 |
End bp | 6240939 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244053 |
Product | ECF subfamily RNA polymerase sigma-24 factor |
Protein accession | YP_001509463 |
Protein GI | 158316955 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACA GGCCCGACGC GGGCGGCCCC GGTGCGCTGG CCGAGGCGTT CGAGGCGGAG CGCGGGTACC TGCGCGCCGT GGCCTACCGC ATCCTCGGCT CGGTCACCGA CGCCGAGGAC ATCGTCCAGG ACGCCTGGCT GCGCCTCGCC CGGACCGACC CGGCCGCCAT CGAGGACCTG CGCGGATGGC TGACCGTGGT CGTCGGCCGG CTCTGCCTCG ACCATCTCCG CTCGGCTCGG GTCCGGCGGG AGACCTACGT CGGGCCGTGG CTGCCCGAGC CGCTGGTCGA CCCGTCCGGT CTGGGCGGGT CGGGTGGGTC GGCCGGCGGG GGCGGCGCGA CCGGCCCGGT CGGCGAGGTC ACGGCCGTGG CCGCCGAGCG GGCCGACCCG GCGGACCGGG TCACCCTCGC CGAGTCGGTC AGCATGGCCA TGCTGGTCGT CCTGGAGTCA CTGAGCCCGG CCGAGCGGAC CGCGCTGATC CTGCACGACG TCTTCGGCTA CGGCTTCGAG GAGGTCGCCG AGGTGACCGG CCGCAGCCCG GCCGCCAGCC GGCAGCTCGC CAGCCGCGCC CGGCGGCACG TGCGGGAGCG GGCCGTCCGC TTCGATCCCG ACCCGGCCCA GCGGCGCGGT GTCGCCGATG CGTTCCTGGC GGCCGCCGCC GGCGGTGACC TGGCCGCGCT GCTGCGCGTC CTCGACCCGG ACGTCGTGCT GCGCTCGGAC GGCGGCGGTG TGGTGCGCGC CGCGCTGCGC CCGATCGACG GGGCGGACAA GGTGGCGCGG TTCCTACACG GCCTGATCGA GAAGGGCCGC CGGCAGTACG GGGCCGCGGT GCGCTTCGTC CCGGTCGAGG TCAACGGGGG AGCGGGTATC GCCACGTACA CCGGTCCGCG GCTGGTGAAC GTCGTCGCGC TCACGGTGTG GCGCGGGCTG GTCACGGAGA TCGACGTCGT GGTCAACCCG GCGAAGCTGC GCCATCTCAC GCAACCGCCA CACGGCGGCT GA
|
Protein sequence | MTDRPDAGGP GALAEAFEAE RGYLRAVAYR ILGSVTDAED IVQDAWLRLA RTDPAAIEDL RGWLTVVVGR LCLDHLRSAR VRRETYVGPW LPEPLVDPSG LGGSGGSAGG GGATGPVGEV TAVAAERADP ADRVTLAESV SMAMLVVLES LSPAERTALI LHDVFGYGFE EVAEVTGRSP AASRQLASRA RRHVRERAVR FDPDPAQRRG VADAFLAAAA GGDLAALLRV LDPDVVLRSD GGGVVRAALR PIDGADKVAR FLHGLIEKGR RQYGAAVRFV PVEVNGGAGI ATYTGPRLVN VVALTVWRGL VTEIDVVVNP AKLRHLTQPP HGG
|
| |
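
The fields in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that recomputes gene length from the start and end coordinates, derives the protein length, computes GC content, and translates a coding sequence with NCBI translation table 11. All function names are hypothetical and chosen for illustration. It assumes the displayed gene sequence is already the coding strand (it begins with GTG and ends with TGA even though the gene lies on the minus strand), and that an alternative start codon such as GTG is read as methionine at the first position.

```python
# Hypothetical helpers for sanity-checking a gene record; names are illustrative only.

BASES = "TCAG"
# Amino acids of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the whitespace used to group bases into blocks of 10 and uppercase."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of residues for a CDS whose length includes the stop codon."""
    return gene_len_bp // 3 - 1


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # alternative starts are read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(gene_length(6239938, 6240939))                   # 1002
print(protein_length(1002))                            # 333
print(translate("GTGACCGACA GGCCCGACGC GGGCGGCCCC"))   # MTDRPDAGGP
```

Passing the full gene sequence from the record to `gc_content` and `translate` should allow the reported GC content and protein sequence to be compared with the recomputed values.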