Gene Information |
Locus tag | Franean1_0974 |
Symbol | |
ID | 5669388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1138756 |
End bp | 1139886 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239902 |
Product | FliA/WhiG family RNA polymerase sigma factor |
Protein accession | YP_001505336 |
Protein GI | 158312828 |
COG category | [K] Transcription |
COG ID | [COG1191] DNA-directed RNA polymerase specialized sigma subunit |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family; [TIGR02980] RNA polymerase sigma-70 factor, sigma-B/F/G subfamily |
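
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Minimal sketch: relate the record's coordinates to its length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the protein length does not count the stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1138756, 1139886)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both values agree with the Gene Length and Protein Length fields of this record.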
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.279124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTTCAG CCGGCAGAAC ACCCACCACC CCTCGTACGG TCCCCACGAA CCGCGCGGGC GACCCGGTCG CGGCACTCAG CCAGCGCGGT GCCGGCGACA TCACCAGCAC CGACATCGCC AGCACCGACA TCGCCAGCAC CGACATCGCC AGCGCTGACA TCGCCAGCGC TGACATCGCC GGCCCCGACG TCGGCGGGAT CACCGGGACG GCGCACGAGG CCGACGGCCT GACCGAACGC GGATCCGGGC CTGGTGCCGA GCACGGTGGT CCACCGCCCG CCTCAGCCGT GGGCGAGCAG GCCCAGCCCG ACGAGGCCGT CGACGCTGCC GGTGCGGGCT CCCGCTCCGG CTCGTCCGGG CACGGCGCCG GGTCGCCGGA TCGGATCCGG GCCCGCGCGC TGTTCGTCCG GCTGGTGTCG CTGCCCGAGG GGGACCCGGA ACGGGCCGCC CTGCGTGACC AGCTCGTCCG CATGCACCTT CCCCTCGTCG AGTACCTCGC CCGGCGGTTC CGAAACCGCG GCGAGCCGCT CGACGATCTG GTGCAGGTCG CGACCATCGG GCTGATCAAA TCCGTCGACC GGTTCGACCC GGAGCGCGGG GTCGAGTTCT CGACCTACGC GACCCCGACC ATCGTCGGGG AGATCAAACG GCACTTCCGC GACAAGGGCT GGGCGATCCG GGTGCCCCGT CGGCTCCAGG AGCTCAAGCT CTCGCTGACG AAGGCGACCT CCGAGCTGTC CCAGTCGCTG GGCCGCTCGC CGACGGTCAG CGAGATCGCC CGTCACCTGG AGATGAGCGA GGAAGAGGTC CTCGAGGGCC TCGAGTCGGC GAACGCCTAC TCGGCCGTCT CGCTGGACGC GCCCGACTCC GGGGACGACG AGGCTCCGGC CGTCGCCGAC ACCCTGGGGG TGCAGGACGA GTCGCTGGAG GGCGTGGAGT ACCGCGAGTC CCTCAAGCCG CTGTTGGAGA AGCTTCCCCC GCGGGAGAAG CGCATCCTGC TGCTCCGCTT CTTCGGCAAC ATGACCCAGT CGCAGATCGC GAACGAGCTC GGCATCTCGC AGATGCACGT GTCCCGGCTG TTGGCCCGCA CGCTGGCCCA GCTCCGCCGC GGGCTACTGG AAGACGGCTG A
|
Protein sequence | MTSAGRTPTT PRTVPTNRAG DPVAALSQRG AGDITSTDIA STDIASTDIA SADIASADIA GPDVGGITGT AHEADGLTER GSGPGAEHGG PPPASAVGEQ AQPDEAVDAA GAGSRSGSSG HGAGSPDRIR ARALFVRLVS LPEGDPERAA LRDQLVRMHL PLVEYLARRF RNRGEPLDDL VQVATIGLIK SVDRFDPERG VEFSTYATPT IVGEIKRHFR DKGWAIRVPR RLQELKLSLT KATSELSQSL GRSPTVSEIA RHLEMSEEEV LEGLESANAY SAVSLDAPDS GDDEAPAVAD TLGVQDESLE GVEYRESLKP LLEKLPPREK RILLLRFFGN MTQSQIANEL GISQMHVSRL LARTLAQLRR GLLEDG
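
The protein sequence follows from the gene sequence through translation table 11, and the GC content can be computed from the same string. The sketch below shows one way to do both in plain Python. It assumes the sequence is given 5' to 3' on the coding strand, in the space-separated blocks of 10 used in this record. Only the first 24 nucleotides are embedded for brevity. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code, and the first codon (here GTG) is read as methionine on the assumption that it is a valid initiator.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator and is read as M.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the gene sequence in this record.
prefix = "GTGACTTCAG CCGGCAGAAC ACCC"
print(translate_cds(prefix))  # MTSAGRTP
print(gc_fraction(prefix))
```

The translated prefix matches the first eight residues of the protein sequence above. Passing the full gene sequence to the same functions would give the GC fraction and translation for the whole CDS.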