Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4379 |
Symbol | |
ID | 5672732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5225702 |
End bp | 5226904 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641243248 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001508665 |
Protein GI | 158316157 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.222017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCGGT CTTTCCGCGG ACATGAAGAT GCGCCGCCCA CGGATCTGAG CGCCCCCGAC GAAGCCGCGG AGGAAAAGAT CCGATCCCTC ATCGCCCGGG GTAAGGAGAA CGGTTTCGTC ACCCCGGACG ACATTGCCGC TGCGCTACTC GCAGCGGAGC TGCCGCCAGA GAGCAGCGAC GTTGTCCTAC GGCTACTCGC GGAGGACGGC ATCGAGGTCC TCGACGAGGT GGGCGGGGAC GCTTCAGACA TGCCTAGCCG GCGTCGTGAG GGAGAGGAAC TTGCGCTTAC GACGCCACCC TCGGACCCGG TACGGATGTA TCTCAAGGCC ATCGGCCGGG TGCGGCTGCT GACCGCAGAG GAAGAGGTCG ACCTAGCGAA GCGGATCGAG GCGGGTCTGT TCGCCTCCGA GAAGCTCGCC GCCATCCGAA GGACCTCCCC GCGGCTGCGT CGGGACTTGG AGGCGATCGA GCAGGACGGT CAGATCGCCA AGCGCAAACT GGTGGAGGCG AACCTGCGCC TCGTGGTGTC CATCGCTAAG CGGTACGTCG GCCGGGGCAT GCTGCTGCTG GACCTGATCC AGGAGGGCAA CCTGGGCCTG ATCCGTGCGG TGGAGAAGTT CGACTACACC AAGGGATACA AGTTCTCCAC CTACGCCACC TGGTGGATCC GGCAGGCTGT CACGAGGGCC ATCGCGGATC AGGGGCGCAC CATCCGGATT CCGGTACACA TGGTCGAGAC AATCAACAAG GTCACCCGGA TCCAGCGGCA ACTATTGCAG GATCTGGGCC GGGAGCCTTC GCCGGAAGAG ATCGCCACAC AGGTAGACCT CGCACCGCAC AGAGTGGAGG AAATTCTCAA AGTCGGACAG ACACCGGTCG CCCTGGAGAC CCCGATCGGC GAGGAGCAGG ACTCCCAGCT CGGAGACTTC ATCGAGGACA ACGACGCGAT TGTGCCGTTC GAAGCGGCAA GTTTCGTCCT CCTGCAGGAG CAAATCGACT CGGTCCTACA CACGCTGTCC GAGCGGGAGA AGAAAGTCAT CCAGCTCCGG TTCGGTCTGA CTGACGGCCA GCCTAGGACG TTGGAGCAGG TAGGCCGGGA ATTCGGGGTG ACCCGGGAAC GGATCCGGCA GATAGAATCA AGAACACTGG CGAAGCTAAG CCACCCGGCG CGTTCACAAC GGCTACGCGA CTACCTGGTA TAG
|
Protein sequence | MGRSFRGHED APPTDLSAPD EAAEEKIRSL IARGKENGFV TPDDIAAALL AAELPPESSD VVLRLLAEDG IEVLDEVGGD ASDMPSRRRE GEELALTTPP SDPVRMYLKA IGRVRLLTAE EEVDLAKRIE AGLFASEKLA AIRRTSPRLR RDLEAIEQDG QIAKRKLVEA NLRLVVSIAK RYVGRGMLLL DLIQEGNLGL IRAVEKFDYT KGYKFSTYAT WWIRQAVTRA IADQGRTIRI PVHMVETINK VTRIQRQLLQ DLGREPSPEE IATQVDLAPH RVEEILKVGQ TPVALETPIG EEQDSQLGDF IEDNDAIVPF EAASFVLLQE QIDSVLHTLS EREKKVIQLR FGLTDGQPRT LEQVGREFGV TRERIRQIES RTLAKLSHPA RSQRLRDYLV
|
| |