Gene Information |
Locus tag | Francci3_0610 |
Symbol | |
ID | 3903478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 693356 |
End bp | 694405 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637877943 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_479723 |
Protein GI | 86739323 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.296446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG CTCAGCGTCC CACCCTCATC GAAGACCCGA TCTCCGAGTT CCGTTCACGC TTCGTGATCG AGCCGCTGGA GCCGGGCTTC GGCTACACCC TCGGTAACTC GTTGCGCCGG ACCCTGCTGT CCTCCATCCC AGGTGCCTCG GTCACCAGCA TCCGCATCGA CGGCGTGCTG CACGAGTTCT CCACCGTGCC CGGCGTCAAG GAGGACGTGA CCGACCTGAT CCTGAACCTC AAGGAACTGG TCGTCAGCTC CGACAACGAC GAGCCGACCG TGATGTACCT GCGCAAGCAG GGCCCCGGCG AGGTCACGGC GGCGGACATC GCCCCGCCGG CGGGCGTCGA GGTGCACAAC CCCGAGTTGC GGCTCGCCAC TCTGAACGAC AAGGGCAAGC TTGAGATCGA GCTGACCGTC GAGCGGGGGC GCGGGTATGT CAGCGCGGCC CAGAACAAGC AGGCCGGCCA GGAGATCGGG CGGATCCCGA TCGACTCGAT CTACTCGCCG GTGCTGAAGG TGACCTACAA GGTCGAGGCG ACCCGCGTCG AGCAGCGGAC CGACTTCGAC CGGCTCATCG TTGACGTCGA GACCAAGCCG TCGATCTCCC CGCGAGACGC GATGGCCAGC GCCGGCAAGA CCCTGGTCGG CCTGTTCGGG CTGGCTCAGG AGCTCAACGC CGAGGCGGAG GGCGTCGACA TCGGCCCGTC CGCCGCGGAC GCGGCGCTGG CGGCCGACCT TGCGCTGCCC ATCGAGGAGA TGGACCTGAC CGTCCGCTCC TACAACTGCC TCAAGCGTGA GGGCATCCAC ACGATCGGCG AGCTGGTCTC CCGTAGCGAG GCCGACCTGC TCGACATCCG TAACTTCGGG CAGAAGTCGA TCGATGAGGT CAAGACGAAG CTGGGCGCGA TGGGCCTGCA GCTCAAGGAC TCCCCGCCCG GGTTCGACCC GCGCCAGGCG GTGGACACCT ACGGCACCGA CGCGTACAGC CCGTCGTTCT CCGACCCGTC CGACGACGGC GCGGAGTTCA TCGAGACCGA GCAGTACTGA
|
Protein sequence | MLIAQRPTLI EDPISEFRSR FVIEPLEPGF GYTLGNSLRR TLLSSIPGAS VTSIRIDGVL HEFSTVPGVK EDVTDLILNL KELVVSSDND EPTVMYLRKQ GPGEVTAADI APPAGVEVHN PELRLATLND KGKLEIELTV ERGRGYVSAA QNKQAGQEIG RIPIDSIYSP VLKVTYKVEA TRVEQRTDFD RLIVDVETKP SISPRDAMAS AGKTLVGLFG LAQELNAEAE GVDIGPSAAD AALAADLALP IEEMDLTVRS YNCLKREGIH TIGELVSRSE ADLLDIRNFG QKSIDEVKTK LGAMGLQLKD SPPGFDPRQA VDTYGTDAYS PSFSDPSDDG AEFIETEQY
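The length fields in this record can be cross-checked against the coordinates. The sketch below is an illustration only, assuming that Start bp and End bp are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 693356
END_BP = 694405


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                    # Gene Length field: 1050
    print(protein_length(length))    # Protein Length field: 349
    print(gc_percent("ATGCTGATCG"))  # first 10-base block of the gene sequence
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared with the GC content field.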