Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3833 |
Symbol | |
ID | 3905581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4594750 |
End bp | 4595634 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637881159 |
Product | RNA polymerase sigma factor SigE |
Protein accession | YP_482912 |
Protein GI | 86742512 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.326287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0286461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCAG CGGGATTCGC CGCTCCCTGG CCGGCTCGTC GGGTACGTTT GACGCGAACC GACCGAGAGG CGGCCGTGGC TGCTCAGGAC CCCGGGGCGG GCTCGGACGC CGCCGGCGTC GCCGCCACCG CAGAAGCTTC TGGTTGGGTT CCTCCTTCCT GGGAAGACGT GGTCCGTGAG CACGGCAACC GGGTCTATCG CCTCGCCTAC CGCCTCACCG GTAACGCCCA CGACGCCGAG GACCTGACGC AGGACGTGTT CGTCCGGGTG TTCCGGTCGT TGGCGGACTA CACCCCCGGC ACCTTCGAGG GGTGGCTGCA CCGGATCACC ACCAATCTCT TCCTCGACCG CATGCGTCGC CAGCAGAAGA TCCGGTTCGA TGCGCTGCCC GAGGACACCG AGCGGCTGGC CGGCCGCGAG GCGAGCCCGG AGGCGGTCTA CGCCGAGGCT CATCTCGACG CGGACGTCGA GGCCGCGCTC GCCGCTCTGC CGCCCGACTT CCGGGCCGCC GTCGTGCTGT GCGACATCGA GCAGCTCTCC TACGAAGAGA TCGCTCAGAC GTTGGGCGTG AAACTGGGCA CAGTGCGCAG CCGGATCTCG CGCGGTCGGG CGATGTTGCG CGCGGCGCTG GCTGATCGCG CGCCACGTAC TGGTGACGGC GCCGCGTTGT CCGCCGCCGT GACGGTCTCG GACGACGCCC TTCTCACCGC GGAGCCCGCC GTCGGGCCGG TGGCGGCGGA GCTCGCCGAG GCGGACGTGG CGCCGCCGGA CACGACCAGA CCGCGTCGGG ACCGAGGCGA CGACGGGCGG CCCAGAAGCA GGGGACGGCA TGGGCCCGTC GCCGCCGGGA GCAAGGGTGG TGCTCCGCGG GAGCGGCGGG CGTGA
|
Protein sequence | MSAAGFAAPW PARRVRLTRT DREAAVAAQD PGAGSDAAGV AATAEASGWV PPSWEDVVRE HGNRVYRLAY RLTGNAHDAE DLTQDVFVRV FRSLADYTPG TFEGWLHRIT TNLFLDRMRR QQKIRFDALP EDTERLAGRE ASPEAVYAEA HLDADVEAAL AALPPDFRAA VVLCDIEQLS YEEIAQTLGV KLGTVRSRIS RGRAMLRAAL ADRAPRTGDG AALSAAVTVS DDALLTAEPA VGPVAAELAE ADVAPPDTTR PRRDRGDDGR PRSRGRHGPV AAGSKGGAPR ERRA
|
| |