Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7240 |
Symbol | |
ID | 5675541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8841018 |
End bp | 8842592 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641246077 |
Product | putative RNA polymerase sigma factor |
Protein accession | YP_001511465 |
Protein GI | 158318957 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00570923 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.19325 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGGT CAGGAACGCC GTCACGGGGA GATGCTGAGC GGATGAACGA CGACGCGGAC CTGGTGGCGC GGGTTCGGCG CGGCGATCGT CAGGCTTTCG ATGACCTGTA CAACCGCTAC GCCGACGACG TCTTCTCGAT GTGCCTGCTG ATCCTCGGCG ACCCGACGGT CGCGCGGGCG GCCGCCGGCA CCGCGTTCGC CCTCGTGGCG AGGACGAGGC TCAACCCGCT CTCGGACCCG ACCCGACTGC GCCCCTGGCT ACTGGAGCTC GCCCGCGGCA GCGCGCTGGC CTGGTCGGGA TCCCCGCAGG CGCGCAGCGT TCCCGTCCCG CACGGCGTCT CACCCGAGGA GCTGCTCGAC GGCGCCGTGG TGCCCGCGCC GGCCAGCCTG CGGGCCGGGC TGGAGCGCAC CTTCGACCGC GCGGCCGTCG CCGCCGAAGC CGCCGCGGCG GCCGCGAGCC GCCGAGCCGC CGAGCGCGCC GGCTCCGAGT TCGCCGGTAC CGAGCCTGCG GGCGTCGCGC CTGCCGGAGT CGACGACCTG GCCGCCGCGG CGGGCCTGGC TGCCGGCGGG GTGCCGGGCC ATCCCGGTGC CGCCGCCCCG AGCCGCCCGG CCGTCCCGGC CTTCACCGCC TCGGCCTTCC CGACCCAGGA CCGTCGCGGT CCGGCACCGG TCGCGGACGG TCGCCCCGCG GGCGGCTACG CCCCTGAGGA CGCCGTAACC ATCATCTCGC CGTCCACCGG GCAGTTGCGC ACGCAGCGCC AGGACGACCA GCATGAGCAG ACCATGGCAC CGGCCGGCGC CGGCGCCCAG GACCGGTACG GGAACGTCCT CCCGCTGATC CCCGTGGACC GCGCCTACCC GGGTGACGCC GCAGGGTCCG GCGGGTTCGC CGACTTCCGG GCCCGGCCCG CGATCGCCGT GGCCGCCGCC CTGGTCGTCG CCGTCGCCGG GCTCACCGCC GTCATCTCCT GGCCCGCGAG CGAGGCCGCG CTGGTCGCCG ACGGCGGCCC CGGCATCGTC GCGATCGCCC CCTCGGCGCC ACCCGGCCCC CTGCCGACGA TCGCCCGGCC GCCGGTCGCG TCCGAGGCGC CCTCGCCCAC TGTGGTGCCG ACCGTCGCGG GCGCCTACAC GAGCCGTCCC GGTGTATCCC GCCCCGGCCA GGTCGTCCGC GAGGTCGACA CCGGCCAGCG CGGCGCCGAC CAGCCCGCCG CACCGCCCGC GCCGGCCCCG AGCACACCGC CGCCGCCCAC CACGAACGCG CCCGCGCCGC CGGCGACGAC CCCGCCGACG ACACCGCCCG CGACGACCGG CCCCACCACG ACCGCCGGCC CGACGACGGC GCCCCCGGCC ACCACGGGAC CCCCGGCCAC GGGCACCCCG ACGACGCCGC CGGACGGCGA GGCTCCGCCG CCGGCGACGA CCTCCGAGCC CGCCCCGCCC CCGAGCACCG CTCCGGCCCC GGCGCCGCCG ACCAGTGAGG CACCGGCCCC GCCGGCGACG ACCGCACCGG GACCGGCCGG CGACCAGCGC CTCCCACCGG GCACCCGAGA CGTGCAGAGC CCGTTCCCGG TCTAA
|
Protein sequence | MPRSGTPSRG DAERMNDDAD LVARVRRGDR QAFDDLYNRY ADDVFSMCLL ILGDPTVARA AAGTAFALVA RTRLNPLSDP TRLRPWLLEL ARGSALAWSG SPQARSVPVP HGVSPEELLD GAVVPAPASL RAGLERTFDR AAVAAEAAAA AASRRAAERA GSEFAGTEPA GVAPAGVDDL AAAAGLAAGG VPGHPGAAAP SRPAVPAFTA SAFPTQDRRG PAPVADGRPA GGYAPEDAVT IISPSTGQLR TQRQDDQHEQ TMAPAGAGAQ DRYGNVLPLI PVDRAYPGDA AGSGGFADFR ARPAIAVAAA LVVAVAGLTA VISWPASEAA LVADGGPGIV AIAPSAPPGP LPTIARPPVA SEAPSPTVVP TVAGAYTSRP GVSRPGQVVR EVDTGQRGAD QPAAPPAPAP STPPPPTTNA PAPPATTPPT TPPATTGPTT TAGPTTAPPA TTGPPATGTP TTPPDGEAPP PATTSEPAPP PSTAPAPAPP TSEAPAPPAT TAPGPAGDQR LPPGTRDVQS PFPV
|
| |