Gene Information |
Locus tag | Franean1_4882 |
Symbol | |
ID | 5673222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5856509 |
End bp | 5858314 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641243737 |
Product | ATPase central domain-containing protein |
Protein accession | YP_001509153 |
Protein GI | 158316645 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1222] ATP-dependent 26S proteasome regulatory subunit |
TIGRFAM ID | |
|
|
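The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 1806 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length(5856509, 5858314)
residues = protein_length(length)
print(length, residues)
```

With the values from this record, the two results agree with the Gene Length and Protein Length fields.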
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000159946 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.231188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGGTC CCCGTTCGGG CTCTGGCTCC GGTGGGAGCA CGGGTCGTCC TGGTGACGCC GATTCTCAAC GGTCGGCGTA CGAGAAGGAA GTACACGAAC TCACGACTCA GGTCACCTTC CTGGAGGAAG AAGTGGCCAT GCTGCGGCGG AGGCTGTCTG AATCGCCCCG ACAGGTACGT GTCCTGGAGG AGCGACTAGC CCAGGTACAG GTGGAGCTAC AGACCGCCAC TGGGCAAAAC GACAAGCTCG TCGCCACCCT TCGGGAGGCA CGTGACCAGA TCATCTCGTT GAAGGAGGAG GTCGACCGGC TCGCGCAACC GCCGAGCGGG TACGGCGTCT TCATCCGTGG CTACGACGAC GGCACGGTCG ACGTGTTCAC GCAGGGCAGA AAGCTCCGCG TGACGGTGTC GCCGAACGTC GAAGCCGACG TCCTGCAGCC CGGTCAGGAG GTCATGCTCA ACGAGGCGCT CAACGTCGTG GAGGTTCGCG CGTTCGAGCG GCAGGGCGAG ATCGTCCTGC TCAAGGAGGT CCTCGAGAGC GGTGACCGGG CGCTGGTGAT CGGTCACACC GACGAGGAAC GGGTCGTCAT GCTGGCCCAG CCACTCCTCG ACGGCCCGAT CCGAGCCGGC GACTCGCTCC TCATCGAGCC ACGGTCGGGG TACGCCTTCG AGCGGATCCC CAAGTCCGAG GTCGAGGAGC TGGTCCTCGA AGAGGTCCCA GACATCGGCT ACGAGCAGAT CGGCGGCCTG AAGGGCCAGA TCGAGTCGAT CCGCGACGCG GTCGAGCTGC CGTTCCTCTA CAAGGAACTG TTCCTGGAGC ACAAGCTCAA GCCACCGAAG GGCGTGCTGC TCTACGGCCC GCCCGGCTGT GGCAAGACGC TGATCGCCAA GGCCGTGGCG AACTCCCTGG CCAAGAAGGT CGAGGCACAG ACAGGCCAGG GCTCCGGCCG GGCCTTCTTC CTCAACATCA AGGGCCCGGA GCTGCTCAAC AAGTACGTAG GCGAGACCGA GCGGCAGATC CGGCTGGTGT TCCAGCGGGC GCGCGAGAAG GCGTCCGAGG GCATGCCGGT GATCGTGTTC TTCGACGAGA TGGACTCGAT CTTCCGGACC CGTGGCTCGG GTGTCTCCTC GGACGTGGAG AACACGATCG TCCCGCAGCT GCTCAGCGAG ATCGACGGCG TTGAGCAGCT CGAGAACGTC ATCGTGATCG GCGCGTCCAA CCGAGAGGAC ATGATCGACC CGGCGATCCT GCGGCCGGGC CGGCTCGACG TGAAGATCAA GGTCGAGCGT CCGGACGCCG AAGCGGCCAA GGACATCTTC GCCAAGTACG TCCTGCCCGA GCTCCCGCTG CACGCCGACG ACCTCGCCGA GCACGGAGGT AACCGGGAGG CGACCTGCCA GGGCATGATC CAGCGGGTCG TCGAGCGGAT GTACGCCGAG AGCGAGGAGA ACCGCTTCCT CGAGGTCACC TACGCCAACG GTGACAAGGA GGTCCTGTAC TTCAAGGACT TCAACTCGGG CGCGATGATC GAGAACATCG TGGCCCGGGC GAAGAAGATG GCGGTGAAGG ACCTCATCGA GAGCGGAGTC CGCGGCCTGC GCATGCAGCA CCTGCTGTCG GCGTGCCTGG ACGAGTTCAA GGAGAACGAG GACCTGCCGA ACACCACGAA CCCGGACGAC TGGGCCCGGA TCTCCGGCAA GAAGGGTGAG CGGATCGTCT ACATCCGCAC ACTCGTCACC GGAACCAAGG GCACCGAGGC CGGGCGGTCG ATCGACACCA TCGCGAACAC CGGCCAGTAC 
CTCTAG
|
Protein sequence | MSGPRSGSGS GGSTGRPGDA DSQRSAYEKE VHELTTQVTF LEEEVAMLRR RLSESPRQVR VLEERLAQVQ VELQTATGQN DKLVATLREA RDQIISLKEE VDRLAQPPSG YGVFIRGYDD GTVDVFTQGR KLRVTVSPNV EADVLQPGQE VMLNEALNVV EVRAFERQGE IVLLKEVLES GDRALVIGHT DEERVVMLAQ PLLDGPIRAG DSLLIEPRSG YAFERIPKSE VEELVLEEVP DIGYEQIGGL KGQIESIRDA VELPFLYKEL FLEHKLKPPK GVLLYGPPGC GKTLIAKAVA NSLAKKVEAQ TGQGSGRAFF LNIKGPELLN KYVGETERQI RLVFQRAREK ASEGMPVIVF FDEMDSIFRT RGSGVSSDVE NTIVPQLLSE IDGVEQLENV IVIGASNRED MIDPAILRPG RLDVKIKVER PDAEAAKDIF AKYVLPELPL HADDLAEHGG NREATCQGMI QRVVERMYAE SEENRFLEVT YANGDKEVLY FKDFNSGAMI ENIVARAKKM AVKDLIESGV RGLRMQHLLS ACLDEFKENE DLPNTTNPDD WARISGKKGE RIVYIRTLVT GTKGTEAGRS IDTIANTGQY L
|
| |
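The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below is a minimal, hedged example of stripping the block spacing, computing a GC fraction, and inspecting the first and last codons. The helper names are hypothetical, and only the opening and closing blocks of the gene sequence are used so that the example stays short. Although the gene lies on the minus strand, the sequence as shown appears to be in coding orientation already: it begins with GTG, which translation table 11 accepts as an initiation codon read as Met, and ends with the stop codon TAG.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks; return upper-case sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def first_codon(seq: str) -> str:
    """First three bases of the cleaned sequence."""
    return clean(seq)[:3]


def last_codon(seq: str) -> str:
    """Last three bases of the cleaned sequence."""
    return clean(seq)[-3:]


# First two and last two blocks of the gene sequence in this record.
head = "GTGTCCGGTC CCCGTTCGGG"
tail = "CGGCCAGTAC CTCTAG"

print(first_codon(head), last_codon(tail), gc_fraction(head))
```

Running `gc_fraction` on the complete gene sequence, rather than on these two blocks, would give a value to compare with the GC content field of the record.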