Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0318 |
Symbol | |
ID | 5668742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 381370 |
End bp | 382914 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239249 |
Product | type II secretion system protein E |
Protein accession | YP_001504690 |
Protein GI | 158312182 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0476276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.440624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGCC GCCAGTCGCT CCCCTCCCCG GGGAACCGCC CGCACCTGCC CGGCGTTGCT CGCGCCAGTT CCCCTCTCCC CGGAACGGGT GGAGCCGGCG GGACCGGTGG AATCGGTGGG CTCAGCGCGA CCAGCGGGAT CGACCGAATC GGCCAGATCA GTCAGATCAG TCGATTCGGC GCAATGGCCG GGATGGCGGA GATCGGTGGG ATCACCGGCG GCGGTCCCGG CCGAACAGGC GAACGCGAGG CCGCCCCGAG AATCCCAGGC GCATCCGCGG GCGGCGAGCA ACGACTCGGG CCACTCAGCG TCCTGCTGCG GGACCCGGAG GTCACCGACG TCCTGGTCAA CGGCCCCGAT CAGGTCTGGA TCGAGCGGGC CGGCCGCCTG CGGCACACGG CCGTGCGCTT CCCCGACGAC GCCGCCCTCC GCGGGCTCGC CGTGCGGCTC GCCGGCAGTG GCGGCCGGCG CCTGGACACG GCGATGCCGT TCGTCGACGT CCGGCTGCCC GACGGGATCC GGCTGCACGC GGTTCTCTCC CCGCCCGCGC TCGGCGGGAC GTGCCTCTCG CTGCGCAGGC CACGGGCCCG GCCGTTCACC CTGGACGAGC TCGCCGCGCT GGGAACGATC GGGCCGTCCG AGGTGATCGT GCTGCGCGCC GTCCTCGCCG CCAGGTTGAC GACCGTGATC AGCGGCGGGA CGGGCACCGG GAAGAGCACC CTGCTCGCCG CGCTGCTCGA CGCCGTCCCC GCGGGCGAAC GGATCGTGCT GGTCGAGGAC ACCGCCGAGC TCGTCCTCGC CCGGTTCAAC CTGGTGCGGA TGGAGGCCCG CCCGCCCAAC ATCGAGGGCG CGGGCGAGAT CACCCAACGC GAGCTCGTCC GGCAGGCACT CCGGATGCGC CCGGACCGGC TGGTCGTCGG CGAGGTGCGG GGGCCGGAAG TGCTCGACCT CCTGATGGCG ATGAATACGG GTCACGAAGG GGGGCTCGGT ACGGTTCATG CCAACACAGC CGAGACGGTG CCCGCCCGGT TCGAGGCTCT CGGGGCACTC GCCGGCCTGC CGCGCGCCGC GCTGCACAGC CATCTGGCCG CGGCCGTCCA CGTCGCCGTC CACCTCGGTC GCGACGGCGA CGGCGCCCGC CGCGTCACCG GCATCGGTGT CGTGCGCCGC GACACGGACG GGATCGTGCG GATCGTGCCC GCACTTGCCG GCGGAACCGG CCCGGTCCAC CGCGACACGC ACGGCAACGA CCGAGCACGT GGCGACCAGC GGGCCCACGA CGACCACCGG TCCCACGACG ACCACCGGTC CCGCGGCGGC CAGCGGACGT CCGGCGATCA GCGGCCACCC GGTGGCCGGG GAGTCCACGG CGGGCGTCCG TTCGCCGGCG TCGAGCGCGA GGGGCTTCCC ATCCTGCGGG CGCTCCTGCG CGACCGCGGG TTCTCGCTGT GGCCCTTCGA TCAGGCCACC CCGGTCCGGC GCGGCCCACG GCCGGCCCCG GTCACCGCAC CGCCGCCTCC CGCGAGCGCG CCGGAGGCAC GATGA
|
Protein sequence | MNGRQSLPSP GNRPHLPGVA RASSPLPGTG GAGGTGGIGG LSATSGIDRI GQISQISRFG AMAGMAEIGG ITGGGPGRTG EREAAPRIPG ASAGGEQRLG PLSVLLRDPE VTDVLVNGPD QVWIERAGRL RHTAVRFPDD AALRGLAVRL AGSGGRRLDT AMPFVDVRLP DGIRLHAVLS PPALGGTCLS LRRPRARPFT LDELAALGTI GPSEVIVLRA VLAARLTTVI SGGTGTGKST LLAALLDAVP AGERIVLVED TAELVLARFN LVRMEARPPN IEGAGEITQR ELVRQALRMR PDRLVVGEVR GPEVLDLLMA MNTGHEGGLG TVHANTAETV PARFEALGAL AGLPRAALHS HLAAAVHVAV HLGRDGDGAR RVTGIGVVRR DTDGIVRIVP ALAGGTGPVH RDTHGNDRAR GDQRAHDDHR SHDDHRSRGG QRTSGDQRPP GGRGVHGGRP FAGVEREGLP ILRALLRDRG FSLWPFDQAT PVRRGPRPAP VTAPPPPASA PEAR
|
| |