Gene Information |
Locus tag | Franean1_5983 |
Symbol | |
ID | 5674304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7291658 |
End bp | 7292599 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244831 |
Product | PGAP1 family protein |
Protein accession | YP_001510233 |
Protein GI | 158317725 |
COG category | [R] General function prediction only |
COG ID | [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold |
TIGRFAM ID | |
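The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    return gene_len_bp // 3 - 1


length = gene_length(7291658, 7292599)
print(length, protein_length(length))
```

With the values from this record, the result agrees with the listed Gene Length (942 bp) and Protein Length (313 aa).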
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0206399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.136813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTTTCT GTAACGTCGT TGACCCAACC TGCCACGGGA GACAGGTCGT CACGAAGGGG ACGGTCATCG ACACTCGGCT GTTCACCTGG ATCGGCGGCG CCCAGCTCGA CGGCGTGAAG GGCCTCGTGG TCGAGGCCGC CTGCCTGGCC ACCCACGCCG CGCTCTACCC GGCCGCGGCA CTGCCGCGCC GGCGCCGCGA CGACGACGGA CTCGCCGACC GCTACCGGCT CGCCGGCCTG ACACCATTGC AACGCGGCCT GCTCATCGGC GATCCGATGG CGGCCGGGAC GCCGATCCTG CTGGTGCACG GCCTCGTCGA CAACCGGTCC GTGTTCGCCC GGCTGGAACG CTCCCTGCGC CGCCGCGGCT TCACGACTGT GACCTCGGTG GACATCCCGC TGTTCGCGAC GAGCGTCCAG GCGGCCGCGG CGCAGCTCGC CGAGACCGTC GAGCAGGTCG CCGGCCGCCA CGGCGACACC GGCGTGCACA TCGTCGCCCA CTCGCTCGGC GGCCTGGTGG CCCGCTACTA CGTGCAGCGG CTCGGCGGCG GCGACCACGT GCAGACCCTG GTCACGCTGG CGACGCCACA CAACGGCACC CGGCTGGCGT GCCTGGTTCC GAAGGCCGTC TCCTACCGGC TCGTCAGCCA GCTCCGCCCC GGTTCACCGC TGCTGAGGGA GCTCGTCGAG CCGGCTCCCG GCGTCCGGAC CCGCTTCATC GCCGTCGCCG GCGGCCTCGA CACGGTGGTC CGCCCGGGCG AGGCGGCGCT CACCCATCCC GACCTGGTGA CCGAGAACGT GGTCGTCGAG GGGGCGGGGC ACCACGGCCT GCCGTTCAGC AGCGGGGTGG CGCACATGAT CGCCCGCAGG CTGGCCACGG ACTCCGCCGG CGGTCGCCAA CCGTCACAGA CCACCACGAA AGTGCCACAT CTTTGTGGCT AG
|
Protein sequence | MAFCNVVDPT CHGRQVVTKG TVIDTRLFTW IGGAQLDGVK GLVVEAACLA THAALYPAAA LPRRRRDDDG LADRYRLAGL TPLQRGLLIG DPMAAGTPIL LVHGLVDNRS VFARLERSLR RRGFTTVTSV DIPLFATSVQ AAAAQLAETV EQVAGRHGDT GVHIVAHSLG GLVARYYVQR LGGGDHVQTL VTLATPHNGT RLACLVPKAV SYRLVSQLRP GSPLLRELVE PAPGVRTRFI AVAGGLDTVV RPGEAALTHP DLVTENVVVE GAGHHGLPFS SGVAHMIARR LATDSAGGRQ PSQTTTKVPH LCG
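The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the pipeline that produced the record. It assumes the standard codon assignments (which translation table 11 shares), treats an initial ATG, GTG, or TTG as methionine (a subset of the start codons table 11 allows), and stops at the first stop codon. All names here are hypothetical.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the same
# assignments and differs mainly in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS given as text; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG or TTG is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGCTTTCT GTAACGTCGT TGACCCAACC"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the protein sequence (MAFCNVVDPT). The GTG start explains why the protein begins with M even though GTG normally encodes valine.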