Gene Information |
Locus tag | Franean1_1751 |
Symbol | |
ID | 5670153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2100152 |
End bp | 2101141 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240672 |
Product | hemolysin A |
Protein accession | YP_001506095 |
Protein GI | 158313587 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1189] Predicted rRNA methylase |
TIGRFAM ID | [TIGR00478] hemolysin TlyA family protein |
|
|
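The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes that `Start bp` and `End bp` are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2100152, 2101141)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```

Both values match the `Gene Length` and `Protein Length` fields of the record.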
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00952938 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000670381 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCCGCA GGATCCGTCT GGACGCGGAG CTCGTCCGGC GCCGGCTCGT CCCCTCCCGG GAACGGGCCG TCGAGGCGAT CGCCGCGGGC CGCGTGCGGG TCGGCGGCGT CACCGCGACG AAGCCGGCGA CGGTCGTCGA CGGTGCGACG TCGATCGTGC TCGCCGTGGA CGACGACCCC GGCTATGCCT CCCGGGGAGC GCACAAGCTG GTCGGCGCGT TCGAGGCGTT CGGCGTCCCA GCGCCCGGGA CGCCCGACCT GTCCGGCGGG CCCGGCCAGC CTGATGTGCC GGATCGGCCT GGTTCGCCCG CGGCGGTGGC GGGGCCGCCG GCGCTCGTGG TCGCCGGGCG GAGGTGCCTC GACGCCGGTG CGTCGACCGG CGGGTTCACC GACGTGCTCC TCCGGTACGG CGCGGCGCGG GTGGTGGCCG TCGACGTCGG ATACGGGCAG CTCGTCTGGC GGCTGCGCTC GGATCCGCGG GTGCGCGTGC TGGACCGGAC GAACGTCCGC AACCTCACGC CCGAGCAGGT CGGGGAGCCG GTGGAGCTGG TCGTGGGCGA CCTCTCGTTC ATCTCGTTGG TCCTGGTGCT GCCCGCGCTG CGCGCGTGCG CCGCGCCGGA CGCCGACTTC GTCCTGCTGG TCAAGCCGCA GTTCGAGGTG GGCCGGGAGT TACTCGGCTC CGGTGGTGTG GTCCGTGATG TGGCCCTGCA CGCTCGGGCG GTGCGCACCG TCGTGACCGC CGCCGAGGGG CTCGGGCTGG GGGTGCGCGG CGTGGCGGCC AGCCCGCTGC CGGGGCCCGC CGGCAACGTC GAGTACCTCG CCTGGCTGCG CGCGGACGTC CGCCCGACTC CAGACGAGGT CGAGGCGATG ATCACCACGG CGATCGAGGC GGGGCCCGCG GGCACGGCGG CCCCCGCCCC AGCGCCGCCG CCGTCGTCCC AGGACGCCGG CCAGGACGCC CCAGCCGACG GCGAAAGGAC CGGCCGATGA
|
Protein sequence | MARRIRLDAE LVRRRLVPSR ERAVEAIAAG RVRVGGVTAT KPATVVDGAT SIVLAVDDDP GYASRGAHKL VGAFEAFGVP APGTPDLSGG PGQPDVPDRP GSPAAVAGPP ALVVAGRRCL DAGASTGGFT DVLLRYGAAR VVAVDVGYGQ LVWRLRSDPR VRVLDRTNVR NLTPEQVGEP VELVVGDLSF ISLVLVLPAL RACAAPDADF VLLVKPQFEV GRELLGSGGV VRDVALHARA VRTVVTAAEG LGLGVRGVAA SPLPGPAGNV EYLAWLRADV RPTPDEVEAM ITTAIEAGPA GTAAPAPAPP PSSQDAGQDA PADGERTGR
|
| |
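The relationship between the two sequences above can be illustrated with a short sketch that strips the 10-base grouping, computes GC content, and translates the coding sequence. It assumes that translation table 11 shares its elongation codons with the standard genetic code and that the first codon (here `GTG`) is a valid start codon read as methionine. The helper names are hypothetical, and only the first 30 bases of the record are used to keep the example short.

```python
# Codon table in the conventional T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is
    reported as 'M' regardless of its usual meaning (GTG is normally V).
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


fragment = "GTGGCCCGCA GGATCCGTCT GGACGCGGAG"  # first 30 bases of the gene
print(translate_cds(fragment))   # MARRIRLDAE
print(gc_percent("GTGGCCCGCA"))  # 80.0
```

The translated fragment matches the first block of the protein sequence in the record. Running the same functions on the full gene sequence would be one way to check the reported protein sequence and GC content.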