Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4552 |
Symbol | |
ID | 5672899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5431216 |
End bp | 5432514 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243415 |
Product | cyclase/dehydrase |
Protein accession | YP_001508831 |
Protein GI | 158316323 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCG CCACCGCCTA CCTGCTGATT CACGAGGGCA TCCGCGCGGA GACGCGCCGC CTGGCCGACT TCGCCGCGCA GCTGGCCGCC GGCCGCCGCT ACGCCGGACC GGCCCAGCTC ACCGCGCTGC GGACGCACCT CGACGAGGTC GTCAACGTGA TCCATCACCA CCATGTCGGC GAGGACACGC ACCTGTGGCC GCTGCTGCGA CGGTTCGCCG ACCCCTTCGA CCGCGTGGAC GGGCTCGACG TCCTGGACGG GCTCGCCGGT GACCACGACG TACTGGACCC GCTGATGGAA CGGGTCCGCA CGGCACTCGC CCGACTGGCA GCGACCGCCA CCACCGGCGC GTCGGACGAC ACCTCCGACC GGGCCACAGC TGAGGGCTCG GCTGAGGGCT CGGCCGGCGC CGCCGCCGAG TTTGCCGCGG CCGCGACTGT GCTGTTCACC CTGATGGACG AGCACCTTGC CGTCGAGGAG TCCATCGTCG TGCCGATCCT GCGCGAGCGG GTACCGGACG ACGAACTGGC CGCCATGGAG AAGCGGATGC AGCGCGGGAG CAAGATCCGG CTCGGGTTCG TACTGCCGTG GCTCGACGCG GCCGACCCGA CCCGGATGGC CGAGACAGCC GCCCAGCTCG GGCCGGTCTT CCCGGCGCTA CTCACCCTCA CCCGCCGTGG CTACCAGCGC CGCGTCCGCG CCGCCTACGG CGTCACCGCC GCGAGCCCAG GACCGGTCAC CCTGCGAGGA CAGGCCGAGA TCGTCATCGA GGCCACCCCG GAGCAGGTGT ACGAGGCGAT CGCGGACGTC ATCCGGATGG CGCGCCACAG CCCGGAGTGC TACCGCTGCG CGTGGCTCGA CGGCGCGGCT GCTCCACTGC CGGGCGCCCG CTTCCGCGGC TGGAACCGTT TCCGGGGCGC CCGCTGGAGC CGGGAATGCG AGATCGTCAC CGCCGAGCCG GGCGTGGCCT TCGCCTACCG CACCGTGCGT ACCAGTACCA GGCCGGACAG CACGCTGTGG CGCTTCGAGC TGACCCCGAC CGCTGCCGGC ACCCGGCTCC GCCAGACGTT CGAGCTCTCC GGCGCGGCGC CCGTCATGGT GTTCGAGCGG CTGAGCGGCC GCACCACCAG CACTCCGAAG GCGATGGCGC GCACGCTCGC CCGACTGCGG GACGACCTCC GCACCAGGTC GGACCGCGTG GGCGGGGCCG ACATTACCGC CGGATCGGAC ACCGCCCGCG GATCCGAGAC CCGCAGCCGC CTCGGCGAGG ATCTGGTCAG CGCGCGAAGC GGCCAGTGA
|
Protein sequence | MASATAYLLI HEGIRAETRR LADFAAQLAA GRRYAGPAQL TALRTHLDEV VNVIHHHHVG EDTHLWPLLR RFADPFDRVD GLDVLDGLAG DHDVLDPLME RVRTALARLA ATATTGASDD TSDRATAEGS AEGSAGAAAE FAAAATVLFT LMDEHLAVEE SIVVPILRER VPDDELAAME KRMQRGSKIR LGFVLPWLDA ADPTRMAETA AQLGPVFPAL LTLTRRGYQR RVRAAYGVTA ASPGPVTLRG QAEIVIEATP EQVYEAIADV IRMARHSPEC YRCAWLDGAA APLPGARFRG WNRFRGARWS RECEIVTAEP GVAFAYRTVR TSTRPDSTLW RFELTPTAAG TRLRQTFELS GAAPVMVFER LSGRTTSTPK AMARTLARLR DDLRTRSDRV GGADITAGSD TARGSETRSR LGEDLVSARS GQ
|
| |