Gene Information |
Locus tag | Franean1_1269 |
Symbol | |
ID | 5669682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1526935 |
End bp | 1527825 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240201 |
Product | acetyltransferase |
Protein accession | YP_001505629 |
Protein GI | 158313121 |
COG category | [R] General function prediction only |
COG ID | [COG0110] Acetyltransferase (isoleucine patch superfamily) |
TIGRFAM ID | [TIGR03570] sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family |
|
|
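The coordinate and length fields above are mutually consistent, and a short sketch can confirm this. It assumes 1-based inclusive coordinates (an assumption, though it reproduces the listed 891 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1526935, 1527825)
print(length)                     # 891
print(protein_length_aa(length))  # 296
```

Both values match the Gene Length and Protein Length rows in the record.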
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.83932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.337234 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATAG TAATCGAGTT GCGGGAAACG GTCGGCGATA CGGCGCATCC CGGGGCGGCA CACCCTCCGG CCAGGCGACA CCACCCGACG GGATCGACTA CTCAGAGTGT CCGATTGAAC GGGCGCCGTC ATCGCTCTAC GGTCGGCCCC GTGCCAGACC TCGGACGCCA GCTCGTCATC GTGGGAGCCG GCGGCCACGG CCGCGAGCTC CTCGACCTCG TCGAGGCGGT CAACGCCGCC AGCGACCGGG ACATCTACCG GTTCCTCGGG TTCCTCGACG ACGGCGAGGT CGATCCCGAC CCGCTCGCCC GGCGCGGGAG CACCGTGCTG GGCGGGCTCG ACCTGCTGGC CAGCCTCGAG GCCGACTACC TGATCGGCGT CGGCGCAGCG GCCTCACGGG CCCGCATCGA CCGGTACGCC AGCGCGTGCG GACGGCGCCC GGCCACGATC GTCCATCCCC GCGCCCACCT GGGCGGCGAC GTGAAGCTCG GGCCGGGCAC CGTGGTCTGC GCGTTCGCGA GCCTCACCAC CAACATCGAG ACGGGCCGGC ACGTCCTGGT GAACATCGGC GCCGGCATCG CGCACGACTG CCGCCTCGGC GACTACGCGA CGCTCGCCCC CGGCGCCCGG ATCGGCGGTG CCGTCGAGGT CGGCCCCGGC GCCTGGGTGG GGATGCAGGC CAGCGTCGTG CGGAGCCGGC GGATCGGTGC CGGGGCGGTG GTCGGCGCCG GGGCGGTCGT CACCGGCGAC GTGCGCCCCG GACTCGTCGT GGCCGGGGTG CCGGCCCGCC CGATCGACCC GTCCGACGCC GGTGCGGACG CCGCCCGGCC ACCGATAGTG GAACCGAGGA CGCCCGAACC CCAGTCCGGG GCACCCCGAG CCGACCTATG A
|
Protein sequence | MLIVIELRET VGDTAHPGAA HPPARRHHPT GSTTQSVRLN GRRHRSTVGP VPDLGRQLVI VGAGGHGREL LDLVEAVNAA SDRDIYRFLG FLDDGEVDPD PLARRGSTVL GGLDLLASLE ADYLIGVGAA ASRARIDRYA SACGRRPATI VHPRAHLGGD VKLGPGTVVC AFASLTTNIE TGRHVLVNIG AGIAHDCRLG DYATLAPGAR IGGAVEVGPG AWVGMQASVV RSRRIGAGAV VGAGAVVTGD VRPGLVVAGV PARPIDPSDA GADAARPPIV EPRTPEPQSG APRADL
|
| |
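The gene and protein sequences above can be related with a minimal sketch. It assumes the gene sequence is shown 5' to 3' in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons, so the standard codon-to-residue mapping is used. The helper names are hypothetical. Only the first 30 bases of the record are used to keep the example short, so the GC fraction printed here describes that prefix, not the 74% reported for the whole gene.

```python
from itertools import product

# Codon table in NCBI "TCAG" order; residue assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
PREFIX = "ATGCTGATAG TAATCGAGTT GCGGGAAACG"
print(translate(PREFIX))              # MLIVIELRET
print(round(gc_fraction(PREFIX), 3))  # 0.467
```

The translated prefix matches the first block of the protein sequence, MLIVIELRET. Running the same functions on the full gene sequence is one way to cross-check the listed protein sequence and GC content.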