Gene Information |
Locus tag | Franean1_6554 |
Symbol | |
ID | 5674869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7971427 |
End bp | 7972947 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641245403 |
Product | O-antigen polymerase |
Protein accession | YP_001510797 |
Protein GI | 158318289 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
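The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported gene length) and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 7971427
end_bp = 7972947

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last one is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1521 0 506
```

The result matches the reported Gene Length (1521 bp) and Protein Length (506 aa).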
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.408434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCG ATTGCCTCGC CGATCCGGTC ACCCCGGGTA CCCGGCCGTG GGGCGGCGAG GCGCCCCGTT CTCGCCGCGG CGACATACTC GCCTCTCCGG TCAGCCAGAC GTTTGCCGCG GTGGTGCTCG GCACCGCCGC GGTCGCCGCG GCGGTGCTGC GTGGCCCGGT CGGCGCCGCG GCAGTGCTGG CCGTGCCCGC ACTGGTAGTA CCGCTGCTGC TCAGCCACCG GGACGCGGTG AGCCTACTGA CGTTGTTCGT CTTCGCCCTC TTCTCGGTGC CGGTCTCCTA CCGGCTCGGG CCCGCCGGTG CGCTCGCCGT CCCGATCGGC GTTGGTTGCC TGGCGTGCTG GCTCCGGCAT CGAGCCGCAC CGCGTACCCG CCTCGACCCG GCGTTCCAAC CGGTGCGGGC GGCGACGCTC GTGCTTCTGT GGTTGTTCAC TCTCAGCTTC GCGGTGGCGT TCACCCGGAT CACCACACCG CTGGAGATCC GCTCGGCTGA GCGTTACCTC GTCATCGCCA CCGCGCAGAG CGGGGTGACG CTGGTGGCCG CAGACATGAT CGGGAACCGG GCCAGGCTCG ACACCATCCT GCGCCGGATC GTGCTGGGCG CGACCTTCAT GGCCGTCATC GGTGGAATCC AGTTTCTGAC GGATCGTGAC TACAGCAGCA TCCTGGTGCC CCCGGGCATG TCGGTCGGCC CTTCGCCCCG CGAGGTGATC GAGCTGCGTT CCGACTTCCG GCGGGTCGCC GGTACCGCCG GCCACCCGAT CGAGTTCGGG GTCGTTCTGG CCATGATCCT GCCGTTGGCG CTGCACTACG CGTGCGTTAC TCGCGGGCAA GCGGCGCGGG TCTGGGCGTG GGCGCAAGTC GCTGTGATCG GCGCCGCCAT CCCGACCAGC ATCTCCCGAA GCGCGGTCCT GAGCTTTGCG ATCGGGATCA CGGCTTTCCT CACCGTCCGG GGCTACCGGC GGATCCTGCA CGGCCTGCTG GCGCTGGCCC TGTTCCTGTT CATCTTGCAC GAGATGTTTC CGGGCCTGCT GGAGGAAATC GTGTCGCTGT TCCTCGGCGC GAATAAGGAC CCGAGTGTCG CCGGCCGCAC GGAAGACTAC GCGGCGGTGT GGGAACTGAT CCTGCGACGG CCGTTTCTCG GCCTGGGAAT CGGCACGTTC ATCCCGCAGC AGTACTTCTT CCTCGACAAC CAACTCCTCG GTTCAGTGCT GGAGACCGGC GTCCTCGGTA CCGTGGTCCT ACTGGGCTGG CTCGCCGTTG GTCTGTCCGT GAGCCGCGGA GTTCGGCGCC GGGCGCGCAC AGCGCGCGAC CGGGAACTCG GCCAGACACT GGTGGCCTCG ATTCTGGCGG GTTTCGCCGG CTTCCTCACC TTCGACGCGC TCGGTTTCGC GATCTTCAGC GGCCTGCTGT TCCTACTGGT CGGATGCGCG GGAGCGCTCT GGCGTATGAC GGCATCACCC GAAACACTCA CGCCATCGCC GCAGGCGCTG GCCGCGACGG GCCGGTCATG A
|
Protein sequence | MTADCLADPV TPGTRPWGGE APRSRRGDIL ASPVSQTFAA VVLGTAAVAA AVLRGPVGAA AVLAVPALVV PLLLSHRDAV SLLTLFVFAL FSVPVSYRLG PAGALAVPIG VGCLACWLRH RAAPRTRLDP AFQPVRAATL VLLWLFTLSF AVAFTRITTP LEIRSAERYL VIATAQSGVT LVAADMIGNR ARLDTILRRI VLGATFMAVI GGIQFLTDRD YSSILVPPGM SVGPSPREVI ELRSDFRRVA GTAGHPIEFG VVLAMILPLA LHYACVTRGQ AARVWAWAQV AVIGAAIPTS ISRSAVLSFA IGITAFLTVR GYRRILHGLL ALALFLFILH EMFPGLLEEI VSLFLGANKD PSVAGRTEDY AAVWELILRR PFLGLGIGTF IPQQYFFLDN QLLGSVLETG VLGTVVLLGW LAVGLSVSRG VRRRARTARD RELGQTLVAS ILAGFAGFLT FDALGFAIFS GLLFLLVGCA GALWRMTASP ETLTPSPQAL AATGRS
|
| |
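The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned, checked for GC content, and translated follows. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, whose amino acid assignments translation table 11 shares; stop codons are rendered as `*`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGACCGCCG ATTGCCTCGC CGATCCGGTC")
print(translate(prefix))  # MTADCLADPV
```

The translated prefix agrees with the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the reported GC content.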