Gene Information |
Locus tag | Franean1_0751 |
Symbol | |
ID | 5669167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 876266 |
End bp | 877420 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239678 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001505115 |
Protein GI | 158312607 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
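The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; neither is stated in the record, though both agree with the values shown. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(876266, 877420)   # Start bp / End bp from the record
print(length_bp, protein_length(length_bp))
```

With the values in this record, the sketch gives 1155 bp and 384 aa, matching the Gene Length and Protein Length fields.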
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00694641 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000111123 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCGCCG CACCGCGACA CGGTGTCGGT GTTCGTGCCG ACACGGCTGC TCGTGCCGAC ACGGCCGTTC GTGCCGACAC GGCCGGCACG AGTCGTGAGC CACGGCCGGT CGTGGATGTG CTCCTGAGCC GGGGCCTCGA CCCCTATGAC TGGGAACGTC GACACGCCCG GGGCGAGGTT CCGGACGTCT GGCCGTACGG GCTGAACAGG CTTGCCGACC ATGGCTTTCG GACCAGCGGG TCGCTGCCGG CCGTGAGCCA CCGGGCGCCG GGCGCGAGGG CCACCCGTGC GCTGGGGGGC TTCGAGTGGC GGGAGGTCGC CGCGCGTCGG ATCGACGCGG CGGCGGAGGC GGTGCTGTGC TGGGACGAAC GGGCCGGGGT ACCGGCTGCC TCGATGGAGC GGTGGCGGGG CGGCAGACCG GTCGCCACCG GCGTGATCTG GCTGACCGAC TCGGTGGGCC GGCCGCGGGG GGCACTGGCG TTGGCGCCCC GCGCGCTGCG TTCGGCCCAG TTGGTCTGGG CGCTGTCGAG TGCCCAGCTC CCGGTGCTCC GGGACGTCCT GAAGGTCGCC GAGCGCAGAC TCGCCCATCT GCCCTTCGGT ATCGACGCGG ACTTCTTCCG GCCGGCCGGA ACCGATCCGG TGCCCGGTCT GGTGGTCAGC GTCGGAAATG ACCGGCATCG GGACCATGAT ACGCTTCTCG CTGCCATCGC GGACGCGGCT GGAAAAGTCA CTGGGCTGCG CCTGGAACTG GTGACCGGTC GAGAGGTGAC GATTCCCGTA AGGCTCGGGA AGTGGCATCC GAGAATGAGC CACGTCGATC TTGTTGGTCT CTATGCCCGC GCCTCTGTGG TGGTGGTTGC GCTGCGGCCG AACCTACATG TCAGCGGCGT CAGTGTCGTT CTCGAGGCCA TGGCCTCGGG CCGCCCGGTT GTGGTTACCG AAACCCCGGG AATGTCCGAC TACGTCGATC ACGGGCGTAC AGGGCTGCTT GTTCCGCCGG GTGATTCCGG CGTCCTTGCC GCGGAGCTGG CGGGGCTCCT GCTTGACCCC GACCGGGCCG CGGCAATGGG GCGGGCCGGC CGCCAGGCGG TCGAGACGAC GTTCAACACC GGCGCGCAGG CGGGCCGGTT GGCCGGTCTG TTGCGAGGGA TGTGA
|
Protein sequence | MTAAPRHGVG VRADTAARAD TAVRADTAGT SREPRPVVDV LLSRGLDPYD WERRHARGEV PDVWPYGLNR LADHGFRTSG SLPAVSHRAP GARATRALGG FEWREVAARR IDAAAEAVLC WDERAGVPAA SMERWRGGRP VATGVIWLTD SVGRPRGALA LAPRALRSAQ LVWALSSAQL PVLRDVLKVA ERRLAHLPFG IDADFFRPAG TDPVPGLVVS VGNDRHRDHD TLLAAIADAA GKVTGLRLEL VTGREVTIPV RLGKWHPRMS HVDLVGLYAR ASVVVVALRP NLHVSGVSVV LEAMASGRPV VVTETPGMSD YVDHGRTGLL VPPGDSGVLA AELAGLLLDP DRAAAMGRAG RQAVETTFNT GAQAGRLAGL LRGM
|
| |
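The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a minimal sketch. It assumes that the codon assignments of translation table 11 match the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position of a CDS. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only short fragments of the sequence are used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, also used by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str, cds_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and cds_start and codon in START_CODONS:
            protein.append("M")  # a start codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGACCGCCG CACCGCGACA CGGTGTCGGT"))
# Last 15 bases of the gene sequence; not a CDS start, so TTG is read as Leu.
print(translate("TTGCGAGGGA TGTGA", cds_start=False))
```

The first call reproduces the opening residues of the protein sequence (MTAAPRHGVG) and the second reproduces its final residues (LRGM). Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.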