Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0759 |
Symbol | |
ID | 5669175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 885942 |
End bp | 886964 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239686 |
Product | glycosyl transferase family protein |
Protein accession | YP_001505123 |
Protein GI | 158312615 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0160185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGTGC TGGTCGTGTC CTACAACACC GCCGAGCTGA CGGTCCGGTG CCTGGAGTCC GTCCTGGCTG AGCCGGCCGG GGCCGAGATT GAGATCATCG TCGTAGACAA CGCCTCGACG GACGGCTCCC CGGACGCCAT TCGGGACGCG TTCCCGGCGG TGCGGCTGAT CGAGTCGGCG ACGAACCTCG GATTCGGCCG CGCGATCAAC CTGGCGGCCT CCCACGCCGG CGGGGACTAT CTGCTCCTGC TGAACCCCGA CGCCGTCGTC CTCGATCGGG CGGTCGCCGA GATCCTCACC TTCGCCCGCG GGAACCCGGC CTGCGGGCTC TACGGCGGGC GGACGCTGCG GCCAGACGGA TCGGTCGACC CGAGCTCCTG CTGGGGGGCA CCCAGCCTGT GGAGCCTGGC GTGCTTCGGG GTCGGCCTGT CGACGATGTT CCGCGGATCG AGGATCTTCG ATCCGGAGTC GCTCGGGCGC TGGCAGCGCG ACAGCGTCCG CGAGGTCGGT GTCGTCACCG GATGCCTGCT GCTGGTCAGA CGCGACCTCT TCGAACGGCT GGGCGGGTTC GACCCCCGGT TCTTCATGTA CGGGGAGGAC ACCGACCTGT CGATGCGGGC GCGGGCCGCC GGCTACCGCC CCACGATCGT CCCGACCGCG GCGATCGTGC ACCACGTCGG GGCCTCGTCC TCGAACTGGG CGGCCAAGCA CGTGCTCGTC CTCCGCGGGA AGACCACGTT GGCGAGGAAG CACTGGACGG GGTGGCGGCA GGGGTTCTGC CTGGCGATGA TCGTTCTTGG GGTGGCGCTG CGGGCCCTGG CCGATCTGGG GTCGGGGCTG GCCCGTGGGC GGCCGCGGGC GCGTGCGAGC GACTGGCGCG CGCTCTGGCG CCGGCGGCGG GACTGGTGGC CCGGCTACCC CCCGTACACC GAGCCCACGG GCGGCGTCCC CAGCCAGTCG CGCGTCCCGC CCGAGCCCGC ATCCGGCACG TGTCACCCGT CCCGACATCT GACGAGGCCA TGA
|
Protein sequence | MSVLVVSYNT AELTVRCLES VLAEPAGAEI EIIVVDNAST DGSPDAIRDA FPAVRLIESA TNLGFGRAIN LAASHAGGDY LLLLNPDAVV LDRAVAEILT FARGNPACGL YGGRTLRPDG SVDPSSCWGA PSLWSLACFG VGLSTMFRGS RIFDPESLGR WQRDSVREVG VVTGCLLLVR RDLFERLGGF DPRFFMYGED TDLSMRARAA GYRPTIVPTA AIVHHVGASS SNWAAKHVLV LRGKTTLARK HWTGWRQGFC LAMIVLGVAL RALADLGSGL ARGRPRARAS DWRALWRRRR DWWPGYPPYT EPTGGVPSQS RVPPEPASGT CHPSRHLTRP
|
| |