Gene Information |
Locus tag | Franean1_5500 |
Symbol | |
ID | 5673831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6659634 |
End bp | 6661001 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244355 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001509761 |
Protein GI | 158317253 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
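The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and that the CDS includes its stop codon (both are assumptions that happen to match the values in this record); the function names are hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(6659634, 6661001)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```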
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0983279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACGC TGCCGGAGGG CTCACTGCGC CCCACGACCA TTCCGCCGCT GCCGGCACCG CCGTCCGCCG CCGGGCAGGT GCCCCTCGTC GGCACCGGGA TGCCCGGGCT GACCGAGCAG TTCGTCCCGT CGGCGACGTC CGAGCGGCTG CTGGCCGGCC TCGCCGACGC GCTGCCCGCG CAGCCCACAC TGGCGGAGCT CGTCGAGACC TCGGGGATGC GCCGCATCCA CATGCTGGCC TGGCGGGATC TGGACGACCC CGAAGCGGGT GGCTCCGAAC TGCACGCCGA CAAGGTCGCC GAGCGGTGGG CCGCCGCCGG CGTCGACGTC AGCCTGCGCA CCGCCGAAGC ACCCGGCCAC CCCGAGACCA CGCGGCGCAA CGGCTACCAG ATCGTCCGCA AGGCCGGCCG GTACTCGGTG TTCCCGCGGA CGGCGACGTC CGGCGCGCTG GGCCGCACCG GCCCGTGGGA CGGCCTGGTC GAGATCTGGA ACGGGATGCC GTTCTTCTCC CCCGTCTGGG CGCGCTGCCC GCGGGTGGTG TTCCTGCACC ACGTCCACGG CGCGATGTGG CGGATGGTGC TCTCCCCCAA GCTGGCCCAG GTCGGCGAGA CCATCGAGTT CAAGGTGGCG CCGCCGCTGT ACCGGCGCAC CCGCATCCTC ACCCTCTCCC AGTCGTCCCG GGACGAGATC ATCGAGCTGC TCGGCCTGCC CGCGGGGAAC ATCTCGGTGA TTCCCCCGGG CATCGACTCC TCGTTCAGCC CCGCCGGGGA GCGCTCCGCA CGCCCGCTGG TGCTCGCCGT CGGCCGGCTG GTGCCGGTGA AGCGGTTCGA CGTGCTGATC GACTCGCTGA TCCGGGCGCA CGACGAGCAC CCCGCGATGG AGGCCGTGAT CGTCGGCGAG GGCTACGAGC GCCCGGCGCT CGAGGCGCGC ATCGCCGCGG CGGGCGCGGG CGACTGGCTG CGGCTGGTCG GCCGGGTGGA CGACGCGGGT CTTCTCGACC TCTACCGGCG TGCCTGGGTG CTCACCTCGG CCTCCGCCAG AGAGGGTTGG GGCATGACGA TCACCGAGGC GGCCGCCTGC GGGACGCCGT CCGTCGCGAC GAAGATCGCC GGGCACACCG ACGCCGTCGC GGACGGCGTG TCCGGCCTGC TGGTCGAGGA CCCGAACGAC CTGGGCAAGA CCCTGGCCGG CGTGCTGTCC GACCCCGAGC TGCGGGCCCG GCTCTCCGCC GGCGCGCTCG CGCACGCGGC GACGTTCACC TGGGAGCACA CCGCCCGCTC GACCTACCTC GCGCTGGTCA ACGAGGCCGC CCGCCGCCGG CTCGTCCGCC GCCCCGCCCC GAGCTCGCGC TCGGGCGCGC CCCGGTGA
|
Protein sequence | MSTLPEGSLR PTTIPPLPAP PSAAGQVPLV GTGMPGLTEQ FVPSATSERL LAGLADALPA QPTLAELVET SGMRRIHMLA WRDLDDPEAG GSELHADKVA ERWAAAGVDV SLRTAEAPGH PETTRRNGYQ IVRKAGRYSV FPRTATSGAL GRTGPWDGLV EIWNGMPFFS PVWARCPRVV FLHHVHGAMW RMVLSPKLAQ VGETIEFKVA PPLYRRTRIL TLSQSSRDEI IELLGLPAGN ISVIPPGIDS SFSPAGERSA RPLVLAVGRL VPVKRFDVLI DSLIRAHDEH PAMEAVIVGE GYERPALEAR IAAAGAGDWL RLVGRVDDAG LLDLYRRAWV LTSASAREGW GMTITEAAAC GTPSVATKIA GHTDAVADGV SGLLVEDPND LGKTLAGVLS DPELRARLSA GALAHAATFT WEHTARSTYL ALVNEAARRR LVRRPAPSSR SGAPR
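The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this on the first three 10-base blocks of the gene sequence and also shows how a GC fraction can be computed. It uses only the Python standard library; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the start-codon set is written from the NCBI table 11 definition as an assumption worth checking against the official listing.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumed initiation codons for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


first_blocks = "GTGAGCACGC TGCCGGAGGG CTCACTGCGC"
print(translate(first_blocks))    # MSTLPEGSLR
print(gc_fraction("GTGAGCACGC"))  # 0.7
```

Passing the full gene sequence to `gc_fraction` and `translate` would be one way to cross-check the GC content and protein sequence fields of this record.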