Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2055 |
Symbol | |
ID | 5670456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2475404 |
End bp | 2476444 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240977 |
Product | putative glycosyltransferase |
Protein accession | YP_001506398 |
Protein GI | 158313890 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.155199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.594586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAACC TTCTGGCCGT TGTTGTCTCC TATGGTGGAT CACCCCAGCT CCACCGTCTT CTGGCGACCC TGGCCGGGGT TACCGGCTGC CGGGTCGCGC TGGTCGAGAA CCGGCCCGGC ACGGTGCACG AAGACCTGCC CGCGGGCGTT GAGCTGTACG AGGGGCACGG GAATGTCGGA TATGGCACGG CTGTGAACAT CGCCGTGCTC CGTGCGCTGC GGCCCGGCCC GGCGCAGGAG CCGCGGCCCG CGTCCCCGCC CGAGTGGCTG CTCGTCGTGA ACAGCGACGT GACCATGCCT GAGCGGACCA GGGACCTGAT TCCCGATATC CTCGGTTCGA TACCCGCCGA CGCGGACGTG GTGGGTTTCC CGGTGCGGGC CGAGACCGGC CTCTCCGGCC GGGGCACGGC GGTGCTTCCG AGTATCCGTA CGAATGCGTA CACGGCGGTG CGCGGTGAGG CCGCCGCGGT GGAACGCTGG CCCGAGCTCC GCTATCCCGT CGGGGCGTTC TTCGCGATCC GTACCCAGGC GTTCCTCCGG ATGGGCGGTT TCGACCCGTC CTACTGGATG TACTACGAGG AAACGGACCT TTTCGCCCGG CTGCACGCGT CCGGTGGCCG CATCGCGTGG ATCGACGACT GCTGCCATGT CACCCACGTC GGCGGCGGGA CCGTCGGCCG CGCCGGCCTG ATGTACGCCG AACTCGGCCG TTCGGCCGCC ATCTACGCGC GGCGCCACGG CGACACGCTC GGGCGGGGCT GGCTCGCCGT CCACGTGGCG CAGCTGGCGG CGCTGGCGCT GCGCAAGCTC GCGACCGGCC GGACCCACGA CGCGCTGCGC GCGTCGAGGA TCCTGTCCGG CGTCGCGACG GGGCTGGTCC AGCCTCGCTG GGAGCCCGCC ACCCGGTCCC GCTGGCGGGC CGTTCCGGTC GCCACCCGCC GTGACCTGGG CCGGATGGAC ACCGCCGCGC CCGGGCAGGT GCCGGCGCCC CGGGCGTCCC GCGCGGACCG GTCGCCACAG GTGACCGCCG GGTACCGCTG A
|
Protein sequence | MHNLLAVVVS YGGSPQLHRL LATLAGVTGC RVALVENRPG TVHEDLPAGV ELYEGHGNVG YGTAVNIAVL RALRPGPAQE PRPASPPEWL LVVNSDVTMP ERTRDLIPDI LGSIPADADV VGFPVRAETG LSGRGTAVLP SIRTNAYTAV RGEAAAVERW PELRYPVGAF FAIRTQAFLR MGGFDPSYWM YYEETDLFAR LHASGGRIAW IDDCCHVTHV GGGTVGRAGL MYAELGRSAA IYARRHGDTL GRGWLAVHVA QLAALALRKL ATGRTHDALR ASRILSGVAT GLVQPRWEPA TRSRWRAVPV ATRRDLGRMD TAAPGQVPAP RASRADRSPQ VTAGYR
|
| |