Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5300 |
Symbol | |
ID | 5673634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6376412 |
End bp | 6378109 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641244157 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001509564 |
Protein GI | 158317056 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.225407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.649548 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACAG CCGCGCCCCG GCCCGTTCAC GCCCTCACCG GGCGTCACCT GGTCTTCCTC AACTGGCGCG ACAACGCCCA TCCGCAGGCC GGCGGGGCGG AGCTGTTCTG CCACTCGGTC GCGGAGCGGT TCGCCGCTGC CGGCGTCCGC GTCACCCTGC TCACCTCCCG CCCGCCGGGC GCCGCCGCCG CGACCACGGA CGGCGGGGTC GCCGTGCGCC GCGGCGGGGG CACCTTCGGG GTGTACCCGT CGGTGCTCGC CCGGCTGGCG CGGATGGTCC GCTCCGGGGA GCGGGTCGAC GCCGTCGTCG ACTGCCAGAA CGGCATCCCG TTCTTCAGCC CGCTCGTGCT GCCGAGCCGG ATTCCGGTGG TGCAGGTGCT GCACCACGTC CACCAGAAGC AGTTCCCGCT GTACTTCCCG CGGCCGGTGG CGCGGATCGG CCAGCTACTC GAGACCCCGG GCAGCCGGTG GGTCTACGGC CGCCGGCCGG TGGCCGTGGT CTCGCCGTCC ACCAGGGACG AGGCCCGTGG GGTGCTGGCG CTGCCCGGGG CCCGGTTCCT CGTCCCGAAC GGCGTCACCA TCGCCGGCGG TGATGGTGAC GCCGGCGGCG ATGCCGTCGC CTCCGGCGGC GCGGGCGGGA CGGACGGCCC GTTCGGTGCC GATGGCGCCA TCGGGGCGAG GGCGGCGGCG CCCACGATCG TGTGTGTGGG CCGGCTCGTC CCGCACAAGC GCCTGCACCT GCTGATCGAG GCGCTGCCCG TGCTGGTCGG GCGGCACCCG GGCCTCAGCC TGCACCTCGT CGGCGACGGG CCGGACCGCC GCCGCCTCGC CGACACCGCC GCCCGGTTGA TGCTCACACA GGGCGACGGC TCGTCGGACG CGACCGTGCG CTGGCACGGA TTCGCGGCTC CCGAGGTCCG CGACGCCGTG CTGGCGTCGG CCTGGCTGAC GGTGAACCCC TCCCACGGCG AAGGATGGGG CCTGTCGGTA CTCGAGGCGA ACGGGATGGG GGTACCGGCG GTCGCGTTCC GGGTCCCGGG ACTGCGCGAC TCCGTCCGCG ACGGGGTGAC CGGCTGGCTG GTGGACGAGC CCGGGCAGCT CTCCGACGCG GTCGACCGCG CGTTGACCCT GCTCGCCGAC CCGGCGCGGG CCGGGGAGAT CCGCGCGGCC GCGCGGGCGT GGGCCGGCGG CTTCAGCTGG GACACCAGCG CCGATCTGCT GGCCGCCGTC ATCGGCTCCG AGCTCGACCG GCTCGCCGGG GCCGGCGCGG GCGGCCGCTC GGCCGTGGGC CCGGGCGAGC CTCCGGCCCG GCACGCGCCA GCACGCCCAC TCGCCACCCG CCCGCCGCGG GACCGGCGGC GCCGCGACGA CCAGGCGACC TGGGTCGAGT TCGATCTGGC GCCGGGTGCG GAGGTTCCCG TGCTGCGCCG GACCGATCTG GTCTTCGAGG TCGCCCCGGA ATCCGGTGCG GCCGGCCCGG AGCCGGGGCT GCCGGGCCCG GGGCGCCGGT TCGTCGCGCT GTTCTACGGG GCCGACAGCA CCGGGGCCCG CACCGCGCTG GCCCGTCGCG GGCTGCGGCC GGCGCAGCGG CCCCGCGCGG CCACCGGCGA GGACCTGCTC CTCGCCGCCA CGCACGCCGG CCCGAACCAC GCCAGCGCCA GCGCCAGCGG CGTACGGCTA CGGGACATGG CCGGCTGA
|
Protein sequence | MTTAAPRPVH ALTGRHLVFL NWRDNAHPQA GGAELFCHSV AERFAAAGVR VTLLTSRPPG AAAATTDGGV AVRRGGGTFG VYPSVLARLA RMVRSGERVD AVVDCQNGIP FFSPLVLPSR IPVVQVLHHV HQKQFPLYFP RPVARIGQLL ETPGSRWVYG RRPVAVVSPS TRDEARGVLA LPGARFLVPN GVTIAGGDGD AGGDAVASGG AGGTDGPFGA DGAIGARAAA PTIVCVGRLV PHKRLHLLIE ALPVLVGRHP GLSLHLVGDG PDRRRLADTA ARLMLTQGDG SSDATVRWHG FAAPEVRDAV LASAWLTVNP SHGEGWGLSV LEANGMGVPA VAFRVPGLRD SVRDGVTGWL VDEPGQLSDA VDRALTLLAD PARAGEIRAA ARAWAGGFSW DTSADLLAAV IGSELDRLAG AGAGGRSAVG PGEPPARHAP ARPLATRPPR DRRRRDDQAT WVEFDLAPGA EVPVLRRTDL VFEVAPESGA AGPEPGLPGP GRRFVALFYG ADSTGARTAL ARRGLRPAQR PRAATGEDLL LAATHAGPNH ASASASGVRL RDMAG
|
| |