Gene Information |
Locus tag | Franean1_0542 |
Symbol | |
ID | 5668959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 628130 |
End bp | 629374 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239469 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001504907 |
Protein GI | 158312399 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCTGC GTTATGGCTT CCTCAGCACC TATCCGCCCA CCCAGTGCGG TCTCGCGACC TTCACGGCCG CCCTGTTCGA CGAGCTGAAC AGTCCAGCGC CCGGTGCGTC CAGTGGGGTG GTGCGCCTCC TGGACGCCAC CGACCGAGCC GGTGTCGTGC GAGCTGGTGC CACGCGAGCC GGTGCCACGG CCGAGCCGGC CGAGCCGGTG GCCAGACCAC CGGGTCGATC CGGTGGGGCC GGCCGGCCGG TTCTGGTCGG AGATCTCGTC GCCGGCACGC CGGGCGGTCC GCGTGCGGCG GCACGGCTGC TGAACGGCTT TGACGTCGTC GTGGTGCAGC ACGAGTACGG CGTGTATGGA GGTCCGGACG GCGACGAGGT GCTCGCGGTG CTCGACGCCC TCGACGTTCC GGTGATCGTC GTCCTGCACA CCGTGCTGGT GAAGCCGACC TCGCACCAGC GGCATGTCCT GGAGTCCGTG GTCGCGTCGG CGGACGCGGT GGTCGTGATG ACCGAGACCG CGCGGATCCG GCTCGTCGAG GGTTTCCAGG TCCATCCACG ACGGGTGGTG GTCATCCCGC ACGGCGCGGC GGACAACCGC CGGGCGCCAG CGGAGCACAG CGGCGGGCCG ACCATCCTCA CCTGGGGGCT GATCGGGCCC GGGAAGGGCA TCGAGTGGGG CATCGCCGCG ATGGCGGACC TTGCCGACCT CGATCCCGCC CCGCACTACG TCATCGCGGG CCAGACCCAT CCGAAGGTCC TCGCGAGGGA GGGCGAGGCC TACCGGGAGG GGCTGGCCGC CCGGGTCCGC GACCTCGGCC TGACCGGCTC GGTCAGCTTC GACGATCGTT ACCTGGACCC GGTGTCCCTC ACGGAGCTCG TGCGCCAGGC CGACGTCGTC CTGTTGCCGT ACGACTCGGT CGACCAGGTG ACCTCCGGTG TCCTCATCGA GGCGGTCGCC GCGCTCCGGC CGATCGTCGC CACCCGGTTC CCGCACGCGG TCGAGCTCCT CGGCGACGGC AGCGGACTGC TCGTGCCGCA CCGGGACCCA GCGGCCATCG CCGCCGCGGT GCGCCGCATA ACGACAGACG AGACGGTGAG CGCCGGCCTG GCCAGCGCCG CGGCCGTCCA GGCCCCCGAC CTGCTGTGGC CGGCGGTCGC CGGCCGCTAC CGGCGGCTGG CCGCCGGACT GGTCGCCCGC ACCGGGAGCC GGCCGTCGAC CGCGCCCGTG CCGGTGGCCC GGTGA
|
Protein sequence | MPLRYGFLST YPPTQCGLAT FTAALFDELN SPAPGASSGV VRLLDATDRA GVVRAGATRA GATAEPAEPV ARPPGRSGGA GRPVLVGDLV AGTPGGPRAA ARLLNGFDVV VVQHEYGVYG GPDGDEVLAV LDALDVPVIV VLHTVLVKPT SHQRHVLESV VASADAVVVM TETARIRLVE GFQVHPRRVV VIPHGAADNR RAPAEHSGGP TILTWGLIGP GKGIEWGIAA MADLADLDPA PHYVIAGQTH PKVLAREGEA YREGLAARVR DLGLTGSVSF DDRYLDPVSL TELVRQADVV LLPYDSVDQV TSGVLIEAVA ALRPIVATRF PHAVELLGDG SGLLVPHRDP AAIAAAVRRI TTDETVSAGL ASAAAVQAPD LLWPAVAGRY RRLAAGLVAR TGSRPSTAPV PVAR
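
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one terminal stop codon (the gene sequence ends in TGA). The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the record above.
start_bp = 628130
end_bp = 629374
gene_length_bp = 1245
protein_length_aa = 414


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints: 1245 414
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields. The gene sequence begins with GTG while the protein begins with M, which is consistent with GTG serving as an initiation codon under translation table 11.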