Gene Information |
Locus tag | Franean1_5172 |
Symbol | |
ID | 5673506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6205118 |
End bp | 6206626 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244026 |
Product | undecaprenyl-phosphate galactose phosphotransferase |
Protein accession | YP_001509436 |
Protein GI | 158316928 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
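
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes the coordinates are 1-based and inclusive (this matches the listed gene length but is not stated in the record) and that the final codon is a stop codon that is not translated. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 6205118
end_bp = 6206626

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that yields no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1509
print(remainder)       # 0
print(protein_length)  # 502
```

Both results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.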
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00967107 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCTG GCACGCAGTT CGACTTCGCC CACGTCGAGA CCTCGCCCGA CGCGACCAGC TACCGGGCCC AGGTCGGCTG GGAGCGGCGC TACGTCCGGC TGCTGGTGCT CTTCGACGCG ATCGCCTGTG TGATCTCCGC GGGCCTGGCC TACTTCGTCC GCTTCGGGGA CGTCGTCGAC TTCGACACCG AGCCGCCCTC CTCGAAGCCG TACATCATCA TGACGGTCCT GCTGCCGCTG GCCTGGGTGC TGGCGATGTC GCTCAACCGC GCCTACGAGA GCCGCTTCCT CGGCGGCGGG TCGGAGGAGT TCCGGCGGGT CGTCAACGCC GCCGCCCGGC TCACCGCGCT GGTCGCCGTC GCCTCCTACG CGACGAAGGC CGAGATCGCC CGTAGCTACG TGCTCATCGC CTTCCCGGCC GCCACGCTGC TGTCGGTGGC CGGCCGCTCC GCCGGGCGCG GCATCCTGCA CCGCATGCGG CGGCAGGGGC GTTGCCTGCA CCGTGTCCTC GTCGTGGGGG CCGGCGAGTC CGCGGCCACC CTCGTCCGGC TCGCCCAGCG CGACCCGACC ACCGGCTGGT CCGTCGTCGG TGTCTGCCTG GACCGCCTGC CCGGCCGGCA CAGCCACGAC CGCCCCGAGC GCAGCGGGTT CGACCTGCTC GGCGTGCCGA TCGTCGGCAC CTCGGAGAAC CTGCACACCG CCATCCAGGC GACCCACGCG ACCACCGTCG CGATCGGCCC GCAGATGGAC GGCGAGACAC TGCGCCGGGT CCTGTGGGCC CTCGAGGGCA GCGACGTGGA CGTCCTGGTC AGCTCGGCGC TGACGGACGT GACCGGGCCG CGGATCTCGA TCCGGCCGGT GGCCGGCCTG CCGCTGCTCC ACATCGAGGA GCCCGAGCTC AGCGGCACGC GCCGGCTGAT GAAGATGGCC TTCGACCGGA TCGTCGCCGG CACCGCGATC CTGCTGTTCG CTCCGCTGCT GATCGGGCTC GGCCTGGCGG TGCGGTTCAC CAGCCGCGGT CCGGCGATCT TCAAACAGAT CCGGGTCGGG CGCGGCGGCA GCGAGTTCCG GATGTACAAG TTCCGTTCGA TGTATGTAGA CGCCGAGCAG CGCAAGGCCG AGCTCGAGTC GAGCAACGAG CGCGCCGAGG GCCTGCTGTT CAAGATGCGG GACGACCCTC GGATCACCAA GGTGGGCAAG TTCCTCCGGA AGTGGTCGCT CGACGAGCTG CCCCAGCTGT TCAACGTGGT CAACGGCAGC ATGTCGCTGG TCGGCCCGCG CCCGCCGCTG CCCTCGGAGG TCGCGCGCTA CGAGGACGAC GTCTACCGCA GGCTGATGGT CAAGCCCGGC CTCACCGGGC TGTGGCAGAT CAGCGGGCGC AGCGACCTGG AGTGGAACGA GTCCGTCCGG CTCGACCTGC GCTATGTCGA GAACTGGTCG CTGGCCATGG ACTTCGTCAT CCTGTGGCGC ACGCTGTTCG CCGTGCTGCG CCGCGAGGGC GCCTACTGA
|
Protein sequence | MPAGTQFDFA HVETSPDATS YRAQVGWERR YVRLLVLFDA IACVISAGLA YFVRFGDVVD FDTEPPSSKP YIIMTVLLPL AWVLAMSLNR AYESRFLGGG SEEFRRVVNA AARLTALVAV ASYATKAEIA RSYVLIAFPA ATLLSVAGRS AGRGILHRMR RQGRCLHRVL VVGAGESAAT LVRLAQRDPT TGWSVVGVCL DRLPGRHSHD RPERSGFDLL GVPIVGTSEN LHTAIQATHA TTVAIGPQMD GETLRRVLWA LEGSDVDVLV SSALTDVTGP RISIRPVAGL PLLHIEEPEL SGTRRLMKMA FDRIVAGTAI LLFAPLLIGL GLAVRFTSRG PAIFKQIRVG RGGSEFRMYK FRSMYVDAEQ RKAELESSNE RAEGLLFKMR DDPRITKVGK FLRKWSLDEL PQLFNVVNGS MSLVGPRPPL PSEVARYEDD VYRRLMVKPG LTGLWQISGR SDLEWNESVR LDLRYVENWS LAMDFVILWR TLFAVLRREG AY
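
The gene and protein sequences above can be related with a short translation routine. The following is a sketch under stated assumptions, not the database's own pipeline. It uses the codon-to-residue mapping of translation table 11, which is the same as the standard code for elongation codons, and it treats a leading GTG, TTG, or ATG as a start codon read as M (a simplified subset of the start codons that table 11 allows). The function name `translate_cds` and the constants are hypothetical. Only the first three blocks and the last nine bases of the listed gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A start codon at the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as listed in the record.
head = translate_cds("GTGCCCGCTG GCACGCAGTT CGACTTCGCC")
# Last 9 bases of the gene sequence (in frame, since 1500 is divisible by 3).
tail = translate_cds("GCCTACTGA")

print(head)  # MPAGTQFDFA
print(tail)  # AY
```

The two fragments reproduce the first block (MPAGTQFDFA) and the final two residues (AY) of the listed protein sequence, which indicates that the gene sequence is shown in the coding direction even though the gene lies on the minus strand.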