Gene Franean1_5172 details

Gene Information

Locus tag: Franean1_5172
Symbol:
ID: 5673506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6205118
End bp: 6206626
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 71%
IMG OID: 641244026
Product: undecaprenyl-phosphate galactose phosphotransferase
Protein accession: YP_001509436
Protein GI: 158316928
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
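
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6205118
end_bp = 6206626

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1509
print(protein_length)   # 502
```

Under these assumptions both derived values agree with the reported Gene Length (1509 bp) and Protein Length (502 aa).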


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00967107
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCTG GCACGCAGTT CGACTTCGCC CACGTCGAGA CCTCGCCCGA CGCGACCAGC 
TACCGGGCCC AGGTCGGCTG GGAGCGGCGC TACGTCCGGC TGCTGGTGCT CTTCGACGCG
ATCGCCTGTG TGATCTCCGC GGGCCTGGCC TACTTCGTCC GCTTCGGGGA CGTCGTCGAC
TTCGACACCG AGCCGCCCTC CTCGAAGCCG TACATCATCA TGACGGTCCT GCTGCCGCTG
GCCTGGGTGC TGGCGATGTC GCTCAACCGC GCCTACGAGA GCCGCTTCCT CGGCGGCGGG
TCGGAGGAGT TCCGGCGGGT CGTCAACGCC GCCGCCCGGC TCACCGCGCT GGTCGCCGTC
GCCTCCTACG CGACGAAGGC CGAGATCGCC CGTAGCTACG TGCTCATCGC CTTCCCGGCC
GCCACGCTGC TGTCGGTGGC CGGCCGCTCC GCCGGGCGCG GCATCCTGCA CCGCATGCGG
CGGCAGGGGC GTTGCCTGCA CCGTGTCCTC GTCGTGGGGG CCGGCGAGTC CGCGGCCACC
CTCGTCCGGC TCGCCCAGCG CGACCCGACC ACCGGCTGGT CCGTCGTCGG TGTCTGCCTG
GACCGCCTGC CCGGCCGGCA CAGCCACGAC CGCCCCGAGC GCAGCGGGTT CGACCTGCTC
GGCGTGCCGA TCGTCGGCAC CTCGGAGAAC CTGCACACCG CCATCCAGGC GACCCACGCG
ACCACCGTCG CGATCGGCCC GCAGATGGAC GGCGAGACAC TGCGCCGGGT CCTGTGGGCC
CTCGAGGGCA GCGACGTGGA CGTCCTGGTC AGCTCGGCGC TGACGGACGT GACCGGGCCG
CGGATCTCGA TCCGGCCGGT GGCCGGCCTG CCGCTGCTCC ACATCGAGGA GCCCGAGCTC
AGCGGCACGC GCCGGCTGAT GAAGATGGCC TTCGACCGGA TCGTCGCCGG CACCGCGATC
CTGCTGTTCG CTCCGCTGCT GATCGGGCTC GGCCTGGCGG TGCGGTTCAC CAGCCGCGGT
CCGGCGATCT TCAAACAGAT CCGGGTCGGG CGCGGCGGCA GCGAGTTCCG GATGTACAAG
TTCCGTTCGA TGTATGTAGA CGCCGAGCAG CGCAAGGCCG AGCTCGAGTC GAGCAACGAG
CGCGCCGAGG GCCTGCTGTT CAAGATGCGG GACGACCCTC GGATCACCAA GGTGGGCAAG
TTCCTCCGGA AGTGGTCGCT CGACGAGCTG CCCCAGCTGT TCAACGTGGT CAACGGCAGC
ATGTCGCTGG TCGGCCCGCG CCCGCCGCTG CCCTCGGAGG TCGCGCGCTA CGAGGACGAC
GTCTACCGCA GGCTGATGGT CAAGCCCGGC CTCACCGGGC TGTGGCAGAT CAGCGGGCGC
AGCGACCTGG AGTGGAACGA GTCCGTCCGG CTCGACCTGC GCTATGTCGA GAACTGGTCG
CTGGCCATGG ACTTCGTCAT CCTGTGGCGC ACGCTGTTCG CCGTGCTGCG CCGCGAGGGC
GCCTACTGA
 
Protein sequence
MPAGTQFDFA HVETSPDATS YRAQVGWERR YVRLLVLFDA IACVISAGLA YFVRFGDVVD 
FDTEPPSSKP YIIMTVLLPL AWVLAMSLNR AYESRFLGGG SEEFRRVVNA AARLTALVAV
ASYATKAEIA RSYVLIAFPA ATLLSVAGRS AGRGILHRMR RQGRCLHRVL VVGAGESAAT
LVRLAQRDPT TGWSVVGVCL DRLPGRHSHD RPERSGFDLL GVPIVGTSEN LHTAIQATHA
TTVAIGPQMD GETLRRVLWA LEGSDVDVLV SSALTDVTGP RISIRPVAGL PLLHIEEPEL
SGTRRLMKMA FDRIVAGTAI LLFAPLLIGL GLAVRFTSRG PAIFKQIRVG RGGSEFRMYK
FRSMYVDAEQ RKAELESSNE RAEGLLFKMR DDPRITKVGK FLRKWSLDEL PQLFNVVNGS
MSLVGPRPPL PSEVARYEDD VYRRLMVKPG LTGLWQISGR SDLEWNESVR LDLRYVENWS
LAMDFVILWR TLFAVLRREG AY
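
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the value given in the record), in which the amino acid assignments match the standard code and GTG is one of the permitted start codons, read as methionine at the first position. The function name `translate_cds` and the helper constants are hypothetical and written for this example only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace in the input."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An alternative start codon is still read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate_cds("GTGCCCGCTG GCACGCAGTT CGACTTCGCC"))  # MPAGTQFDFA
print(translate_cds("GCCTACTGA"))                         # AY
```

The two fragments reproduce the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural next check.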