Gene Franean1_6542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6542 
Symbol 
ID5674857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7956832 
End bp7958334 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content68% 
IMG OID641245391 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_001510785 
Protein GI158318277 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACAG ATGCCGTGCC GACTGTCCTG GGGGGAGATG CGCCCGTCCC GGCCATCCCT 
CGGCAGCGGT GTCCGGACGA ATGGCCGGCG GCCGGCAGCC AGCGGTGGAA GGCACGGCTG
TGCACGCAGC TCGTGGTGAT CGACACTTTG GCGATCGTCG TCGCGATCGT CTGCGCCTAC
CTCCTCCGCT TCGGCATCGA AGCGGAGGCG ACCGCTCGGG GGATGTGGTA CCTGTCCATC
GGCACCTGCA TCGCATTGGC GTGGCTTGTC ATGCTGGGTG CCACCGACGC CTACGCGACG
AGATACCTCG GCGTGGGGAC CGAGGAGTAC CGGCGCATCA GCGTCGGGAC CTTCCGGCTG
TGGGGCGCCA CGACAATCGG GTGCTACGTC CTGCGGGCGG AGGTCGCGCG GGGCTTCTGC
CTGATCGCCC TGCCCCTCGG ACTCCTGCTA CTGCTGACCG GCCGGGCGCT CGCCCGCCTC
CGGTTGGTCT CCGTTCGGCG GACCGGGCAT GCCCGCCACC GGGTCGTGGT GGTCGGCGAC
TCTCGTTCGG TCTTGGAACT CGTCGGTGAG TTCCATCATG AACCCGCGGC CGGATTCGAC
GTCATCGGCG TGTGTGTGCC GAAGGGGGCC CGACGCCCCG ACGGCAACGT CGGTGCTCCC
GTGCTCGGAT CATTGGAACA GGTGAGCTCC GTCGTCGCCG CTACCGGCGC CGACACGGTT
GTGGTCACCT CCTCCGCATC GCTCGACATC GAGACAGCGA AGGGTATTGC CTGGGAGCTG
GAGGGCACCG GGGTCGACCT GGTCGTGGCC CCGCCCCTGG GCGGGATTGC CGGGCCGCGC
GTGTCACTGC GGCCGGTCGC CGGCCTCCCT CTCCTCCACG TGGAGGAGCC GGTGTTCACC
GGCTGGCGAA AGTTCGCCAA GAACATGCTG GACCGGGCCC TCGCCGCCGT CGCTCTCGTC
GTTCTGTGCC CGTTGCTACT CGCCGTCGTG CTGCTCATCC GTTTCGACAG CAGCGGACCC
GCGCTGTTCC GGCAGACACG AACGGGTAAG GACGGACGCG ACTTCGAGAT CCTGAAGTTC
CGCACCATGT ATGTGGACGC CGAGCAGCGC CGGGCGGTAC TGGAGAATCA CAACGAGGCC
GACGGGCTCC TTTTCAAGAT TCGTGACGAC CCCCGGGTGA CCCGGGTCGG GCGGACGCTC
CGTCGGCTGT CCGTCGACGA ACTGCCCCAG TTCGTCAACG TCCTGCGGGG GGAGATGTCG
CTGGTAGGTC CGCGGCCCCC GCTGCCGTCG GAAGTCGCTC GGTACAGCGG ACCCGTACAC
CGCCGATTGA AGGTCAAGCC CGGCCTCACC GGCCTGTGGC AGGTGAGCGG ACGCTCGGAG
CTGCCGTGGC GGGATGCCGT CCGGCTCGAC CTCTACTACG TGGAGAACTG GTCCATCATG
TTCGATCTGA TCATCATCCT GCGGACCGTC CGGGCGGTAC TCGGTAGGTC GGGAGCGTTC
TGA
 
Protein sequence
METDAVPTVL GGDAPVPAIP RQRCPDEWPA AGSQRWKARL CTQLVVIDTL AIVVAIVCAY 
LLRFGIEAEA TARGMWYLSI GTCIALAWLV MLGATDAYAT RYLGVGTEEY RRISVGTFRL
WGATTIGCYV LRAEVARGFC LIALPLGLLL LLTGRALARL RLVSVRRTGH ARHRVVVVGD
SRSVLELVGE FHHEPAAGFD VIGVCVPKGA RRPDGNVGAP VLGSLEQVSS VVAATGADTV
VVTSSASLDI ETAKGIAWEL EGTGVDLVVA PPLGGIAGPR VSLRPVAGLP LLHVEEPVFT
GWRKFAKNML DRALAAVALV VLCPLLLAVV LLIRFDSSGP ALFRQTRTGK DGRDFEILKF
RTMYVDAEQR RAVLENHNEA DGLLFKIRDD PRVTRVGRTL RRLSVDELPQ FVNVLRGEMS
LVGPRPPLPS EVARYSGPVH RRLKVKPGLT GLWQVSGRSE LPWRDAVRLD LYYVENWSIM
FDLIIILRTV RAVLGRSGAF