Gene Francci3_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3854 
Symbol 
ID3905602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4617096 
End bp4618709 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content71% 
IMG OID637881180 
Productundecaprenyl-phosphate galactosephosphotransferase 
Protein accessionYP_482933 
Protein GI86742533 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.147783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACA CTGTCTTCAG CCATGCACAG GTGGGGTTGG CCGCGGGGCC GGGCACGGAT 
GCCATCCCGA AACCGCTGGA CGGGGCCGAT GAGCCCGCCC GGTACGACGG GTACGAGGTG
GGCGCGGGCC CCGGCGCGGA CGGAACATCC GACGGTGTCG TCGACTACCG GGCGAAGGTT
CGCTGGGAAC GGCCGTACGG CTGGCTGCTC GTGGGTTCCG ACGCGGCCGC CTGCGTGGTC
GCGGCGGGAG TGGCCTACCT GGTCCGGTTC GGCGGACTGG TTGAGTTCGA CCTGAAGCCG
GTCTCCGCGA CGCCGTATGT CGCCATGTCG ATCCTGCTGC CCGTCGCGTG GGTGTTGTCG
ATGTACCTGA GCCGGGCGTA CGAGAGCCGG TTCCTCGGCA GCGGCTCGGA GGAGTTCCGC
CGGGTCCTCA ACGCCGCCGC GCAGCTCACC GCGGTGGTGG CCGTCCTGTC CTATGCGACC
AAGGCCGAGC TCGCGCGTGC CTACGTGCTG ATCGCCTTCC CGACGGCGCT GCTGCTGACC
GTGGCCGGCC GCGCCACCGC GCGCGGCTAC CTGCACCGGA TGCGCCGGGC GGGCCGCTGC
CTGCACCGGG TGCTGGTGGT CGGGGCGGGG GACTCCGCGG CGGCGTTGGT CCGGCTGGCC
CAGCGGGATC CGACGTCGGG TTGGTCGGTC GTCGGTGTCG TGCTCGACCG CTCGCCCGGC
CGGCACAGTC ACGACTCCCC GGAACGCAGT GGGTTCGACC TGCTCGGGGT GCCGATCGTC
GGCACCTCGG AAACCCTGCA CACGGCCATC CGGGCGACGT ACGCCACCAC GGTTGCCATC
AGTCCGCAGA TGGACGGCGA GACGTTGCGC CGGGTGCTGT GGACGCTGGA GGGCAGCGAC
GTCGACGTGC TGGTCTCCTC GGCGCTGACC GACGTGACCG GGCCGCGGAT CTCGATTCGT
CCGGTGGCCG GGCTGCCGCT GCTGCACATC GAGGAGCCGG AGCTCACCGG CGCGCGGCGG
GCCATGAAGG GCCTGTTCGA CCGCAGTGTG GCCGCGGGGG TTCTCCTGCT GTGCGCGCCG
TTGTTCCTGG CCCTGGCGCT GGCGGTGCGC CTGACCAGTC GTGGGCCGGC CATTTTCAAG
CAGATCCGGG TCGGCCGGGG TGGTGAGCAC TTCACCATGT ACAAGTTCCG GTCGATGTAC
GTCGACGCGG AGGCGCGCAA GGCGGAGCTG GAGTCGCGCA ACGAGCGGGC CGAGGGGCTG
CTGTTCAAGA TGCGTGACGA CCCGCGGATC ACCAAGGTCG GGAAGTTCCT GCGCAAGTGG
TCGCTCGACG AGCTGCCGCA GCTGCTCAAC GTCGCGAACG GCACGATGTC GCTGGTGGGG
CCGCGTCCGC CGCTGCCGTC GGAGGTCGCC CGCTACGAGG ACGACGTGCA CCGCCGGCTG
ATGGTGAAGC CGGGGCTGAC CGGCCTGTGG CAGATCAGCG GCCGGTCGGA CCTCGAATGG
GACGAGTCGG TGCGGTTGGA TCTGCGGTAC GTGCAGAACT GGTCGCTGCC CCTCGACTTC
TACATCCTCT GGCGCACCGT GTTCGCGGTC CTGCGCCGCG AGGGTGCGTA CTGA
 
Protein sequence
MTDTVFSHAQ VGLAAGPGTD AIPKPLDGAD EPARYDGYEV GAGPGADGTS DGVVDYRAKV 
RWERPYGWLL VGSDAAACVV AAGVAYLVRF GGLVEFDLKP VSATPYVAMS ILLPVAWVLS
MYLSRAYESR FLGSGSEEFR RVLNAAAQLT AVVAVLSYAT KAELARAYVL IAFPTALLLT
VAGRATARGY LHRMRRAGRC LHRVLVVGAG DSAAALVRLA QRDPTSGWSV VGVVLDRSPG
RHSHDSPERS GFDLLGVPIV GTSETLHTAI RATYATTVAI SPQMDGETLR RVLWTLEGSD
VDVLVSSALT DVTGPRISIR PVAGLPLLHI EEPELTGARR AMKGLFDRSV AAGVLLLCAP
LFLALALAVR LTSRGPAIFK QIRVGRGGEH FTMYKFRSMY VDAEARKAEL ESRNERAEGL
LFKMRDDPRI TKVGKFLRKW SLDELPQLLN VANGTMSLVG PRPPLPSEVA RYEDDVHRRL
MVKPGLTGLW QISGRSDLEW DESVRLDLRY VQNWSLPLDF YILWRTVFAV LRREGAY