Gene Francci3_1577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1577 
Symbol 
ID3903712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1891571 
End bp1893142 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content67% 
IMG OID637878914 
Productundecaprenyl-phosphate galactosephosphotransferase 
Protein accessionYP_480682 
Protein GI86740282 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.673615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.183885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGG CCGGGGACAC CGCACGACGA AGGCCCGAGG TGTCTGACTC GACCGGGGGG 
CGGGACGAGG ACCCCGCGGT GCTCTCGTCG GTGGCGGCCA CGCCCCGGGA CACGACCGTC
CCGGCTATTC CGCGACAACG GCAACCAGAG CCGCCGGCAG CCGGGCCGCC GGAGTGGAAG
GTCAGGCTGA CCGCCAGAAT CTTCATCATC GATGCCACGG CAATCACGGT CGCGCTTACC
TGTTCCTATC TCCTCCGGTT CGGCATGAAT GCGAACCCAA CCGTCCACGG GGCGTCATAT
CTTTCCGTCG CGATAGGTAT CGGACTGGGC TGGATCGCCA TGCTCGGCGC CGCCGACACC
TATCAGACGA AATATCTGGG CATCGGAACC GAGGAATACC GGCGGATCAG CGTCGCGACA
TTCCGACTGT GGGGCACGAC GGCGATCCTC TCCTACGTCC TGCGCGCAGA AGTCGCTCGC
GGGTTCTGTC TGGTGGTCCT GCCGCTGGGG CTACTGCTGC TGATTACCGG TCGGATGCTC
GCCCGCCGAC GGCTGGTGGC GGCTCGGCAG GCCGGCCGTG CCCGCCACCG GGTCGTCGTC
GTCGGCGACC GTAGCACGGT GGCCGAACTC GTCAGCGAGC TCCGATATGA ACCCGCAGCC
GGATTCGAGA TCGTCGGCGC ATGTCTACCC CGGCAGGACG ACTACTCGGC CGATTATTCT
CCCTTTCCCG TCCTCGGCGC CCTGCCGGCG CTGCGGTCCA CCGTCGCCCG CGCCATGGCC
GACACGGTGG TGGTCGCCTC GTCCGTGGCG GTCAACATCG AAGCCGCCAA GCGGATCGCC
TGGGACCTCG AGGGGACCGG TGTCGATCTT GTCATCGCGT CGAGCATGGC CGGAATCGCC
GGGCCGCGGG TATCCCTGCG GCCGATCGCC GGTCTCCCCC TGCTACACGT GGAGAGCCCG
GTCTATACCG GTTGGCGAAA GGTGGCCAAA GACATCTTCG ACCGGGTCCT TGCCGCCGTG
GCCCTCGTTA CCCTGTCCCC GCTGCTGCTT CTCGTCGCGT TGACCATCCA GGTTGACAGC
ACCGGACCGG CATGGTTTCG CCAGACCCGG GTGGGTAAGG ACGGCCGCGA GTTCCAGATC
CTCAAGTTCC GGACGATGTA CGTGGATGCC GAACGGCGCC GGGCAGCGCT GGAGGAGCGC
AACGAGGCCG ACGGACCACT TTTCAAAATT CGCGACGACC CCCGCGTCAC TCGGGTCGGA
CGAACGCTAC GGCACCTGTC GATCGACGAG CTGCCGCAAC TCGTCAACGT CCTCCGCGGC
GAGATGTCGC TGGTGGGGCC GCGGCCGCCG CTGCCGGCAG AGGTCGCTCA ATATCACGAC
TCCGTCCACC GTCGATTCAA GGTCAAGCCC GGCCTGACCG GACTGTGGCA GGTGAATGGG
CGTTCAGAAC TGCCCTGGCG GGACGGGGTG CGACTCGACC TCTACTACGT AGAGAATTGG
TCGATCATGC TCGACCTCGC CATCATCGCC CGGACTGTTA GCGCCGTGCT GCGGCGGTCC
GGCGCATTCT AG
 
Protein sequence
MAKAGDTARR RPEVSDSTGG RDEDPAVLSS VAATPRDTTV PAIPRQRQPE PPAAGPPEWK 
VRLTARIFII DATAITVALT CSYLLRFGMN ANPTVHGASY LSVAIGIGLG WIAMLGAADT
YQTKYLGIGT EEYRRISVAT FRLWGTTAIL SYVLRAEVAR GFCLVVLPLG LLLLITGRML
ARRRLVAARQ AGRARHRVVV VGDRSTVAEL VSELRYEPAA GFEIVGACLP RQDDYSADYS
PFPVLGALPA LRSTVARAMA DTVVVASSVA VNIEAAKRIA WDLEGTGVDL VIASSMAGIA
GPRVSLRPIA GLPLLHVESP VYTGWRKVAK DIFDRVLAAV ALVTLSPLLL LVALTIQVDS
TGPAWFRQTR VGKDGREFQI LKFRTMYVDA ERRRAALEER NEADGPLFKI RDDPRVTRVG
RTLRHLSIDE LPQLVNVLRG EMSLVGPRPP LPAEVAQYHD SVHRRFKVKP GLTGLWQVNG
RSELPWRDGV RLDLYYVENW SIMLDLAIIA RTVSAVLRRS GAF