Gene Francci3_0930 details


Gene Information

Locus tag: Francci3_0930
Symbol:
ID: 3906094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1087983
End bp: 1089173
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 73%
IMG OID: 637878264
Product: glycosyl transferase family protein
Protein accession: YP_480043
Protein GI: 86739643
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.72237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCG TGCTGTTCGT GGTGCCGCCC CTGACCGGGC ATGTCAACCC GGCGGTCGGG 
GTCGCCGCCG AGCTGGCCGC CCGTGGTCAC GAGGTGGCGC TCGCCGGGTA CGCGGGCGTC
ATCGGATCGT TGATCCCGCC GGAGCTGGCC CTGCTGGCGT TACCCGAGGC GGGCCTGGGC
GAGAAGTGGT CCCGGATCCA GGACGCGTCT CGGGCGCTGC GGGGGCCGGC GTCGCTGAAG
TTCCTGTGGG AGGACTTCCT GCTCCCGCTC GGCTCGATGA TGGCCCCGGC GATCGACAGG
ATCATCGACG ACTTCCAGCC CGACCTGCTC GTCATCGACC AGCAGGCGGT GGGCGCCGCG
CTCGTCGCCC GTCGCCGGGG CCTGCGCTGG GCGACCCTCG CGGCGACCTC CGCCGAGTTC
GACAATCCTT ACGGGGTGCT CGCGGGCCTG GGGCAGTGGG TCGTCGACCG CCTGCGGGAG
TTCCAGACCG GGCACGGAGT GCCGCCCGAC GAGGCTGCCA TCGGGGACCT GCGCTTCTCC
GAGGCGTTGA CCCTCGTCTT CTCCGTCCGG GAAATGCTGC ACAATCCCGG GATCCCGGAC
TACGCGGTCT TCGTCGGCAG CGCGGTCGGG AAACGGGCCG GGGCGGGCGA GTTCCCCTGG
GACTGGCTCG ACCCGGCCCG CCGGGCGGTG CTCGTCTCCC TCGGCACGGT GACCCGGGAG
GCCGGCGGTC GGTTTCTGCG GGCGGCGGCC GAGGGGCTGC TCGGCCTGCC CGAGCGGGTA
CAGGCGATCG TCGTCGCCAC ACCCGGGCTC GTTGACGACC TCGCCGCCGC GGCGCCCGAC
GATCTGCTGG TCGCGCCGTT CGTGCCGCAG GTCGCGTTAC TTCCGCGGTT GTCGGCGGTG
GTGTGCCACG CCGGCAACAA CACCGTCTGT GAGTCGCTGT CACACGGGGT TCCGCTGGTG
GTCGCCCCGG TTCGCGACGA TCAGCCGATC ATCGGCGAGC AGGTCGTCCG TAACGGGGCC
GGGGTGCGCG TCAAGTTCGG TCGGGCCGGA CCCACCGCCG TGCGTTCCGC GGTGACGGCC
GTGCTGGACG ATCCGTCGTA CCGGGCCGCC GCCGCCCGGA TGCGGGCCGC CTTCGCCGCG
GCCGGCGGGG TGGCCGCCGC CGCCGATCAC CTTGAGAAGC TGGCGGTCTG A
 
Protein sequence
MSRVLFVVPP LTGHVNPAVG VAAELAARGH EVALAGYAGV IGSLIPPELA LLALPEAGLG 
EKWSRIQDAS RALRGPASLK FLWEDFLLPL GSMMAPAIDR IIDDFQPDLL VIDQQAVGAA
LVARRRGLRW ATLAATSAEF DNPYGVLAGL GQWVVDRLRE FQTGHGVPPD EAAIGDLRFS
EALTLVFSVR EMLHNPGIPD YAVFVGSAVG KRAGAGEFPW DWLDPARRAV LVSLGTVTRE
AGGRFLRAAA EGLLGLPERV QAIVVATPGL VDDLAAAAPD DLLVAPFVPQ VALLPRLSAV
VCHAGNNTVC ESLSHGVPLV VAPVRDDQPI IGEQVVRNGA GVRVKFGRAG PTAVRSAVTA
VLDDPSYRAA AARMRAAFAA AGGVAAAADH LEKLAV
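
Consistency check (sketch)

The lengths in this record agree with each other: End bp minus Start bp plus 1 gives 1191 bp, and 1191 bp is 397 codons, that is 396 amino acids plus one stop codon. The Python sketch below shows one way such a check could be scripted. The function names `span_length` and `summarize_cds` are hypothetical helpers introduced for illustration and are not part of any database tool; the code assumes the CDS ends in a stop codon and that the sequence text may contain spaces and line breaks, as it does above. The short test sequence is a toy example, not data from this record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def summarize_cds(raw: str) -> dict:
    """Summarize a CDS given as text that may contain spaces and line breaks.

    Assumption: the final codon is a stop codon, so the protein has
    one residue fewer than the number of codons.
    """
    # Drop the block spacing and line breaks used in the record layout.
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    gc_count = seq.count("G") + seq.count("C")
    return {
        "length_bp": len(seq),
        "gc_percent": round(100 * gc_count / len(seq), 2),
        "protein_aa": len(seq) // 3 - 1,
    }


# Coordinates taken from the record above.
print(span_length(1087983, 1089173))

# Toy sequence (hypothetical): two codons followed by a stop codon.
print(summarize_cds("ATGGCC TGA"))
```

To check the record itself, the full gene sequence text can be pasted into `summarize_cds` in place of the toy string and the result compared with the Gene Length, Protein Length, and GC content fields.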