Gene Franean1_7281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7281 
Symbol 
ID5675582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8891779 
End bp8893059 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID641246118 
Producthyaluronan synthase 
Protein accessionYP_001511506 
Protein GI158318998 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.265868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.277341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTGGT TCGGCGTCGC GTTCGATTTC GTTCGCGATC ATCGTTCTCT GGTCCCGCTC 
GGGATCGCGG GCGTTGTCTC CTGGGTGGTG TGGCTCACTC GCCGCCTGCT CTCCACCCGG
TACCGCCCCG TCCGGAACAA CTTCCGAGCC AGTACCTCGG TGATCGTTCC ATCGTTCCGG
GAAGACCCCG ACGTCCTCAT CCGGTGCCTG GAGACCTGGC TCTCCCAGCA GCCGGACGAG
ATCATCATCA TCCCGGATGT GGAGGACACC GAGCTCATAG CGAGGCTCGC CCAGCGCGCC
GACCCGACGG TCCGCGTGAT CCCGTTCGTC CACGAGGGCA AGCGTTCGGC GCTGGGGGTC
GGCCTGTCCG CGGCGACCAG GGACATCGTC GTGCTCTGCG ACTCCGACAC CGCTTGGGAG
CCGGGGCTGC TCGCCGCGGT GCAGATGCCG TTCGTCGACC CGCAGGTCGG CGGGGTGGGA
ACCCGGCAGA ACGTCTACGA GCCGCGCAGC AGCGTGTGGC GGCGGGTCGC GAACTGGCTC
GTCGACATCC GCTACCTCGA CTACGTGCCG GCGCAGGGCC GGGTCGGCGC CGTCGCCTGC
CTGTCCGGGC GCACGGCGGC CTACCGGCGC TCGGCGATCC TGCCCGTGCT GCACAACCTG
GAGCACGAGT TCTTCCTCGG CCGGCGGTGC ATCGCCGGTG ACGACGGCCG GCTGACCTGG
CTGGTGCTCG CGTCCGGCTA CAAGACCATG CACCAGCACA CGGCGCACGC GATGTCGATG
TTCCCCGACA ACCTGCGGGC CTTCATCAAG CAGCGGGTGC GCTGGAGCCG GAACTCCTAC
CGGACCTACC TGACCGCCAT CTACAAAGGC TGGCTGTGGC GGCAGCCACT GATCACCCAG
GTCAGCGTGC TGCAGATCGT GCTCACCCCG CTGACCATGG GTGTCGCGAT GACCTACTTC
GTGCTGTGGA TGTTCCGGCC GGAGGCGAAC GCCCCGATCA TCGCGATCGC CTGGCTGCTG
CTCGGGCGGT TCATCCGCGG GCTCTCCCAC CTCAAGGAGC ACCCGCGGGA CATCTTCATC
CTCCCGCTCA CAGTGTTGAT GATCATCGTC GTCGCGCTGC CCATCAAGAC CTGGGCGTTC
GTGTCGATGA ACAAGCAGGG CTGGCTGACC CGGCGCTCCG ACCTCATCGG CGGGGAGGGC
CAGACCGACG CCTCTACGCG AACCAGCCCG GCCGCGAGCC CCCGCCCGGC GACAGCGACC
GCGATGGGTG GTACCCGATG A
 
Protein sequence
MEWFGVAFDF VRDHRSLVPL GIAGVVSWVV WLTRRLLSTR YRPVRNNFRA STSVIVPSFR 
EDPDVLIRCL ETWLSQQPDE IIIIPDVEDT ELIARLAQRA DPTVRVIPFV HEGKRSALGV
GLSAATRDIV VLCDSDTAWE PGLLAAVQMP FVDPQVGGVG TRQNVYEPRS SVWRRVANWL
VDIRYLDYVP AQGRVGAVAC LSGRTAAYRR SAILPVLHNL EHEFFLGRRC IAGDDGRLTW
LVLASGYKTM HQHTAHAMSM FPDNLRAFIK QRVRWSRNSY RTYLTAIYKG WLWRQPLITQ
VSVLQIVLTP LTMGVAMTYF VLWMFRPEAN APIIAIAWLL LGRFIRGLSH LKEHPRDIFI
LPLTVLMIIV VALPIKTWAF VSMNKQGWLT RRSDLIGGEG QTDASTRTSP AASPRPATAT
AMGGTR