Gene Francci3_3257 details

Gene Information

Locus tag: Francci3_3257
Symbol:
ID: 3904428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3858261
End bp: 3860135
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 67%
IMG OID: 637880582
Product: cellulose synthase (UDP-forming)
Protein accession: YP_482343
Protein GI: 86741943
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
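
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function and constant names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
START_BP = 3858261
END_BP = 3860135
GENE_LENGTH_BP = 1875
PROTEIN_LENGTH_AA = 624


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 1875 bp gene corresponds to 625 codons, of which the last is the stop codon, which agrees with the listed protein length of 624 aa.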


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.660262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGGTCG ATCTTGGCGA GGAGTTCCTT GCCGCCGAGC TGGGCGAACA AAGGAGAGAC 
TCTTCCGCCG CGGCCTCGGT GATGCCCCCC AGGGAATATG CGGCGGATCG GATTGCGCCA
AGGAACGGTG CCGTCGACGG AGCGCCGGTC CCAGAAGCGC CGGTCCCAGA AGCTCGGAAC
GAGTCGCTGC CAAGCCCGCC GTCGGACCGA CTGGTGTACG CCTACCTCGG CCGACAGCGG
CGCTGGGTAC TACTGTGCAT GACCGCGTCG TTCACCCTTG CCTCGTTCAG CCTGGTCAAG
TTCGCGCTGC TGGCGTCGGC GTTGTGGGTG TTTCTCCTGC TACTCGGGCT CAACGCGATT
TGTTCGGCCT TCACGATTGT GGCCACCCAG CACCGGCGGC GCGTAACGAG ACAGAGCCAT
GAGGGCCTGG TCAGCGGCTG GCGGCCGATC TACTGCCCCA GCGTCGACAT CTTTCTTCCC
AGCGCGGGTG AACCTCTCCC GGTGTTGCTC AACACCTATT CACACATCGC CCGGGTCAGG
TGGCCGGGGC AGCTACGGGC GTACGTCCTC GACGACAGCG GCCGCCCGGA GGTCGCCGCC
GCGGCCGCCG CCCACGGCTT CAGCTACCTG AGCCGGCCGG ACCGGGGCCG GATGAAGAAG
GCCGGTAACA TCCAGTTCGG GTTCGAGCAC TCGCGCGGCG ACTACATCGC CATCTTCGAC
GCCGACTTCT GCCCGCGGCC GGACTACCTC TTTCACCTGG CACCGTACCT CGACGACCCG
AGCGTCGGTA TCGTGCAGAG CCCGCAGCAC TTCGATACGA AACGGTCGAT GGGCTGGCTG
CAGCGGACCG CCGGCGCCAC CCAGGAGCTC TTCTATCGCT GGGTGCAGCC GTCCCGCGAT
GCCGTCGGCG TCCCGATCTG CGTCGGAACC AATGCGATCT ACCGGCGCGG CAGCTTGCGG
AAGGCAGGCG GCTTCGCCCA GATCGAACAC AGCGAGGACG TCTTCACCGG TGTCAAGCTG
CTGGCGGCGG GCTACACCAC CCGCTATGTA CCGGTGGTGC TGGCGACTGG CCTGTGCCCG
GCTGACCTGG CCGGTTTCAT CAACCAGCAG TACCGCTGGT GCAGCGGCTC CATGGCCCTG
CTGCGGACCC GTCAGCTACG CCACATCAGA CTCGACTGGA AACAGCGGGT CTGCTTCTGG
AGCGGGTTCC TCTACTATAT TTCCACCGCG ATCAACGTCT TTACCATTAA CATTCCTGGC
CTGTTGATGG TGTACGTCTA CCCGGAGCTG GTGCGGCCCT ATAATTTCCT GCCCTTCCTG
GCCGCCGCCT GGGTGTGGCT GGTCCTGCTC CCGGCCACCA GCCGCGGTCG ATGGCGTTTC
GAAGTGCTGC GGGTGCAACT TGTGTACAGC TTCTGCCACG CTGTCGCGAT CCTGCACATG
GTGCGTGGTC GCACGGCATC CTGGGTGGCG ACCGGCTCCG TGCGCAAAAG GGGGAACCCG
ATGGTCCGCG GCGTGGTTCG GACCGCCTTC TGCTGGCTAC TGCTGACGAC GTGCGCGACA
TGGACCGGCA TCGGTCTCGA CGTCTGGAGG TTCGGCTGGG GAAACTTTTG GCTCGTGATT
CTGTTTCAGC TCGGCCACAG CTATCTGAGC GTCCCGCTCC TGTTCGACCT GAGCCGCCTG
CTCGTGGGAC GCGGCGACCG GCCGGCACGC AGCCGGCACC GCGCCAGGCA GTCTCAGCCC
GACTCCGTGG TCCGGGCATG GACCGGCGGT CCGCCGGCTC ACAGCGCGCC GGCTCACAGC
GCTCCGGCTC ACAGCGCTCC GGTCTCGCCC TCGCACGGCG CTCCACCCGC CAGCGCGGCC
CTGCGCCAGG GGTGA
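
A GC content figure such as the one in the Gene Information section can be computed directly from a sequence block in this layout. The following is a minimal sketch, assuming the sequence is plain text in groups of ten bases separated by whitespace; `gc_content` is a hypothetical helper name, not a tool used to build this record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are ignored, so a block copied
    from the page can be passed in unchanged.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, as an example input.
first_row = "GTGGAGGTCG ATCTTGGCGA GGAGTTCCTT GCCGCCGAGC TGGGCGAACA AAGGAGAGAC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the complete gene sequence instead of the first row gives a value that can be compared with the reported GC content, keeping in mind that the record rounds to a whole percent.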
 
Protein sequence
MEVDLGEEFL AAELGEQRRD SSAAASVMPP REYAADRIAP RNGAVDGAPV PEAPVPEARN 
ESLPSPPSDR LVYAYLGRQR RWVLLCMTAS FTLASFSLVK FALLASALWV FLLLLGLNAI
CSAFTIVATQ HRRRVTRQSH EGLVSGWRPI YCPSVDIFLP SAGEPLPVLL NTYSHIARVR
WPGQLRAYVL DDSGRPEVAA AAAAHGFSYL SRPDRGRMKK AGNIQFGFEH SRGDYIAIFD
ADFCPRPDYL FHLAPYLDDP SVGIVQSPQH FDTKRSMGWL QRTAGATQEL FYRWVQPSRD
AVGVPICVGT NAIYRRGSLR KAGGFAQIEH SEDVFTGVKL LAAGYTTRYV PVVLATGLCP
ADLAGFINQQ YRWCSGSMAL LRTRQLRHIR LDWKQRVCFW SGFLYYISTA INVFTINIPG
LLMVYVYPEL VRPYNFLPFL AAAWVWLVLL PATSRGRWRF EVLRVQLVYS FCHAVAILHM
VRGRTASWVA TGSVRKRGNP MVRGVVRTAF CWLLLTTCAT WTGIGLDVWR FGWGNFWLVI
LFQLGHSYLS VPLLFDLSRL LVGRGDRPAR SRHRARQSQP DSVVRAWTGG PPAHSAPAHS
APAHSAPVSP SHGAPPASAA LRQG
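
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is an accepted alternative start codon and the initiating residue is methionine. The following is a minimal sketch of such a translation, assuming the input is a complete CDS whose first codon is a valid start codon (the sketch does not verify this); `translate_cds` and the constants are illustrative names.

```python
from itertools import product

# Codon table in the usual TCAG ordering. The amino acid assignments of
# table 11 are the same as those of the standard code; the difference
# lies in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading the first codon as methionine.

    Assumption: the first codon is a valid start codon (for example ATG
    or GTG). Whitespace in the input is ignored.
    """
    dna = "".join(sequence.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = ["M"] + [CODON_TABLE[codon] for codon in codons[1:]]
    protein = "".join(residues)
    # Drop the terminal stop codon symbol if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGAGGTCG ATCTTGGCGA GGAGTTCCTT"))  # MEVDLGEEFL
```

The printed residues match the first ten residues of the protein sequence. Forcing the first residue to M is a simplification; a stricter version would check the first codon against the start codons defined for the table.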