Gene Francci3_3473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3473 
Symbol 
ID3905207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4141243 
End bp4143243 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content72% 
IMG OID637880795 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_482555 
Protein GI86742155 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.467717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAGAC GTAGAGTAGA CGGAGTGGGT CATGGTCGGT GGACAGCGCA CATCGCACGG 
CGCGTCTCCC GGCTCAACCT CCGTCAGCGG GTCGGCCTGC AGATCACTCT GGACAGTGCG
GCGCTGGTCC TGGGCCTCAT CTCGGCCCAG ATCGGCCGCC TGGACTTCAC GCTCGGCGCG
CTGAGCGATC TCGGCTTCTG GGTCATCTGC ACGCTCGCCG TCTGCGTGCT CCACTTCCTG
GGCACCGCCC TGCACCTCTA CCTGGGGCGC TACCGCTTCG GCGGCTTCGA GGAGGTCCTC
GGCCTGCTCG TGTCGGTCCT GCTCACGGTC GTCAGCGTGG CCGTCGTCAT CGCCGCGTTC
GGCTCCCCGA GACCCGTCCC GCTGAGCGTG CCGCCACTGG GCGGCGCGGT GGCCCTGGTC
GTGATGTTCG GCATCCGCTA CCTCTGGCGG CTGACCGAGG AACACCTCCG TCGGCCGAGC
AGGGAGCACG CCGAACCGCT CCTCGTGTTC GGCGCCGGTG ACGGGGCTGA ACGGGTACTG
GCGGCCATGC TGCGCACCCG GAACAGTCCT TACTACCCGG TCGCGCTGCT CGACGACGAC
CCGAGCACAC ACAACCTCCA GCTACTCGGG GTCCGGGTAC GTGGTGGCCG GGAGCGAATC
GGCGCCGTCG CCGAGTCGAC CGGGGCCAGG ACCCTGCTCG TGGCGATCCC CAGCGCTGAC
GGCCCACTGC TGCGGGAGAT CAGCGCAATC GCCGAGGGCG CCGGGCTGAC CGTGAAGGTG
CTTCCCCGCG TCGCGGACCT GATCGACGGG CGGGTCGGCG TCGGGGACAT CCGCGACCTC
GACCTCGCCG ACCTCCTCGG CCGGCGGCAG ATCCGCACCG ACATGTCCGC CGCCGCGAGC
TACCTCGCGG GCCGACGGGT GCTCGTGACG GGGGCGGGCG GATCGATCGG TTCGGAGCTG
TGCCGTCAGA TCTCCGGCTA CGGGCCGGCC GAACTGATCA TGCTGGACCG GGACGAGTCG
GCACTGCGCG CGGTGCAGCT GTCGATCTCC GGCCGGGCGA TGCTCGACGA CGACGCCATC
GTGCTGGGCG ACATCCGCGA CATCGACCTG ATGACCACGC TGTTCACGGA GCGCCGGCCC
GAGGTCGTCT TCCACGCCGC CGCGCTCAAA CACCTCCCGC TGCTCGAACG CTTCCCCGGC
GAGTCGGTGA AGACCAACGT CTGGGGCACT CTGACGATCC TGGAGACGGC CGTGGCCTGC
GGCGTCGACC GGCTGGTCAA CATCTCCACG GACAAGGCGG CGAACCCGAC GAGCGCCCTC
GGCTACTCGA AGCGGATCAC CGAGCGGCTC ACCGCGTGCC TCGCCCGCCG GGCCCGCGGA
ACGCTGGTCA GCGTCCGGTT CGGCAACGTC CTGGGCAGCA ACGGCTCCGT CCTGACCGTC
TTCGCCGGCC AGCTGGCCGC CGGCGGGCCG ATCACCGTCA CCCACCCCGA GGTCACCCGG
TACTTCATGA CCATCCACGA GGCGGTGCAA CTGGTCCTGC AGGCCGGGGC GCTGGGATCA
CCCGGCGAGG CCCTCGTGCT CGACATGGGC GAGCCGGTGC GCATCGCGGA CGTGGCCGCC
CGGCTCGTGG CTCGGGAGAA CCGGCCGATC GAGATCGTCT ACACCGGGCT CGGCCCCGGC
GAGAAGCTCC ATGAGGAGCT CCTCGGTGCG GGCGAGGACG ACCATCGACC ACACCACCCG
CTGATCTCGC ACGTGGACGT GCCCGCCCTG GACCCGACCC ACGCCCTCGC CCTCGATCCC
TGGGCCCCGC CGGAGGAGGT GCTGGCCGAA CTCGCGGCCC TCGCCGGCGC GGACGCCGCG
GCGGACGAGG TCCCGGCAGG CGCGGACCGA CCCGGGGACG GGGGCGCGAC CGCAGGCGGA
CCCCTGGCCG CCGCGGACGT GACCGGGCGG ATTCCCGTCC AGCCCACGGC GTCCAACCAT
CAACCGCATC CGGCCCGGTG A
 
Protein sequence
MWRRRVDGVG HGRWTAHIAR RVSRLNLRQR VGLQITLDSA ALVLGLISAQ IGRLDFTLGA 
LSDLGFWVIC TLAVCVLHFL GTALHLYLGR YRFGGFEEVL GLLVSVLLTV VSVAVVIAAF
GSPRPVPLSV PPLGGAVALV VMFGIRYLWR LTEEHLRRPS REHAEPLLVF GAGDGAERVL
AAMLRTRNSP YYPVALLDDD PSTHNLQLLG VRVRGGRERI GAVAESTGAR TLLVAIPSAD
GPLLREISAI AEGAGLTVKV LPRVADLIDG RVGVGDIRDL DLADLLGRRQ IRTDMSAAAS
YLAGRRVLVT GAGGSIGSEL CRQISGYGPA ELIMLDRDES ALRAVQLSIS GRAMLDDDAI
VLGDIRDIDL MTTLFTERRP EVVFHAAALK HLPLLERFPG ESVKTNVWGT LTILETAVAC
GVDRLVNIST DKAANPTSAL GYSKRITERL TACLARRARG TLVSVRFGNV LGSNGSVLTV
FAGQLAAGGP ITVTHPEVTR YFMTIHEAVQ LVLQAGALGS PGEALVLDMG EPVRIADVAA
RLVARENRPI EIVYTGLGPG EKLHEELLGA GEDDHRPHHP LISHVDVPAL DPTHALALDP
WAPPEEVLAE LAALAGADAA ADEVPAGADR PGDGGATAGG PLAAADVTGR IPVQPTASNH
QPHPAR