Gene Francci3_0276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0276 
Symbol 
ID3905718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp317381 
End bp320365 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content72% 
IMG OID637877604 
ProductWD-40 repeat-containing protein 
Protein accessionYP_479392 
Protein GI86738992 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.741402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGAG TCTTCGTCAG CCATGCGGGC TCCGACGCGG AGCTGGCCGA CAGAGTGCAT 
CAGACGCTGA CCCTTCTTGG CCACCGGGTG TTCCTCGCTC AGAACATCCG GGTGGGCGTC
CAGCCCGGCG ACATGTGGAA GGATCGGTTG CACCGCGAGC TGCGAGCCGC CGACGCCGTG
GTCTGTGTCA TCACCAACGC CTACAACGCC TCCCCGTGGT GCGCCTACGA ACTGGGAATC
GCCTACGAGG TCGGCAGCCT GCTGCTGCCG CTGCGCGTGG AGCCGACGGT CACCTCCCCG
CTTCTGGAGG ACCGGCAGCA CGTTCACCTG CACGAGAATC CGGGGTGGGC CGACAACCTG
AACATTGGGC TGCGGCGCGT CGAGGCCGGC GGCGGGCTGG GCTGGGCGGA CGACCGGTCC
CCGTTCCCGG GTCTGCGGCC GTTCACCCGG GACATGGCCA AGGTGTTCTA CGGGCGCGGC
AACGAGCTGC GGCGACTCAC CCACCGGCTA CGCTCCCTCG GTGAACAGCG TCTGCTACTG
GTGGTCGGCG CCTCCGGATG CGGCAAGTCG TCGCTGGTCA CCGCCGGCCT GCCGGCCCAG
CTGGCGGGCG AGCCCGGCTG GGAGGTGGCC GACCCCTTCT GGCCGGGCCG GGACCCGGTC
CGCAAGCTCG CCCACACCCT GACCCAGGCT GCGAGGCGTG TCGGCCAGCA GTGGGCGGTC
ACCGACATCG ACCGGCAGCT CAGGGAGGAC GACCTGGCCC TGATGGGGCT GGCCGAGGAA
CTTCTGACGG CCGGCTCCGG CCCGGTGCGG GACAGCCTGC TGCTACTGAT CATCGATCAG
GGCGAGGAGC TGTTCACCCG ATCGGACGAA GAGGAACGGG CCCGGTTGGC GACCCTGTTG
CGGCATGGGG TCACCGGTCC CGTGCGGGTG GTCATGGCGC TTCGCTCCGA GTTCCAGGAC
CAGGTCTTCG CCCTGCCCGA ACTGAGCGGC CCCCCCGAAC AGGGCGGGGT CAACCTGCAC
GTCTTCCCCC TCCGCCCGCT GGCCCGGGAC ATGCTGCCGG TCGTCATCAC CGAGCCGGCC
CGGGTAGCCG GCCTGCACGT GGAGCCGGAG CTGGTCGCCA GGATGGTCGC GGACACCGAG
GACGGCGAGG CGCTGCCGCT GCTGGCGTTC ACGCTCCATG AGCTGGCCAA GGACCAGACC
CGCGGCGGCA CCCTGTCGAT GGACCGGTAC CACGAGCTGG GTGGCGTGCG GGGGGCGCTG
GCCCGCCGCG CCGACGCCGT CCTCGACGAC GCCGTGGAGA CGAGCGGGCT GGGCCGGGAG
GAGATCCTGG TCGGCATGGC CGAGCTCGCC ACGCTCGACG AGGCCGGTCG GCGGACCCGC
CGACGCATCG ACCTCAAGGA CCTGCCGTCC GACGCGTTGC GGAGAACCTT CCAGGTCTTC
GTGAACCAAT GGCTGCTGTC CACCCCCAGC GACAGCCTGG GGACCGCCGT CGGGGTCGTC
CACGAGGCGT TGCTGACCGC CTGGAAGCCA CTGGATGCCG TCATCCGGGA ACGGGAAGCC
GCCCTGCTCA GTGAGGGCAA GGTCGAGCGC GCCGCCGCCG AGTGGGAGCA CAACGACGGC
CCGCTCTGGG ACGAGGACCG GCTGACCACC GCCATCACGA CGCTGTGGCA GTCCTATGAG
CGTTTCCGCG CCGGTGAGGA GCCGGTGGTC CATCTCAAAC CGGCCGGGCG GAGGTTCATT
GACGCGTGCG ACCAGCGGAT CGAGGAAGCC CGTCGCCAGG AACGTACGCG CGTCGAGGCG
GCCCGCCGCC GGGAACGTCA CGAACGTCAA CGCCGCCAGC AGGTCATCGC TGTCCTTTCC
GTGCTCCTCG TCTTCACCGT CGTCGGGGCC GCGGTCGCCA CCTGGCAGTC CCTCACCGCC
CGGCAGGCGC AGCGCACCGC GGAACGGGCG CAGCGCACCA CCGAACGGGC GCGGCACGCC
CTGATCACTC ACACCCTGCT CGCCGACGCC GACGCCGCGC GGGCGACGGA CCAGGTGCGC
GCGTTACGAC TCGCCATCGC CGCCGAGGAC ATCGACCCCG GTACCGGTGG TGCGGCCGGC
GGTAAGGCCG CCAGTGGTGA GGCCGCCAGT GGTGAGGCCG GCGGTAAGGC CGACAGGGGC
GCCGACAGCC GGGCGAACTT GCTGCAAACC CTCGCCGAGG ACCCGAACCC ACTCGCCAGC
CGCGGCGATC CCGTCACCTC GGTGGCGTTT TCCCCGGATG GACGCATCCT GGCCGTCGGC
GGCGCGAACG CAACGGTCGT GCTGTGGGAT GTGACGGAGG GTCAACATCG TCGGATCGAC
CCGCCCCTCA CCGGCCAGCA CAGCGGGATC ACGTCGGTGA CGTTCTCGCC GGACAGCCGG
AACCTCGACG CGACCGGCGT CGACGGGACG GTGCTGCGGT GGGACGTCAC CGACCCAGAC
CGGCCACAAT TGCGGGGGCG TGTGAACGCC GCCGCGGCCC TCCCCACGCC GTGGGATGCG
TTCTCCCCGG ACGGGCGCCT CAGAGCCACC GGCCAGGACG GATCAATCAT GCTGGCGGAT
GTCACCGCCC CGGATCGGCC CCGGCCGCCC AGCATCCTTC TCGCCGACCC CGGACCGGTC
CGGGCCGTGG CCTTCTCCCC GCACGGGCAG GTCCTGGCCG CGGTCGGATC TGGCCGGAGG
GTATGGCTCT GGGATACCAG CGTCACCCCG CCGCGGCAGA TCGGGCAGCC GCTGACCGGG
CACACCCGCT CCGTCCTGTC GCTGGCGTTT TCCCCGGACG GCGGGACGCT GGCCTCCGGC
GGCAACGATG GCACCGTCCG GCTGTGGCGG CTCGCCGCGA TCGACGCGTT CCGCCGCAAC
GGCGTCGTCG AGTACGCCTG CCACGAGGCC CAGGGCGGGC TCGACCGGTC GGCGTGGAGC
TTCCACATTC CCGGCCTGCC CTACCAGGAC ACCTGCGCGG GGTGA
 
Protein sequence
MARVFVSHAG SDAELADRVH QTLTLLGHRV FLAQNIRVGV QPGDMWKDRL HRELRAADAV 
VCVITNAYNA SPWCAYELGI AYEVGSLLLP LRVEPTVTSP LLEDRQHVHL HENPGWADNL
NIGLRRVEAG GGLGWADDRS PFPGLRPFTR DMAKVFYGRG NELRRLTHRL RSLGEQRLLL
VVGASGCGKS SLVTAGLPAQ LAGEPGWEVA DPFWPGRDPV RKLAHTLTQA ARRVGQQWAV
TDIDRQLRED DLALMGLAEE LLTAGSGPVR DSLLLLIIDQ GEELFTRSDE EERARLATLL
RHGVTGPVRV VMALRSEFQD QVFALPELSG PPEQGGVNLH VFPLRPLARD MLPVVITEPA
RVAGLHVEPE LVARMVADTE DGEALPLLAF TLHELAKDQT RGGTLSMDRY HELGGVRGAL
ARRADAVLDD AVETSGLGRE EILVGMAELA TLDEAGRRTR RRIDLKDLPS DALRRTFQVF
VNQWLLSTPS DSLGTAVGVV HEALLTAWKP LDAVIREREA ALLSEGKVER AAAEWEHNDG
PLWDEDRLTT AITTLWQSYE RFRAGEEPVV HLKPAGRRFI DACDQRIEEA RRQERTRVEA
ARRRERHERQ RRQQVIAVLS VLLVFTVVGA AVATWQSLTA RQAQRTAERA QRTTERARHA
LITHTLLADA DAARATDQVR ALRLAIAAED IDPGTGGAAG GKAASGEAAS GEAGGKADRG
ADSRANLLQT LAEDPNPLAS RGDPVTSVAF SPDGRILAVG GANATVVLWD VTEGQHRRID
PPLTGQHSGI TSVTFSPDSR NLDATGVDGT VLRWDVTDPD RPQLRGRVNA AAALPTPWDA
FSPDGRLRAT GQDGSIMLAD VTAPDRPRPP SILLADPGPV RAVAFSPHGQ VLAAVGSGRR
VWLWDTSVTP PRQIGQPLTG HTRSVLSLAF SPDGGTLASG GNDGTVRLWR LAAIDAFRRN
GVVEYACHEA QGGLDRSAWS FHIPGLPYQD TCAG