Gene Francci3_4184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4184 
Symbol 
ID3907149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4988440 
End bp4993413 
Gene Length4974 bp 
Protein Length1657 aa 
Translation table11 
GC content73% 
IMG OID637881512 
Producthypothetical protein 
Protein accessionYP_483261 
Protein GI86742861 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0532898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACCG CATCTGCGCT CAGCTCCGCG CTCTCGCTCA CCGCGCAGAT GATGCGACCT 
CGCCCGGAAC GGCCGGACAT GGGGGGCTTG CTCATAGCAC AGCGACAACG CACTGGCACG
GTCAGACTCA GCCGCGCCCC GAGGATAACG TTCGGGTTCC GCGGCATTCC GCCCGGGCGC
CGTCGGATTG ATCTATGGTT GATGCGCCAG CTGGATGACC TGGTCGGGGG CATCTGGCCG
CTTATGTCGG GCACGGGGGG CAGGGCGAGA GCTGTGGCTC GTTCATCGAC GTCGCGGACG
TCGCGGTTGG TTCTGGGATC CGCCGGTGCG CGGGTTCTCG TCGTCGGCAC GGGCGGGCTC
GGTGGGCCCG GCGTCGACCG GGCGGGCGAG GTCGGCGCCG TGGTAGCGCG GGCCGTGGCC
GCGGTCGGCG GCGTGTTCGT GGAGCGCTGC GGGCTGGCCG AGGAAAGCCT GCGCACGGTC
GTCGACCCGG CGACCCCGCT CGATCTCGGC GAGGCTCTCG CGGCGGCCGT CGAGCAGGCC
ACCGAGGCTC TCCTCGTCTA CTACGCCGGC ATCGTCCTCG TCGACGCCGA CGGGATGATC
CATCTGGCGA CGGCCGTGTC GGACAGCCGT CCCCATCGCC TGGCCTACAC GGCCCTGCCG
TTGTCGTCGG TCCTGGCGAC GATGGCGACG AGCCGGGCCG CCACCCGGCT CATCGCCGTC
GACGGTCTGT GGGTGCGGAC CACGGTCGCC AACTCCACCG TCCCGCCGGT GCCACCTCGC
GTTCCCGGTC TTCGCGAGGA TGTTCCCGGT CCTCGTGAAG ATGCGGGGGG CGGCGTCAAC
GCGGCCGGCG ACGGCGTGGA GCTGCTCGTG CTGACCTCGC CGCTGGCCGC GCACACCGGG
CCGGCCGCGG ATGACGGACC GCCGCTGTCC GCCGGCCTGG TCCGGCTCCT CGTCAACGGC
GACCCGGACG GCCCGGTCGA GCTGAGCGTC GGGCAGGTGA CCCGGCGGCT CGCCCAACGT
CTCACCGCGG CCGGGAACGC AGTGCTGCAT TCCGCCGCCG CCTGGTCGGA TGGGGTGATC
CTCGCCGGCA ACGCCGCGCA CACCCCGCCG ATCGCCGACC TGGAGCCGGC GCCGACCACC
GTGCTCACCG ACGGCCCCGG CCCGGTGGTG ACTCCAACCC CGCGACAGCC GGCGGGGCTT
CCCAGCGCAC CGGACGGCGG GGGGACCACC GGTGAGGCCG CCTGCCCCTA TCCCGGTCTC
GCCTCGTTCG ACGCCGGCAC GCAGCGCTGG TTCTTCGGGC GGGAGCGAGC GACGGCCGAG
GCGCTGTCGC GGCTGGAGGA ACGGCTGGAC GGCGCGGGTC CGCTGGTGCT CGTCGGTGCC
TCCGGCTCGG GAAAGTCGTC GCTGCTGGGC GCCGGGCTGC TCCCGGCGCT CGCCCGGGGC
GCAGTGGCGG GCGCCCGCGG TTGGCCGCAG GTCCTGATGA CACCCACGGA TCACCCGGCC
CGCGAGCTGG CCCGCACGGT GGCGCAGGCG GCCGGGCTGT CCGTGCTCGC GGCGGCACGG
CTCGACGTCG TCTCGCAACC CGCCCGTTTC GTCGGGGCGC TGTCGGATCT GCTCGCCCTG
GGTCCGGCCG CCGCCGGCGG CGGGACCACC CCGCCGACCC GGCGGGTGGT GATCGTCGTC
GACCAGTTCG AGGAGACGTT CACCCTGTGC GCGGATGAGG AGGAGCGCGG CGCGTTCATC
CGCGCGCTGG TCGCCGCGTG CCGCGGGGAC GGGGCCACCC GTCCACCGGC GGTCGTCGTC
ATCGGCGTCC GCGCCGACTT CTATGGCTGG TGCGCGGCCT ATCCGGAGCT GGTGGACGCG
CTCCAGACGG GGCAGGTCGT GGTCGGGCCG ATGACCGCGG CGGAGGTGCG CGACGCCATC
GTGAAGCCGG CCGAGGCGGT AGGGCTCACG GTGGAGCCGG GCCTGGTCGA TCTGATGCTG
CACGACATCG GCGCCGACGA GCACGGCGTC GTCGCCGACC CCGGATCGTT GCCCCTGCTC
GCGCACGCGC TGCGCGCCAC CTGGCGGGAA CGCGAGAACG GCACTCTGAC GGTGGCCGGT
TACCTGCGGG CCGGGGGTCT ACGGGGGGCC ATCGCCCGGA CGGCGGAGGA GACCTTCGGC
ACGCTGGACG CGGCCGGCCA GGATGCGGCC TGGCAGATCC TGCTACAGAT GATCCAGTTC
GGGGATGGGA CGGACGACAC CCGCCGCCGC GTCTCACGCA CCGAGCTGCT GTCCACGTCG
GCGTCAGCCC AGAACTCGGC GTCAGCCCAG AACTCGGCGT CAGCCCAGAA CTCGGCGTCA
GCCCAGAACT CGGCGTCAGC CCAGAACGTC TCCGCCGTCC TCGACGCGCT GGTCGCCGCC
CGACTCGTCA CCGTGGACGC CGACACCGTC GAGATCTCCC ACGAGGCGCT GCTGCGCTCG
TGGCCGCGGC TGCGCGAGTG GATCGAGGTC GATCGTGCCG CGGCGGTGAC CCGACAGCGG
CTCACCGACG CCGCGGCCAC CTGGGACGAC GGCGGGCGGG ACCCGTCCTA CCTGTTCGCC
GGCAGTCGGC TGGCGACCGT CCGGGAGTGG GCGCAGGCCG GGCAGCGGCA GACGGCACTG
AGCCCGGTCG CGCGGGACTT TCTGGCGGCC TCCCTCGCGG CCGAGCAGGC GGCGCGGCGG
GCCGAGGTCC GGCGGACCAG GCGGCTGCAC CAGCTCGTCG CCGCTCTCAC GGTGCTGACC
CTGATCGCCG GTGGAGCGAT GGCGTTCGCC TTCCGGCAAC GGGCCCAGAT CGCCGGCGAG
CGGGACCGGG CGCTGTCGGT GGCGATCGGC CGGCAGGCCG ACCAGCTACG CAGGTCCGAC
CCGCTGGCCG CCGCCCAGCT CGGCGCCGTC GCCTACCGCC TGGCGCCGAC GGCGGAGGCG
CGCAGCGCCG TGCTGTCGAC GTTCGCGAGC GACTACGGCT TCGCCACCCG TCACCTCGGC
ACGCAGACCC GTCCGATCGG CGCGGTCGCC TACAGCCGGG ACGGCAGGCT GGTGGCGACG
GCGAGCGACG ACTGGACGGT CGCCCTGTGG TCCGCGTCAG ACCCCACCCG CCGCACCCCG
CTCAGCACGA TCAGGGGCCA CCGGCTGGCG GTGAAATCCG TGGCGTTCAG CCCGGACGGC
CGGTTCGTCG CCACCGGGAG CGACGACCGG ACGGTCCGGC TGTGGACCAT CACCGATCCC
ACTCACCCGG TGCTCGCGGC GACGTTACCC GGCGCCTCCG ACTCGATCCA CGGGCTGGCC
TTCCGCTCCG ACAGCCGGGT CCTTGCCACC GGCGGGTACG GATCACAGGT GCGGCTGTGG
GACGTCTCCG ACGTGAGCGC GATCCGGCCG GCCGGCGCGA TCACCGCCCA CACCGCGAAC
GTCCGTGCCG TCGCGTTCAG TCCCGACGGC ACCGAGCTGC TGACCGGTGG CGACGACGGC
CGGGCCCTGC TGTGGGACGT GCGGGACCTC GCCGCCCCGG CGGAGCTGAG CGAGCTGAAG
GGCGAGGCGG GCCCGAATGT CGTGCTCAGC GCGAACACGG CCGTCCGTGC GGTCGCCATC
AGCCCGGACG GGAAGACGGC GGTGACCGGC AGCGACGACA CCACCGCGCG GGTCTTCGAT
CTCACCGACC CGCGACGGCC GACCACCCTC CAGGTCATCG ACGAGGTGCG ATCGGTCGAG
GCGGTGGCGG TCAACGTGGA CGGCCTCGTG GCCGTGGGGG TGAACAGCGC GGTCGAGATC
TACGATCCGT CCCGCCATTT CGACGCCATC GCGGTCCTGC CGCACGCGTC GAAGCCGTGG
GCGTTGGCCT TCAGCCCGGA CGGGCGAACC CTCGCCTCCG GTGCCGACGA CCGCACCCTG
CGCCTGTGGG ACGTGCCGGG GCCGGTGCTG GTCGGGCACC ACCGCTTCCT GTGGTCGACG
GCGGTCCATC CGAACGGGCG GGTCCTAGCC ACCGGGGACT ACGGGCAGAC CGTCCGGCTG
TGGGATGCGC ATGACCCCGA TCGGCCGCTA CCGGCCGCCA CGTTGACCGG ATTCAACGCC
GCGGTCGGCA GCGTCCGGTT CACCCCGAAC GGCAGGGTCC TCATCGCCGG CAGCCTCGAC
TACGCCACCG ACTTCGACGG CATGGTCACG CTGTGGGACG TCGCCGACAT CCACCGTCCC
AGGCTGCTGT CCACCATCCA TCCCGGCATC GGCGGGATCA AGGACCTGGA GGTCAGCCCG
GACGGCCGGA CCCTCGCGCT GGCCGGCACG AGCGCCCGGC TGGCCCTGGT GGACATCTCC
GCCCCGGCCA CGCCCCGGGG GCGGTCCGTC CTCGCAGGGC ACCGCTACGA CGTCGCCACC
GTCAGTTTCA GCCCGGACGG GCGCACCCTG GCGTCCGGAA GCAGCGACCG GACCATCCGG
TTGTGGGACA TCACGACACC GGACCGACCG GCGGGCATCA GCACGATCAC GGGCTTCACC
AGCGCCCTGA TCACCATCGC CTTCAGCCGG GACGGAAGGA CCCTCGCCGT CGGCAGCTTC
GACACGAAGG TCGCCCTGTG GGACGTCACG GACCGCCGGG CCCCCCGCGC GCTGCCCTCG
ATCATCGGGT TGGGGGCCGG CGTCAACTCC ATCGAATTCA GCTCCGACGG CCGGCTCATG
GCGGCGGGCG TCGGCGGCGC GAACGGCGTG GTGCGGCTGT GGGACATGTC CGATCCGGCC
GACCCGGTGC CGTACGCCAC CTTCAGCGGC CGCTACAACG GCTTCGCTGC GGTCGCCTTC
GCCCCGGACA CCGCCTTCAT CGTCGGAGCG CCCTACCAGG TGATCGGATT CGTCTGGGAC
CTCGACCCGG CGAGCGTCGT CACGCGGCTG TGCGCGCGGG CCGGCGACCC GCTCACCCGC
GAGGAATGGA AACAGTACCT GCCGGGCCTC GACTACGATC CGCCGTGCGG CTGA
 
Protein sequence
MATASALSSA LSLTAQMMRP RPERPDMGGL LIAQRQRTGT VRLSRAPRIT FGFRGIPPGR 
RRIDLWLMRQ LDDLVGGIWP LMSGTGGRAR AVARSSTSRT SRLVLGSAGA RVLVVGTGGL
GGPGVDRAGE VGAVVARAVA AVGGVFVERC GLAEESLRTV VDPATPLDLG EALAAAVEQA
TEALLVYYAG IVLVDADGMI HLATAVSDSR PHRLAYTALP LSSVLATMAT SRAATRLIAV
DGLWVRTTVA NSTVPPVPPR VPGLREDVPG PREDAGGGVN AAGDGVELLV LTSPLAAHTG
PAADDGPPLS AGLVRLLVNG DPDGPVELSV GQVTRRLAQR LTAAGNAVLH SAAAWSDGVI
LAGNAAHTPP IADLEPAPTT VLTDGPGPVV TPTPRQPAGL PSAPDGGGTT GEAACPYPGL
ASFDAGTQRW FFGRERATAE ALSRLEERLD GAGPLVLVGA SGSGKSSLLG AGLLPALARG
AVAGARGWPQ VLMTPTDHPA RELARTVAQA AGLSVLAAAR LDVVSQPARF VGALSDLLAL
GPAAAGGGTT PPTRRVVIVV DQFEETFTLC ADEEERGAFI RALVAACRGD GATRPPAVVV
IGVRADFYGW CAAYPELVDA LQTGQVVVGP MTAAEVRDAI VKPAEAVGLT VEPGLVDLML
HDIGADEHGV VADPGSLPLL AHALRATWRE RENGTLTVAG YLRAGGLRGA IARTAEETFG
TLDAAGQDAA WQILLQMIQF GDGTDDTRRR VSRTELLSTS ASAQNSASAQ NSASAQNSAS
AQNSASAQNV SAVLDALVAA RLVTVDADTV EISHEALLRS WPRLREWIEV DRAAAVTRQR
LTDAAATWDD GGRDPSYLFA GSRLATVREW AQAGQRQTAL SPVARDFLAA SLAAEQAARR
AEVRRTRRLH QLVAALTVLT LIAGGAMAFA FRQRAQIAGE RDRALSVAIG RQADQLRRSD
PLAAAQLGAV AYRLAPTAEA RSAVLSTFAS DYGFATRHLG TQTRPIGAVA YSRDGRLVAT
ASDDWTVALW SASDPTRRTP LSTIRGHRLA VKSVAFSPDG RFVATGSDDR TVRLWTITDP
THPVLAATLP GASDSIHGLA FRSDSRVLAT GGYGSQVRLW DVSDVSAIRP AGAITAHTAN
VRAVAFSPDG TELLTGGDDG RALLWDVRDL AAPAELSELK GEAGPNVVLS ANTAVRAVAI
SPDGKTAVTG SDDTTARVFD LTDPRRPTTL QVIDEVRSVE AVAVNVDGLV AVGVNSAVEI
YDPSRHFDAI AVLPHASKPW ALAFSPDGRT LASGADDRTL RLWDVPGPVL VGHHRFLWST
AVHPNGRVLA TGDYGQTVRL WDAHDPDRPL PAATLTGFNA AVGSVRFTPN GRVLIAGSLD
YATDFDGMVT LWDVADIHRP RLLSTIHPGI GGIKDLEVSP DGRTLALAGT SARLALVDIS
APATPRGRSV LAGHRYDVAT VSFSPDGRTL ASGSSDRTIR LWDITTPDRP AGISTITGFT
SALITIAFSR DGRTLAVGSF DTKVALWDVT DRRAPRALPS IIGLGAGVNS IEFSSDGRLM
AAGVGGANGV VRLWDMSDPA DPVPYATFSG RYNGFAAVAF APDTAFIVGA PYQVIGFVWD
LDPASVVTRL CARAGDPLTR EEWKQYLPGL DYDPPCG