Gene Information |
Locus tag | Francci3_4184 |
Symbol | |
ID | 3907149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4988440 |
End bp | 4993413 |
Gene Length | 4974 bp |
Protein Length | 1657 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881512 |
Product | hypothetical protein |
Protein accession | YP_483261 |
Protein GI | 86742861 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
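The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record's own numbers agree with both assumptions, but the record does not state them. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4988440
END_BP = 4993413
GENE_LENGTH_BP = 4974
PROTEIN_LENGTH_AA = 1657


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons use only numbers reported in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same two helpers can be applied to any other unspliced, non-pseudo CDS record that reports start, end, gene length, and protein length.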
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0532898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGACCG CATCTGCGCT CAGCTCCGCG CTCTCGCTCA CCGCGCAGAT GATGCGACCT CGCCCGGAAC GGCCGGACAT GGGGGGCTTG CTCATAGCAC AGCGACAACG CACTGGCACG GTCAGACTCA GCCGCGCCCC GAGGATAACG TTCGGGTTCC GCGGCATTCC GCCCGGGCGC CGTCGGATTG ATCTATGGTT GATGCGCCAG CTGGATGACC TGGTCGGGGG CATCTGGCCG CTTATGTCGG GCACGGGGGG CAGGGCGAGA GCTGTGGCTC GTTCATCGAC GTCGCGGACG TCGCGGTTGG TTCTGGGATC CGCCGGTGCG CGGGTTCTCG TCGTCGGCAC GGGCGGGCTC GGTGGGCCCG GCGTCGACCG GGCGGGCGAG GTCGGCGCCG TGGTAGCGCG GGCCGTGGCC GCGGTCGGCG GCGTGTTCGT GGAGCGCTGC GGGCTGGCCG AGGAAAGCCT GCGCACGGTC GTCGACCCGG CGACCCCGCT CGATCTCGGC GAGGCTCTCG CGGCGGCCGT CGAGCAGGCC ACCGAGGCTC TCCTCGTCTA CTACGCCGGC ATCGTCCTCG TCGACGCCGA CGGGATGATC CATCTGGCGA CGGCCGTGTC GGACAGCCGT CCCCATCGCC TGGCCTACAC GGCCCTGCCG TTGTCGTCGG TCCTGGCGAC GATGGCGACG AGCCGGGCCG CCACCCGGCT CATCGCCGTC GACGGTCTGT GGGTGCGGAC CACGGTCGCC AACTCCACCG TCCCGCCGGT GCCACCTCGC GTTCCCGGTC TTCGCGAGGA TGTTCCCGGT CCTCGTGAAG ATGCGGGGGG CGGCGTCAAC GCGGCCGGCG ACGGCGTGGA GCTGCTCGTG CTGACCTCGC CGCTGGCCGC GCACACCGGG CCGGCCGCGG ATGACGGACC GCCGCTGTCC GCCGGCCTGG TCCGGCTCCT CGTCAACGGC GACCCGGACG GCCCGGTCGA GCTGAGCGTC GGGCAGGTGA CCCGGCGGCT CGCCCAACGT CTCACCGCGG CCGGGAACGC AGTGCTGCAT TCCGCCGCCG CCTGGTCGGA TGGGGTGATC CTCGCCGGCA ACGCCGCGCA CACCCCGCCG ATCGCCGACC TGGAGCCGGC GCCGACCACC GTGCTCACCG ACGGCCCCGG CCCGGTGGTG ACTCCAACCC CGCGACAGCC GGCGGGGCTT CCCAGCGCAC CGGACGGCGG GGGGACCACC GGTGAGGCCG CCTGCCCCTA TCCCGGTCTC GCCTCGTTCG ACGCCGGCAC GCAGCGCTGG TTCTTCGGGC GGGAGCGAGC GACGGCCGAG GCGCTGTCGC GGCTGGAGGA ACGGCTGGAC GGCGCGGGTC CGCTGGTGCT CGTCGGTGCC TCCGGCTCGG GAAAGTCGTC GCTGCTGGGC GCCGGGCTGC TCCCGGCGCT CGCCCGGGGC GCAGTGGCGG GCGCCCGCGG TTGGCCGCAG GTCCTGATGA CACCCACGGA TCACCCGGCC CGCGAGCTGG CCCGCACGGT GGCGCAGGCG GCCGGGCTGT CCGTGCTCGC GGCGGCACGG CTCGACGTCG TCTCGCAACC CGCCCGTTTC GTCGGGGCGC TGTCGGATCT GCTCGCCCTG GGTCCGGCCG CCGCCGGCGG CGGGACCACC CCGCCGACCC GGCGGGTGGT GATCGTCGTC GACCAGTTCG AGGAGACGTT CACCCTGTGC GCGGATGAGG AGGAGCGCGG CGCGTTCATC CGCGCGCTGG TCGCCGCGTG CCGCGGGGAC GGGGCCACCC GTCCACCGGC GGTCGTCGTC 
ATCGGCGTCC GCGCCGACTT CTATGGCTGG TGCGCGGCCT ATCCGGAGCT GGTGGACGCG CTCCAGACGG GGCAGGTCGT GGTCGGGCCG ATGACCGCGG CGGAGGTGCG CGACGCCATC GTGAAGCCGG CCGAGGCGGT AGGGCTCACG GTGGAGCCGG GCCTGGTCGA TCTGATGCTG CACGACATCG GCGCCGACGA GCACGGCGTC GTCGCCGACC CCGGATCGTT GCCCCTGCTC GCGCACGCGC TGCGCGCCAC CTGGCGGGAA CGCGAGAACG GCACTCTGAC GGTGGCCGGT TACCTGCGGG CCGGGGGTCT ACGGGGGGCC ATCGCCCGGA CGGCGGAGGA GACCTTCGGC ACGCTGGACG CGGCCGGCCA GGATGCGGCC TGGCAGATCC TGCTACAGAT GATCCAGTTC GGGGATGGGA CGGACGACAC CCGCCGCCGC GTCTCACGCA CCGAGCTGCT GTCCACGTCG GCGTCAGCCC AGAACTCGGC GTCAGCCCAG AACTCGGCGT CAGCCCAGAA CTCGGCGTCA GCCCAGAACT CGGCGTCAGC CCAGAACGTC TCCGCCGTCC TCGACGCGCT GGTCGCCGCC CGACTCGTCA CCGTGGACGC CGACACCGTC GAGATCTCCC ACGAGGCGCT GCTGCGCTCG TGGCCGCGGC TGCGCGAGTG GATCGAGGTC GATCGTGCCG CGGCGGTGAC CCGACAGCGG CTCACCGACG CCGCGGCCAC CTGGGACGAC GGCGGGCGGG ACCCGTCCTA CCTGTTCGCC GGCAGTCGGC TGGCGACCGT CCGGGAGTGG GCGCAGGCCG GGCAGCGGCA GACGGCACTG AGCCCGGTCG CGCGGGACTT TCTGGCGGCC TCCCTCGCGG CCGAGCAGGC GGCGCGGCGG GCCGAGGTCC GGCGGACCAG GCGGCTGCAC CAGCTCGTCG CCGCTCTCAC GGTGCTGACC CTGATCGCCG GTGGAGCGAT GGCGTTCGCC TTCCGGCAAC GGGCCCAGAT CGCCGGCGAG CGGGACCGGG CGCTGTCGGT GGCGATCGGC CGGCAGGCCG ACCAGCTACG CAGGTCCGAC CCGCTGGCCG CCGCCCAGCT CGGCGCCGTC GCCTACCGCC TGGCGCCGAC GGCGGAGGCG CGCAGCGCCG TGCTGTCGAC GTTCGCGAGC GACTACGGCT TCGCCACCCG TCACCTCGGC ACGCAGACCC GTCCGATCGG CGCGGTCGCC TACAGCCGGG ACGGCAGGCT GGTGGCGACG GCGAGCGACG ACTGGACGGT CGCCCTGTGG TCCGCGTCAG ACCCCACCCG CCGCACCCCG CTCAGCACGA TCAGGGGCCA CCGGCTGGCG GTGAAATCCG TGGCGTTCAG CCCGGACGGC CGGTTCGTCG CCACCGGGAG CGACGACCGG ACGGTCCGGC TGTGGACCAT CACCGATCCC ACTCACCCGG TGCTCGCGGC GACGTTACCC GGCGCCTCCG ACTCGATCCA CGGGCTGGCC TTCCGCTCCG ACAGCCGGGT CCTTGCCACC GGCGGGTACG GATCACAGGT GCGGCTGTGG GACGTCTCCG ACGTGAGCGC GATCCGGCCG GCCGGCGCGA TCACCGCCCA CACCGCGAAC GTCCGTGCCG TCGCGTTCAG TCCCGACGGC ACCGAGCTGC TGACCGGTGG CGACGACGGC CGGGCCCTGC TGTGGGACGT GCGGGACCTC GCCGCCCCGG CGGAGCTGAG CGAGCTGAAG GGCGAGGCGG GCCCGAATGT CGTGCTCAGC GCGAACACGG CCGTCCGTGC GGTCGCCATC AGCCCGGACG 
GGAAGACGGC GGTGACCGGC AGCGACGACA CCACCGCGCG GGTCTTCGAT CTCACCGACC CGCGACGGCC GACCACCCTC CAGGTCATCG ACGAGGTGCG ATCGGTCGAG GCGGTGGCGG TCAACGTGGA CGGCCTCGTG GCCGTGGGGG TGAACAGCGC GGTCGAGATC TACGATCCGT CCCGCCATTT CGACGCCATC GCGGTCCTGC CGCACGCGTC GAAGCCGTGG GCGTTGGCCT TCAGCCCGGA CGGGCGAACC CTCGCCTCCG GTGCCGACGA CCGCACCCTG CGCCTGTGGG ACGTGCCGGG GCCGGTGCTG GTCGGGCACC ACCGCTTCCT GTGGTCGACG GCGGTCCATC CGAACGGGCG GGTCCTAGCC ACCGGGGACT ACGGGCAGAC CGTCCGGCTG TGGGATGCGC ATGACCCCGA TCGGCCGCTA CCGGCCGCCA CGTTGACCGG ATTCAACGCC GCGGTCGGCA GCGTCCGGTT CACCCCGAAC GGCAGGGTCC TCATCGCCGG CAGCCTCGAC TACGCCACCG ACTTCGACGG CATGGTCACG CTGTGGGACG TCGCCGACAT CCACCGTCCC AGGCTGCTGT CCACCATCCA TCCCGGCATC GGCGGGATCA AGGACCTGGA GGTCAGCCCG GACGGCCGGA CCCTCGCGCT GGCCGGCACG AGCGCCCGGC TGGCCCTGGT GGACATCTCC GCCCCGGCCA CGCCCCGGGG GCGGTCCGTC CTCGCAGGGC ACCGCTACGA CGTCGCCACC GTCAGTTTCA GCCCGGACGG GCGCACCCTG GCGTCCGGAA GCAGCGACCG GACCATCCGG TTGTGGGACA TCACGACACC GGACCGACCG GCGGGCATCA GCACGATCAC GGGCTTCACC AGCGCCCTGA TCACCATCGC CTTCAGCCGG GACGGAAGGA CCCTCGCCGT CGGCAGCTTC GACACGAAGG TCGCCCTGTG GGACGTCACG GACCGCCGGG CCCCCCGCGC GCTGCCCTCG ATCATCGGGT TGGGGGCCGG CGTCAACTCC ATCGAATTCA GCTCCGACGG CCGGCTCATG GCGGCGGGCG TCGGCGGCGC GAACGGCGTG GTGCGGCTGT GGGACATGTC CGATCCGGCC GACCCGGTGC CGTACGCCAC CTTCAGCGGC CGCTACAACG GCTTCGCTGC GGTCGCCTTC GCCCCGGACA CCGCCTTCAT CGTCGGAGCG CCCTACCAGG TGATCGGATT CGTCTGGGAC CTCGACCCGG CGAGCGTCGT CACGCGGCTG TGCGCGCGGG CCGGCGACCC GCTCACCCGC GAGGAATGGA AACAGTACCT GCCGGGCCTC GACTACGATC CGCCGTGCGG CTGA
|
Protein sequence | MATASALSSA LSLTAQMMRP RPERPDMGGL LIAQRQRTGT VRLSRAPRIT FGFRGIPPGR RRIDLWLMRQ LDDLVGGIWP LMSGTGGRAR AVARSSTSRT SRLVLGSAGA RVLVVGTGGL GGPGVDRAGE VGAVVARAVA AVGGVFVERC GLAEESLRTV VDPATPLDLG EALAAAVEQA TEALLVYYAG IVLVDADGMI HLATAVSDSR PHRLAYTALP LSSVLATMAT SRAATRLIAV DGLWVRTTVA NSTVPPVPPR VPGLREDVPG PREDAGGGVN AAGDGVELLV LTSPLAAHTG PAADDGPPLS AGLVRLLVNG DPDGPVELSV GQVTRRLAQR LTAAGNAVLH SAAAWSDGVI LAGNAAHTPP IADLEPAPTT VLTDGPGPVV TPTPRQPAGL PSAPDGGGTT GEAACPYPGL ASFDAGTQRW FFGRERATAE ALSRLEERLD GAGPLVLVGA SGSGKSSLLG AGLLPALARG AVAGARGWPQ VLMTPTDHPA RELARTVAQA AGLSVLAAAR LDVVSQPARF VGALSDLLAL GPAAAGGGTT PPTRRVVIVV DQFEETFTLC ADEEERGAFI RALVAACRGD GATRPPAVVV IGVRADFYGW CAAYPELVDA LQTGQVVVGP MTAAEVRDAI VKPAEAVGLT VEPGLVDLML HDIGADEHGV VADPGSLPLL AHALRATWRE RENGTLTVAG YLRAGGLRGA IARTAEETFG TLDAAGQDAA WQILLQMIQF GDGTDDTRRR VSRTELLSTS ASAQNSASAQ NSASAQNSAS AQNSASAQNV SAVLDALVAA RLVTVDADTV EISHEALLRS WPRLREWIEV DRAAAVTRQR LTDAAATWDD GGRDPSYLFA GSRLATVREW AQAGQRQTAL SPVARDFLAA SLAAEQAARR AEVRRTRRLH QLVAALTVLT LIAGGAMAFA FRQRAQIAGE RDRALSVAIG RQADQLRRSD PLAAAQLGAV AYRLAPTAEA RSAVLSTFAS DYGFATRHLG TQTRPIGAVA YSRDGRLVAT ASDDWTVALW SASDPTRRTP LSTIRGHRLA VKSVAFSPDG RFVATGSDDR TVRLWTITDP THPVLAATLP GASDSIHGLA FRSDSRVLAT GGYGSQVRLW DVSDVSAIRP AGAITAHTAN VRAVAFSPDG TELLTGGDDG RALLWDVRDL AAPAELSELK GEAGPNVVLS ANTAVRAVAI SPDGKTAVTG SDDTTARVFD LTDPRRPTTL QVIDEVRSVE AVAVNVDGLV AVGVNSAVEI YDPSRHFDAI AVLPHASKPW ALAFSPDGRT LASGADDRTL RLWDVPGPVL VGHHRFLWST AVHPNGRVLA TGDYGQTVRL WDAHDPDRPL PAATLTGFNA AVGSVRFTPN GRVLIAGSLD YATDFDGMVT LWDVADIHRP RLLSTIHPGI GGIKDLEVSP DGRTLALAGT SARLALVDIS APATPRGRSV LAGHRYDVAT VSFSPDGRTL ASGSSDRTIR LWDITTPDRP AGISTITGFT SALITIAFSR DGRTLAVGSF DTKVALWDVT DRRAPRALPS IIGLGAGVNS IEFSSDGRLM AAGVGGANGV VRLWDMSDPA DPVPYATFSG RYNGFAAVAF APDTAFIVGA PYQVIGFVWD LDPASVVTRL CARAGDPLTR EEWKQYLPGL DYDPPCG
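The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The sketch below shows one way to strip the whitespace, compute the GC fraction for comparison with the reported 73%, and check the first codon. It is a hedged illustration, not the database's own method. The helper names are hypothetical, and the start-codon set lists only three commonly used bacterial start codons (ATG, GTG, TTG), not the full set permitted by translation table 11.

```python
# Commonly used bacterial start codons; this is an illustrative subset,
# not the complete list of initiators in translation table 11.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def has_common_start_codon(seq: str) -> bool:
    """True if the cleaned sequence begins with ATG, GTG, or TTG."""
    return seq[:3] in COMMON_START_CODONS


if __name__ == "__main__":
    # First three blocks of the gene sequence in the record, used as a short demo.
    demo = clean_sequence("GTGGCGACCG CATCTGCGCT CAGCTCCGCG")
    print(len(demo), has_common_start_codon(demo))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value to compare against the GC content field. The record's gene sequence begins with GTG while the protein sequence begins with M, which is consistent with GTG acting as an initiator codon.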