Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0194 |
Symbol | |
ID | 3903221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 228556 |
End bp | 231681 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637877525 |
Product | hypothetical protein |
Protein accession | YP_479314 |
Protein GI | 86738914 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.394359 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACT ACCACGAGGT CGTCTTCGAG TCCGAGATCT GCGCGTACCT GGAGGCTCAT GGGTGGCTGT ACTCGGCGGG CGACTCCGGG TATGACCGTG AGCGGGCGCT CTTCCCCGCG GATGTGTTCG GCTGGTTGGA GGAGACTCAG CCGGCGGCGT ACGGGAAGGC GTTGAAGGCG GCCGGGTCGG CGGCGAAGTT CCTCGATGTG CTGGCCACGG CGCTCGACAG GCCGCTGGAG CACGGCGGCG GGACGTTGAA CATCCTGCGC AACGGGGTCG TCTACATCGG TGGTGGCCGG TTGAAGCTGG CGCAGTTCCG GCCGGAGACC AGCCTGAACG CGACGACGGT GGCGCAGTAT GCGGCGATGC GGGTGCGGGT GATGCGGCAG GTGCGCTTCT CCACCGCCGA TCAGCGCAGC ATCGACCTGG TGTTCTTCGT CAACGGGCTG CCGGTGGCGA CGGTGGAGCT GAAGACGGAC TTCACGCAGT CGCTGGACGA GGCGATCAGC CAGTACCGCA AGGACCGGCG CCCGGTCACC AACGGCCGGG CGGAGCCGTT GCTGTCGTTC GGGCATCGGG CGCTGGTGCA CTTCGCGGTC TCCAACGACC TGGCGGCGAT GACCACCAGG TTGGAGGGGG AGAAGACGCA CTTTTTGCCG TTCAACATCG GCCACGACGG CGGGGCGGGG AACCCGCCAG GTGCCGAGGG GCGGTCGGCG ACGGCGTACC TGTGGGAGCG GGTCTGGGAG AAGGACGCCT GGCTCACCAT CGTCGGGCGG CTGATGATCG TGGAGACCCG GGAGGAGTGG GACGTCGCGA CGGGGACGTC GGTGCGACGT ACCAGCATGC TCTTCCCGCG GTTCCACCAG TGGGAGGCCG TGACGACCAT CGTCGACGCC GTACGCGAGG AGGGCGTCGG CCACCGGTAC CTGATCGAGC ACTCGGCCGG GTCGGGGAAG ACGAACACGA TCGCCTGGAC CGCACACCGG CTGGCGCGGC TGCACGTGGA CGACGAGAAG GTCTTCGACA CGGTCCTCGT GGTCGTGGAC CGGACGGTGC TTGACGGGCA GCTTCAGGAG GCGATCCGGC AGATCGACGG GTCCGGCAGG ATCGTGGCCA CGATCAGCCC GGAGGACGTC CGCAAGGCCG GCGCGACGTC GAAGTCCGGG CTGCTGGCCG CCGCGCTGCG GAACGGCGAG CTGATCATCG CGGTGACGGT GCAGACGTTC CCGTTCGCGA TGGACGAGAT CCAGGCGGAC AAGGGGCTGA CGGGCAGGAA GTTCGCCGTG ATCGCCGACG AGGCGCACTC GTCGCAGTCC GGCAGGATCT CCTCCCAGCT GAAAGCCGTG CTCACCGCCG AGGAGATCAA GGACCTCGCC GACGGCGGCG AGGTCGCCCT GGAATCGCTT CTGGCGGCGC GGATGAGCGA GCGGGCCGAC TCGCCGAACA TCTCCTACTT CGCGTTCACC GCGACCCCGA AGGCCAAGAC GCTGGAGATG TTCGGGCGGA AGGGGCCGGA CGGGAAGCCG GTCGAGTTCC ACCTGTACTC GATGCGGCAG GCGATCGAGG AGGGGTACAT CCTCGACGTT CTGCGTGGCT ATCAGTCGTA CGACACCGCG CTGAAGATCG CCGGTAAGGC CACCGCCGAC AGCGAAGTCG AGGAAAGCGC CGCGCGTAAG GGGCTGATGC GGTGGGTGAA GCTGCACCCG ACCAACATCA GCCAGAAAGT CCAGATCATC GTCGAGCACT TCCACGCCAA CGTCGCCCAC CTGCTGGAGG GCAGGGCGAA GGCGATGGTC GTCACCGACT CGCGCAAGGC CGCGGTGAAG TACAAGAAGG CGATCGACGC CTACATCGCT CGCCGGGTCG CGGAGGATCC GTCGTACACC TACCGCACGC TGGTCGCCTT CTCCGGGTCG GTGACGATGG ACGAGAACGA GGTGTGGACC TCGGAGTGGG GGCCGGTGCC GGCGGAGGAC GTCGAGTTCA CCGAGACCAA CCTCAACCCC GGCGCCGGCG CGGACCTGGC CGCGGCGTTC AAGGGCGGGA CCTACACGAT CATGCTGGTG GCCAACAAGT TCCAGACCGG CTTCGACCAG CCGCTGCTCT CGGCGATGTA CGTCGACAAG AAGCTCTCCG GGGTCACCGC CGTGCAGACG CTCTCCCGGC TCAACCGCAC CCACCGCACC GCGGGCGGGG AGATCAAGCG CACGACGTTC GTCCTCGACT TCGTGAACAC GCCCGACGAC ATCCGGGCCG CGTTCGAGCC GTACTTCACC GGTGCGACCC TGGAGACCGA GACCGACCCG TACGTCGTCG CCCACCTCGC CGCCAAGCTC GCCCAGACGG GGATCTACAC CGCCGACCAG GTACGGAACG TCGCCGAGTT GTGGGTGAAG CGGAAGGGTA ACAACGCGCT CTCGGCCGCG ATCGCTCCGG CGAAGCACGA GTTCGCGAGC CGCTACGCCG CCGCGATCGA GGCCGATGAC AAGGTCACGC TCAGCACCCT CGACCTGTTC CGTCAGGACG TCTCCACCTA CGTCCGGCTC TACGACTTCA TGAGCCAGAT CGTCGACTAC GGCGACCCGC ACCTGGAGAT GCTCTCCATC TTCCTACGCC TCCTGGAGAA GGTCATCGCC GACTCCTCCT GGGCCGCCGA GGTCGACCTC TCCGACGTCG TCCTGGTCGG GGTCAGACAC GAGAAGCGGA TCGCCGTCGA CATCTCGCTG ACCGGCGACG GCGAGCTCAA GGGAATCAGC GCCGCCGGAA CCGGCGCCCG CAAGGAGCCC AGGTACGTCG CGCTCCAGGT CGTGATCGAC AAGATGAACG ACCTCTTCGG CGCCGAGTCC TTCACCGAGT CGCAGATCCG CGAGTTCGTC GACGGCCTGG TCCAGCGACT CCTCGCCTAC CCCGACCTCG TCAGGCAGAC CCAGGTCAAC TCGAAGAAGC AGTTCATGGA CTCCGACGAC TTCAAGGCCG TCGTCACCGA GGCCGTCCTC GACAACCAGG AAGCCCACAA CACCATGGCC GACTACTTCT TCAGCGACGG CCCTGGGATC AACAGCGTCA TCCTTGCCCT CGCGGACGCC TTCTACGAGG TCGCCACGTC ACAGGAGACC GACTGA
|
Protein sequence | MADYHEVVFE SEICAYLEAH GWLYSAGDSG YDRERALFPA DVFGWLEETQ PAAYGKALKA AGSAAKFLDV LATALDRPLE HGGGTLNILR NGVVYIGGGR LKLAQFRPET SLNATTVAQY AAMRVRVMRQ VRFSTADQRS IDLVFFVNGL PVATVELKTD FTQSLDEAIS QYRKDRRPVT NGRAEPLLSF GHRALVHFAV SNDLAAMTTR LEGEKTHFLP FNIGHDGGAG NPPGAEGRSA TAYLWERVWE KDAWLTIVGR LMIVETREEW DVATGTSVRR TSMLFPRFHQ WEAVTTIVDA VREEGVGHRY LIEHSAGSGK TNTIAWTAHR LARLHVDDEK VFDTVLVVVD RTVLDGQLQE AIRQIDGSGR IVATISPEDV RKAGATSKSG LLAAALRNGE LIIAVTVQTF PFAMDEIQAD KGLTGRKFAV IADEAHSSQS GRISSQLKAV LTAEEIKDLA DGGEVALESL LAARMSERAD SPNISYFAFT ATPKAKTLEM FGRKGPDGKP VEFHLYSMRQ AIEEGYILDV LRGYQSYDTA LKIAGKATAD SEVEESAARK GLMRWVKLHP TNISQKVQII VEHFHANVAH LLEGRAKAMV VTDSRKAAVK YKKAIDAYIA RRVAEDPSYT YRTLVAFSGS VTMDENEVWT SEWGPVPAED VEFTETNLNP GAGADLAAAF KGGTYTIMLV ANKFQTGFDQ PLLSAMYVDK KLSGVTAVQT LSRLNRTHRT AGGEIKRTTF VLDFVNTPDD IRAAFEPYFT GATLETETDP YVVAHLAAKL AQTGIYTADQ VRNVAELWVK RKGNNALSAA IAPAKHEFAS RYAAAIEADD KVTLSTLDLF RQDVSTYVRL YDFMSQIVDY GDPHLEMLSI FLRLLEKVIA DSSWAAEVDL SDVVLVGVRH EKRIAVDISL TGDGELKGIS AAGTGARKEP RYVALQVVID KMNDLFGAES FTESQIREFV DGLVQRLLAY PDLVRQTQVN SKKQFMDSDD FKAVVTEAVL DNQEAHNTMA DYFFSDGPGI NSVILALADA FYEVATSQET D
|
| |