Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5421 |
Symbol | |
ID | 5673752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6548449 |
End bp | 6553194 |
Gene Length | 4746 bp |
Protein Length | 1581 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244276 |
Product | putative type II restriction enzyme, methylase subunit |
Protein accession | YP_001509682 |
Protein GI | 158317174 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.689282 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAGCT TCGACGCGGT GGTCGTCGGC GAGTCGTGGA TCTCCGAGCA CTACCTGACC TCCGACGCCC GCACCGGCAG CTTCCTCGCC GAGGTGCTGG CGCTGCGCGT CCGGTGGAAG GCCGACGAGG ACGACGGCCA TCCCACCGCG CGCTCGGCGC TGCGCGACGC GGCCACCCCG CTGGCGCGGC TGTTCGGCGG GCTGGGTGAG CGGCCGTCCG ACGAGGACGT CCGCGAGGTA CACCGTGAGG TGCGCCGGGC GTTGCGGCTC GACCGGGAGC CGACGTCCTG GGTATCGGAC CGGTCCGGGG ACGAGGTGCG GCTGCCCGCC GCGGTGGTGC ACCCCTCCCC CACCGGGACG GCGCTGCTCA TCCTGGAAGC CACCCCCGCC GGGGCGGTCG AGGACATCCT GACCACGCCG GACGCGGCCG GGCCGCGGGT CGGGCAGCTC CTCGACCCGG CGGTCGTCGA CGGCAAGCCG CAGGCGGCCC TCGCGAAGGT CGTCTCCAGT GTGTTCCTGA CCGACGACGC CCCGCCGTTC GTGCTGGTAC AGGCCGGCAG ATGGGTGCTA CTCGCCGAGC GCGGCCGGTG GCCGGAGGGG CGCTGGCTCG CCGTCGACCT GGGTGTCGTC GTCGACCGGC GGGACGTCCG GGCGGCCGGC GAGCTGGAGC ACGTCGCCGC CATGGCCGGG CCGGACCTGC TGCTGCCGGC CGAGGACGGC GTGGCCCCGT GGCTCGGCCT GCTGGAGAAG TCGGTACAGC ACACCGTCGG CGTCTCCGCG GACCTGCGCG AGGGTGTCCG ACTGTCGGTG GAGATCATCG CGAACGAGGT GGTGGCGCGG CGGCGGGCGG CGGGGCTGGA CGTCCTCGAC GTCCCGAACC TCGGCCGGGA CCTGACGCGG CAGTCGCTGC GGTTCCTGTA CCGGATCCTG TTCCTGCTGT ACGCGGAGGC GTCGCCGCAG CTGGAGGTGC TGCCCGTCCA GGCGCCCGAA TACCAGGCCG GCTACGGGCT GGACCGGCTG CGCGACCTGA TCCTCACCGA CCTGTCGTCG GATCGTGCCC GCCGCGGCCG GCACCTGTAC GAGTCGCTGC ACCGGCTGTT CGTCCTAGTC GACAAGGGCC ACACCACGGG CACGGGCACG GGCACGGGCA CGAACGCGGA CGCCGACACG GGCACGGGCA CGGGCGCCGA GTCGGTGGCG CCGGGCGGGG ACGGGCTGGT GTTCCAACCG CTGCGCTCCG ACCTGTTCGC GGCGGACGCC ACGGCGCTCA TCGACGAGGT CGGCCTCGGC AACGAGGCGC TGCAGCGGGT GCTGCGCCAC CTGCTGCTGA CGAAGGTGAA GAAGGGCAGC GACCGCGGGT TCATCTCCTA CGGCGAGCTC GGCATCAACC AGCTCGGCGC CGTCTACGAG GGGCTGATGT CCTGGTCGGG GATCATCGCC GACACCGACC TGTACGAGGT CGCGAAGGGC GGCGACCCGT CCGGCGGAAC GTGGCTGCTG CCCGCCGAGC GCGTCAACGA GGTGCCCGCC GACTCATTCG TCCTCGACCG GGACGAGAAC ACCGGCGAGC CGAAGCCGAG GCTGCATCCG CGCGGGTCGT TCGTGTACCG GCTCGCCGGG CGGGAGCGGC AGCAGTCGGC GTCGTACTAC ACGCCGGAGG TCCTCACCCG CTCGGTCGTG CACCACGCCA TCGAGGAACT GCTCGACCAG GACGGGACGA AGACCCGCGC CGCGGACATC CTGGCGATGA CCATCTGCGA GCCGGCGCTC GGGTCGGGCG CGTTCGCGAT CGAGGCCGTC CGCCAGCTCG CCGCCGCCTA CCTGAGCCGG GCGCAGGACG AGCGTGGCGA ACGCATCCCC GCCGAGCAGT ACCCGGCCGA ACTGCAACGG GTGAAGGCAT ATCTGGCGTT GCACCAGGTG TACGGGGTGG ACCTGAACGC CACCGCCGTC GAGCTCGCCG AGGTCTCCCT GTGGCTCGAC ACGATGCAGG CCGGCCTCGC CGCACCCTGG TTCGGCCTGC ACCTGCGCCG CGGCAACAGC CTGATCGGCG CCCGCCGCGC CGTCTACGCA CCGGCACTGC TGAAGAAGAA GGAATGGCTG ACAACCGTTC CGACGGACGT CGGCCTGCAC AACCCGGACG CGCCGGGCTC GACGCCCCCG GTCGGCACCG GCATTCACCA CTTCCTGCTG CCGGCGGCTG GATGGGGCGC GGTCGTCGAC ACCGCGGAGG CGAAAACGTA TGCACCTGAG TCCCGGGAGA GGCTGCGGGC CTGGCGGGCC GGGGTGCTCC GTACCCCGAG CAAGGAGCAG CAGAACCGGC TGGCGCGGCT CGCCCTACGC GTCGAGGTGC TGTGGGAACT CGCGCGGCGC CGCCTGGAAA TCGCCGAGGC CGGCATCCGT CGCGCCACAT CGGTGTGGGG CGCAGACGGT TCCGCGGAAC CGCCCCGCGA GCCGGTGACC CGCGAGCAGG TCGAAGCCGT CCTGCACAAC CCGAACAGCG CCTACCGACG GCTGCGCCGC GTGATGGACG CCTGGTGCGC CCTCTGGTCC TGGCCACTGA CCACCGACGT CCCGCCCCCC GATCTGGACG CCTGGATCGG CGGCCTGGAA GCCCTGCTCG GCAAGACCAC GAAGCCCGGC CGGAAGGAAA GCGACGGCCA GACCGGCTTC GCCGACGACC TCTCCTGGCG AGGACTCGAC GACGCCGAAG ACCTCGACCT CGGCTTCGCC GACGCGATGC CCGTCAAGGA GGCACTGAGC CGCATCTCCT GGCTCGGGGT GGCCGCCGAG ATCGCCGACC GCCAGGGCTT CTTCCACTGG GAACTCGACT TCCCCCAGGT CTTCACCCGC GGCGGCTTCG ACCTCCAGGT AGGAAACCCC CCCTGGGTCC GCCCCGACTG GGACGAAGCC GGCGTTCTTG CAGAGTCCGA CCCATGGTGG ACCCTAGTCC CTAACATTGC CGAGGAGCTG AAGCTTGAAC GCCGGAAACA CACCTTTCGC GTTCCCTCAG CGGAGACTTG GTACCTCGAC CAGCGAGCCG AACAAGCTGG CCTCACCGCC CACCTCGGAA GTTCGGTCGA ACGACCCCTG CTACACGGCC TTCAGCCGGA TCTGTACCGC TGCTTCATGG ACGGTTCCTG GCGTTTGACA CATCCGGACG GGATCATCTC TCTGATCCAC CCGGGATCTC ATTTCACGGA ATTGCGAGCC GCAAAATTCC GCAGAGAAGC ATACCACCGA CTTCGCCGCC ACTGGCACTA CCGAAATGAA AGAAAACTAT TCGAAGAGGT GCATCACGCA AAAGAGTTCG GGGTGCACAT CTACGGGAAA TCGCAACCCG CAATAGAACT CATGCAAGCG ACATGGCTAT ACGAGCCATT CATTCCAGAG CGATCCCTAT TGCATGATGG ATCGGGAAGC GAACCTGGAA TCAAGGACGA TCAAGGACAT TGGGACACCC GGCCACACCG TGCACGCATA ACATCTGTCA ACAAAGAAAT CCTTACCGAG TGGGCAACAC TACTAGATAA ACCTGGTACG CCTCACGACG AGGCAAGGAT GCTATTTCCG ATCAATGCCT CCAGTGCGAC CGTACTGGCC AAGCTCTCCA AAGCTCCGCG ACTAGGTTCT CTCGATATCA ACTGGACGGC CGGCTGGCAT GAATCAGCGG ATCGCCGGAG AGGGTGGTTC GAACGCCGCC CCGCAACGCC TTCAGCATGG TCAGACGTGA TTCTACAAGG CACTCATATT ACCGTTGCAG CACCCTTGTT TCAGCAGGCT ACCGAGACGT CTAAGAACCG CCAGGATACC GAACCGATAG ACTTGGACGA GCTCCCGGAG AGCTTCATCC CTAGGACTAT CTATCAACGC TCCCGACTCG CCGACGAATA CCACACTGCA TACCCACATT GGGACGGTTC TCCTAGTACT AAAAAATTTC GTATAGCCTG GCGAGAAATG GCTGATCCAG GTAACGCGCG AAGCCTCCAG ACTGCACTGA TACCGCCGGG ACCGGCTCAC ATTCACGCCG TAAATACCTT GTCGACCTCA GATCCGATGA ACCTTGTCAT TGCCGCCGGA ACATCCGCTT CCCTCGTTGC CGACTTCCTT GTCAAGAACG GCAATTCCAG CCACATATCA CTCTCGGTAA TGACAAAACT TCCACACGTA AAAAGACACG AACTGCAGCC CGAACTAATG GTGCGAACGT TGCGACTCAA CTGTCTGGTA CGACCGTACG CATCATTGTG GGAGGAGCTG TACTCGCCCA CCTGGCGGGA CGATCGGTGG GTTCCGGGGG TTGGGGTGGC CTATCGCGAG CGGCCGGACA TTGGAATGAT CACCGCAGGG TGGGAGTGGG GCACGCCACT GCGGCGGGCG GCGGATAGGC GGCAGGCGCT TGTCGAGATC GACGCGATCG TGGCGGTGAT GCTCGGGCTT ACCGCTGACG AGCTGGTCAC CGTCTACCGG ACGCAGTTTC CCGTCCTGCA GGATTACGAG CGCAAGGCGC GGTACGACTC GTTCGGGCGG CAGTTGCCGG TCGATCTCGT GAAGCAGCTC GACAAGGCGA CCGCGGCCGG GGAGCGGGTC CATACGCTCA ACGGGCCCGA TCGCACCTAC GTCGGGCCGT TCAGCGGGGT GAACCGGGAG GCCGACCTGC GGATCGCGCA CGAGCACTTC AGCCGGCTGA CCGCCGAGCG GGAAGCGAAG AAGGCGCAGG TCCAGGCAGA GGAGCAGCTG CCCTGA
|
Protein sequence | MPSFDAVVVG ESWISEHYLT SDARTGSFLA EVLALRVRWK ADEDDGHPTA RSALRDAATP LARLFGGLGE RPSDEDVREV HREVRRALRL DREPTSWVSD RSGDEVRLPA AVVHPSPTGT ALLILEATPA GAVEDILTTP DAAGPRVGQL LDPAVVDGKP QAALAKVVSS VFLTDDAPPF VLVQAGRWVL LAERGRWPEG RWLAVDLGVV VDRRDVRAAG ELEHVAAMAG PDLLLPAEDG VAPWLGLLEK SVQHTVGVSA DLREGVRLSV EIIANEVVAR RRAAGLDVLD VPNLGRDLTR QSLRFLYRIL FLLYAEASPQ LEVLPVQAPE YQAGYGLDRL RDLILTDLSS DRARRGRHLY ESLHRLFVLV DKGHTTGTGT GTGTNADADT GTGTGAESVA PGGDGLVFQP LRSDLFAADA TALIDEVGLG NEALQRVLRH LLLTKVKKGS DRGFISYGEL GINQLGAVYE GLMSWSGIIA DTDLYEVAKG GDPSGGTWLL PAERVNEVPA DSFVLDRDEN TGEPKPRLHP RGSFVYRLAG RERQQSASYY TPEVLTRSVV HHAIEELLDQ DGTKTRAADI LAMTICEPAL GSGAFAIEAV RQLAAAYLSR AQDERGERIP AEQYPAELQR VKAYLALHQV YGVDLNATAV ELAEVSLWLD TMQAGLAAPW FGLHLRRGNS LIGARRAVYA PALLKKKEWL TTVPTDVGLH NPDAPGSTPP VGTGIHHFLL PAAGWGAVVD TAEAKTYAPE SRERLRAWRA GVLRTPSKEQ QNRLARLALR VEVLWELARR RLEIAEAGIR RATSVWGADG SAEPPREPVT REQVEAVLHN PNSAYRRLRR VMDAWCALWS WPLTTDVPPP DLDAWIGGLE ALLGKTTKPG RKESDGQTGF ADDLSWRGLD DAEDLDLGFA DAMPVKEALS RISWLGVAAE IADRQGFFHW ELDFPQVFTR GGFDLQVGNP PWVRPDWDEA GVLAESDPWW TLVPNIAEEL KLERRKHTFR VPSAETWYLD QRAEQAGLTA HLGSSVERPL LHGLQPDLYR CFMDGSWRLT HPDGIISLIH PGSHFTELRA AKFRREAYHR LRRHWHYRNE RKLFEEVHHA KEFGVHIYGK SQPAIELMQA TWLYEPFIPE RSLLHDGSGS EPGIKDDQGH WDTRPHRARI TSVNKEILTE WATLLDKPGT PHDEARMLFP INASSATVLA KLSKAPRLGS LDINWTAGWH ESADRRRGWF ERRPATPSAW SDVILQGTHI TVAAPLFQQA TETSKNRQDT EPIDLDELPE SFIPRTIYQR SRLADEYHTA YPHWDGSPST KKFRIAWREM ADPGNARSLQ TALIPPGPAH IHAVNTLSTS DPMNLVIAAG TSASLVADFL VKNGNSSHIS LSVMTKLPHV KRHELQPELM VRTLRLNCLV RPYASLWEEL YSPTWRDDRW VPGVGVAYRE RPDIGMITAG WEWGTPLRRA ADRRQALVEI DAIVAVMLGL TADELVTVYR TQFPVLQDYE RKARYDSFGR QLPVDLVKQL DKATAAGERV HTLNGPDRTY VGPFSGVNRE ADLRIAHEHF SRLTAEREAK KAQVQAEEQL P
|
| |