Gene Franean1_5421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5421 
Symbol 
ID5673752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6548449 
End bp6553194 
Gene Length4746 bp 
Protein Length1581 aa 
Translation table11 
GC content67% 
IMG OID641244276 
Productputative type II restriction enzyme, methylase subunit 
Protein accessionYP_001509682 
Protein GI158317174 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.689282 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGCT TCGACGCGGT GGTCGTCGGC GAGTCGTGGA TCTCCGAGCA CTACCTGACC 
TCCGACGCCC GCACCGGCAG CTTCCTCGCC GAGGTGCTGG CGCTGCGCGT CCGGTGGAAG
GCCGACGAGG ACGACGGCCA TCCCACCGCG CGCTCGGCGC TGCGCGACGC GGCCACCCCG
CTGGCGCGGC TGTTCGGCGG GCTGGGTGAG CGGCCGTCCG ACGAGGACGT CCGCGAGGTA
CACCGTGAGG TGCGCCGGGC GTTGCGGCTC GACCGGGAGC CGACGTCCTG GGTATCGGAC
CGGTCCGGGG ACGAGGTGCG GCTGCCCGCC GCGGTGGTGC ACCCCTCCCC CACCGGGACG
GCGCTGCTCA TCCTGGAAGC CACCCCCGCC GGGGCGGTCG AGGACATCCT GACCACGCCG
GACGCGGCCG GGCCGCGGGT CGGGCAGCTC CTCGACCCGG CGGTCGTCGA CGGCAAGCCG
CAGGCGGCCC TCGCGAAGGT CGTCTCCAGT GTGTTCCTGA CCGACGACGC CCCGCCGTTC
GTGCTGGTAC AGGCCGGCAG ATGGGTGCTA CTCGCCGAGC GCGGCCGGTG GCCGGAGGGG
CGCTGGCTCG CCGTCGACCT GGGTGTCGTC GTCGACCGGC GGGACGTCCG GGCGGCCGGC
GAGCTGGAGC ACGTCGCCGC CATGGCCGGG CCGGACCTGC TGCTGCCGGC CGAGGACGGC
GTGGCCCCGT GGCTCGGCCT GCTGGAGAAG TCGGTACAGC ACACCGTCGG CGTCTCCGCG
GACCTGCGCG AGGGTGTCCG ACTGTCGGTG GAGATCATCG CGAACGAGGT GGTGGCGCGG
CGGCGGGCGG CGGGGCTGGA CGTCCTCGAC GTCCCGAACC TCGGCCGGGA CCTGACGCGG
CAGTCGCTGC GGTTCCTGTA CCGGATCCTG TTCCTGCTGT ACGCGGAGGC GTCGCCGCAG
CTGGAGGTGC TGCCCGTCCA GGCGCCCGAA TACCAGGCCG GCTACGGGCT GGACCGGCTG
CGCGACCTGA TCCTCACCGA CCTGTCGTCG GATCGTGCCC GCCGCGGCCG GCACCTGTAC
GAGTCGCTGC ACCGGCTGTT CGTCCTAGTC GACAAGGGCC ACACCACGGG CACGGGCACG
GGCACGGGCA CGAACGCGGA CGCCGACACG GGCACGGGCA CGGGCGCCGA GTCGGTGGCG
CCGGGCGGGG ACGGGCTGGT GTTCCAACCG CTGCGCTCCG ACCTGTTCGC GGCGGACGCC
ACGGCGCTCA TCGACGAGGT CGGCCTCGGC AACGAGGCGC TGCAGCGGGT GCTGCGCCAC
CTGCTGCTGA CGAAGGTGAA GAAGGGCAGC GACCGCGGGT TCATCTCCTA CGGCGAGCTC
GGCATCAACC AGCTCGGCGC CGTCTACGAG GGGCTGATGT CCTGGTCGGG GATCATCGCC
GACACCGACC TGTACGAGGT CGCGAAGGGC GGCGACCCGT CCGGCGGAAC GTGGCTGCTG
CCCGCCGAGC GCGTCAACGA GGTGCCCGCC GACTCATTCG TCCTCGACCG GGACGAGAAC
ACCGGCGAGC CGAAGCCGAG GCTGCATCCG CGCGGGTCGT TCGTGTACCG GCTCGCCGGG
CGGGAGCGGC AGCAGTCGGC GTCGTACTAC ACGCCGGAGG TCCTCACCCG CTCGGTCGTG
CACCACGCCA TCGAGGAACT GCTCGACCAG GACGGGACGA AGACCCGCGC CGCGGACATC
CTGGCGATGA CCATCTGCGA GCCGGCGCTC GGGTCGGGCG CGTTCGCGAT CGAGGCCGTC
CGCCAGCTCG CCGCCGCCTA CCTGAGCCGG GCGCAGGACG AGCGTGGCGA ACGCATCCCC
GCCGAGCAGT ACCCGGCCGA ACTGCAACGG GTGAAGGCAT ATCTGGCGTT GCACCAGGTG
TACGGGGTGG ACCTGAACGC CACCGCCGTC GAGCTCGCCG AGGTCTCCCT GTGGCTCGAC
ACGATGCAGG CCGGCCTCGC CGCACCCTGG TTCGGCCTGC ACCTGCGCCG CGGCAACAGC
CTGATCGGCG CCCGCCGCGC CGTCTACGCA CCGGCACTGC TGAAGAAGAA GGAATGGCTG
ACAACCGTTC CGACGGACGT CGGCCTGCAC AACCCGGACG CGCCGGGCTC GACGCCCCCG
GTCGGCACCG GCATTCACCA CTTCCTGCTG CCGGCGGCTG GATGGGGCGC GGTCGTCGAC
ACCGCGGAGG CGAAAACGTA TGCACCTGAG TCCCGGGAGA GGCTGCGGGC CTGGCGGGCC
GGGGTGCTCC GTACCCCGAG CAAGGAGCAG CAGAACCGGC TGGCGCGGCT CGCCCTACGC
GTCGAGGTGC TGTGGGAACT CGCGCGGCGC CGCCTGGAAA TCGCCGAGGC CGGCATCCGT
CGCGCCACAT CGGTGTGGGG CGCAGACGGT TCCGCGGAAC CGCCCCGCGA GCCGGTGACC
CGCGAGCAGG TCGAAGCCGT CCTGCACAAC CCGAACAGCG CCTACCGACG GCTGCGCCGC
GTGATGGACG CCTGGTGCGC CCTCTGGTCC TGGCCACTGA CCACCGACGT CCCGCCCCCC
GATCTGGACG CCTGGATCGG CGGCCTGGAA GCCCTGCTCG GCAAGACCAC GAAGCCCGGC
CGGAAGGAAA GCGACGGCCA GACCGGCTTC GCCGACGACC TCTCCTGGCG AGGACTCGAC
GACGCCGAAG ACCTCGACCT CGGCTTCGCC GACGCGATGC CCGTCAAGGA GGCACTGAGC
CGCATCTCCT GGCTCGGGGT GGCCGCCGAG ATCGCCGACC GCCAGGGCTT CTTCCACTGG
GAACTCGACT TCCCCCAGGT CTTCACCCGC GGCGGCTTCG ACCTCCAGGT AGGAAACCCC
CCCTGGGTCC GCCCCGACTG GGACGAAGCC GGCGTTCTTG CAGAGTCCGA CCCATGGTGG
ACCCTAGTCC CTAACATTGC CGAGGAGCTG AAGCTTGAAC GCCGGAAACA CACCTTTCGC
GTTCCCTCAG CGGAGACTTG GTACCTCGAC CAGCGAGCCG AACAAGCTGG CCTCACCGCC
CACCTCGGAA GTTCGGTCGA ACGACCCCTG CTACACGGCC TTCAGCCGGA TCTGTACCGC
TGCTTCATGG ACGGTTCCTG GCGTTTGACA CATCCGGACG GGATCATCTC TCTGATCCAC
CCGGGATCTC ATTTCACGGA ATTGCGAGCC GCAAAATTCC GCAGAGAAGC ATACCACCGA
CTTCGCCGCC ACTGGCACTA CCGAAATGAA AGAAAACTAT TCGAAGAGGT GCATCACGCA
AAAGAGTTCG GGGTGCACAT CTACGGGAAA TCGCAACCCG CAATAGAACT CATGCAAGCG
ACATGGCTAT ACGAGCCATT CATTCCAGAG CGATCCCTAT TGCATGATGG ATCGGGAAGC
GAACCTGGAA TCAAGGACGA TCAAGGACAT TGGGACACCC GGCCACACCG TGCACGCATA
ACATCTGTCA ACAAAGAAAT CCTTACCGAG TGGGCAACAC TACTAGATAA ACCTGGTACG
CCTCACGACG AGGCAAGGAT GCTATTTCCG ATCAATGCCT CCAGTGCGAC CGTACTGGCC
AAGCTCTCCA AAGCTCCGCG ACTAGGTTCT CTCGATATCA ACTGGACGGC CGGCTGGCAT
GAATCAGCGG ATCGCCGGAG AGGGTGGTTC GAACGCCGCC CCGCAACGCC TTCAGCATGG
TCAGACGTGA TTCTACAAGG CACTCATATT ACCGTTGCAG CACCCTTGTT TCAGCAGGCT
ACCGAGACGT CTAAGAACCG CCAGGATACC GAACCGATAG ACTTGGACGA GCTCCCGGAG
AGCTTCATCC CTAGGACTAT CTATCAACGC TCCCGACTCG CCGACGAATA CCACACTGCA
TACCCACATT GGGACGGTTC TCCTAGTACT AAAAAATTTC GTATAGCCTG GCGAGAAATG
GCTGATCCAG GTAACGCGCG AAGCCTCCAG ACTGCACTGA TACCGCCGGG ACCGGCTCAC
ATTCACGCCG TAAATACCTT GTCGACCTCA GATCCGATGA ACCTTGTCAT TGCCGCCGGA
ACATCCGCTT CCCTCGTTGC CGACTTCCTT GTCAAGAACG GCAATTCCAG CCACATATCA
CTCTCGGTAA TGACAAAACT TCCACACGTA AAAAGACACG AACTGCAGCC CGAACTAATG
GTGCGAACGT TGCGACTCAA CTGTCTGGTA CGACCGTACG CATCATTGTG GGAGGAGCTG
TACTCGCCCA CCTGGCGGGA CGATCGGTGG GTTCCGGGGG TTGGGGTGGC CTATCGCGAG
CGGCCGGACA TTGGAATGAT CACCGCAGGG TGGGAGTGGG GCACGCCACT GCGGCGGGCG
GCGGATAGGC GGCAGGCGCT TGTCGAGATC GACGCGATCG TGGCGGTGAT GCTCGGGCTT
ACCGCTGACG AGCTGGTCAC CGTCTACCGG ACGCAGTTTC CCGTCCTGCA GGATTACGAG
CGCAAGGCGC GGTACGACTC GTTCGGGCGG CAGTTGCCGG TCGATCTCGT GAAGCAGCTC
GACAAGGCGA CCGCGGCCGG GGAGCGGGTC CATACGCTCA ACGGGCCCGA TCGCACCTAC
GTCGGGCCGT TCAGCGGGGT GAACCGGGAG GCCGACCTGC GGATCGCGCA CGAGCACTTC
AGCCGGCTGA CCGCCGAGCG GGAAGCGAAG AAGGCGCAGG TCCAGGCAGA GGAGCAGCTG
CCCTGA
 
Protein sequence
MPSFDAVVVG ESWISEHYLT SDARTGSFLA EVLALRVRWK ADEDDGHPTA RSALRDAATP 
LARLFGGLGE RPSDEDVREV HREVRRALRL DREPTSWVSD RSGDEVRLPA AVVHPSPTGT
ALLILEATPA GAVEDILTTP DAAGPRVGQL LDPAVVDGKP QAALAKVVSS VFLTDDAPPF
VLVQAGRWVL LAERGRWPEG RWLAVDLGVV VDRRDVRAAG ELEHVAAMAG PDLLLPAEDG
VAPWLGLLEK SVQHTVGVSA DLREGVRLSV EIIANEVVAR RRAAGLDVLD VPNLGRDLTR
QSLRFLYRIL FLLYAEASPQ LEVLPVQAPE YQAGYGLDRL RDLILTDLSS DRARRGRHLY
ESLHRLFVLV DKGHTTGTGT GTGTNADADT GTGTGAESVA PGGDGLVFQP LRSDLFAADA
TALIDEVGLG NEALQRVLRH LLLTKVKKGS DRGFISYGEL GINQLGAVYE GLMSWSGIIA
DTDLYEVAKG GDPSGGTWLL PAERVNEVPA DSFVLDRDEN TGEPKPRLHP RGSFVYRLAG
RERQQSASYY TPEVLTRSVV HHAIEELLDQ DGTKTRAADI LAMTICEPAL GSGAFAIEAV
RQLAAAYLSR AQDERGERIP AEQYPAELQR VKAYLALHQV YGVDLNATAV ELAEVSLWLD
TMQAGLAAPW FGLHLRRGNS LIGARRAVYA PALLKKKEWL TTVPTDVGLH NPDAPGSTPP
VGTGIHHFLL PAAGWGAVVD TAEAKTYAPE SRERLRAWRA GVLRTPSKEQ QNRLARLALR
VEVLWELARR RLEIAEAGIR RATSVWGADG SAEPPREPVT REQVEAVLHN PNSAYRRLRR
VMDAWCALWS WPLTTDVPPP DLDAWIGGLE ALLGKTTKPG RKESDGQTGF ADDLSWRGLD
DAEDLDLGFA DAMPVKEALS RISWLGVAAE IADRQGFFHW ELDFPQVFTR GGFDLQVGNP
PWVRPDWDEA GVLAESDPWW TLVPNIAEEL KLERRKHTFR VPSAETWYLD QRAEQAGLTA
HLGSSVERPL LHGLQPDLYR CFMDGSWRLT HPDGIISLIH PGSHFTELRA AKFRREAYHR
LRRHWHYRNE RKLFEEVHHA KEFGVHIYGK SQPAIELMQA TWLYEPFIPE RSLLHDGSGS
EPGIKDDQGH WDTRPHRARI TSVNKEILTE WATLLDKPGT PHDEARMLFP INASSATVLA
KLSKAPRLGS LDINWTAGWH ESADRRRGWF ERRPATPSAW SDVILQGTHI TVAAPLFQQA
TETSKNRQDT EPIDLDELPE SFIPRTIYQR SRLADEYHTA YPHWDGSPST KKFRIAWREM
ADPGNARSLQ TALIPPGPAH IHAVNTLSTS DPMNLVIAAG TSASLVADFL VKNGNSSHIS
LSVMTKLPHV KRHELQPELM VRTLRLNCLV RPYASLWEEL YSPTWRDDRW VPGVGVAYRE
RPDIGMITAG WEWGTPLRRA ADRRQALVEI DAIVAVMLGL TADELVTVYR TQFPVLQDYE
RKARYDSFGR QLPVDLVKQL DKATAAGERV HTLNGPDRTY VGPFSGVNRE ADLRIAHEHF
SRLTAEREAK KAQVQAEEQL P