Gene Franean1_3637 details

Gene Information

Locus tag: Franean1_3637
Symbol:
ID: 5672004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4309272
End bp: 4313129
Gene Length: 3858 bp
Protein Length: 1285 aa
Translation table: 11
GC content: 70%
IMG OID: 641242521
Product: TPR repeat-containing protein
Protein accession: YP_001507941
Protein GI: 158315433
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
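
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that adds no residue. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4309272, 4313129)
print(length)                     # 3858
print(protein_length_aa(length))  # 1285
```

Both values agree with the Gene Length and Protein Length fields of the record.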


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGACG CAGAGCTGCT GGTGGATCTT GATTCTGGCA GGCTATCGAC CACGATCGGA 
CTGGGATCAC CGATCGATGC GGCGGAGCTG GAGGACCTGC GCTGGTATCT GGAGGATTAT
CTCCAGACGC CGTTCGGGGT TTACTCTGAC CGCGGTTCCC GGATCGCGGG CCAGCTCGCC
GACTGGGGCC GGGCGCTGTT CGACTCGGTG CTCGAGGCGG TGCGGGTAGG CCGCGAGGAC
GCCGACTTTC CTGTCCGGGG ACTCTCGGAG ATCGTAGTGC GATCGGAAAT TCCCGAACGG
CTCGGGCTGC CGTGGGAACT GATGCGGGCG CCGGGCGCGT CCGTGCCGCT CGTCCTGGAC
GGAATCGGGG TAACCCGATG CCTGGTCGCG AAGTCGCCGG ATGATTCCGT TGACGCGGCC
GGAGACCGGC TCCGCGTGCT CATGGTGATT TCCCGGCCCG CGAGTGTCAG GGATGTCGGC
TATCAGATGA TAGCTCGCCC GCTGCTACGG TCCATGGCGT TGGCACACGG CGAGGTGGAC
CTGGAAGTGC TGCGCCCGCC GACGCTGGAG GCGCTGGCAG CTCGGCTGCG GGGCGCGCGC
GAGGCGGGGG CGCCCTTCCA GGTGGTGCAT TTCGACGGGC ACGGCGCCAC CGACCGAGGC
GCGCTGGCGT TCGAGAGACC GGGTGGTGGC GCGGACTACG TACCGGCCGG GCGCCTCGCG
GGTGTACTCG CGGCGGCGGA CGTCCCGGTC GCGGTGCTCA ACGCATGCCA GTCGGGGGCG
ATCGGGAAGC GGCTGGAGGC GGCGGTCGCG ACTGGTCTGG TGTTGGGTGG CGTCGATGCC
GTCGTGGCCA TGTCCTACCG GGTCTACGCG GTGGCCGCCG CCGAGTTCAT GACGGCGTTC
TACGGTCAGC TGCTTACGGG CGGCACCATA AGCGAGGCAG TGCGCGCGGG CCGTTCGCGG
ATGGCGCAGC ACCCGGGACG GCCCAGCCCC AAGGGCGAAC TGCCCCTGGA GGACTGGGCC
GTCCCGGTCT ACTACCGTCG CCGCGAGGTC CGCTTTCCGC AACTGCGCGC GGTGCCGCCG
GTGTCCGGCG GCGGCCCGGA CCGCGACGAG CTGAAATCGG AGGCGGAGTT CGTCGGCCGC
GACGACCTGT TCTGCACCCT GGAAATGGCC GCGCGGACCG ACCGGGTGCT TGTACTGCAC
GGCCCCGGCG GCACCGGGAA GACGGAGCTC GCCAAGGCGT TCGGACGATG GTGGCGCGAC
ACCGGCGGAG TCGACCGCCC GGACGGCGTC TTCTGGCACT CGTTCGAACC GGGGGTCCCC
TCCCTTGGCC TGGGCGGTGC CATCCGGGAG ACCGGCCTGC GCCTGTACGG TCCCGAGTTC
GCACTAAGAG ATCCCGCTGC CCGCCGTGAC CTGGTCCTGG AGCAGCTGCG CGCGCGCCGG
CTGCTGCTGA TATGGGATAA CTTCGAATCG GTCGCCACGA TGCCCGGCGA CGTCGCCTCG
CCGCTCGACG AGACCGCATG CGTCGAGCTG AAGGACTTCC TGCGCGAGGC GGCTCACGGG
CAAAGCACGA TCCTGGTGAC CAGCCGCACC CCTGAAAGCT GGCTAGGTGA CGTGCGCCGC
CTCGCTGTCG AGGGCATGCT CCCGCACGAG GCCATCGAGT ACGCCGATCA GGTTCTCGCG
CCCTTTCCCG CGGCTGCGGG CCGGCGGGCG GACCGGTCGT TCGGCGAATT GCTGGAATGG
CTAGAGGGGC ACCCGCTGAG CATGCGGCTG ATCCTTCCGC ACCTGGCGAC CACGGAACCA
GCCGCCCTGC TCGCGGGCCT CCATGGCACC CAGCCCCTGC CCATAGCGGA GGAGGGTGCC
GCAGCAGGAG AAAACGCCAG AACGCGGTCC CTGACAGCCA GCGTCGGGTA TTCCGTCGTG
CACCTTTCGC CCGCGGCCCG GCGCCTGCTC GTCGGGCTGA GCCTGCTGCG CGGCGTCGCC
GACGCGGAGA TCCTCGGTGC CTTCTCCGCC CAGGCGGACA CGCCGCAGCG ATTCCGCGGC
GTCAGCGCCG ACGAGTGGGA CATGGTGCTC GACGAGGCCG CTCGCCTGGG CCTGCTGACC
ACCCGCGCCG ACGGCGGTTA CGGGATCCAC CCCGCGCTCC CCGCGTATCT CGCCGCTCAG
TGGCAGGCCG AGGAACCCAT GATCTACCCT GGTGCCCGGG CCGCGGCCGA CAGTGCCCTG
CTCGCCGCCT GGGCCGCGGC CTGCCAGCTC TGGGGCAAGA CGATGGTCGC CGAGGCCAGT
GCGGAAGCGT ACGCGTTCAT CGACCGGAAC CGCCGGACGA TGGGACGCAT GCTCGGCTAC
GCGCTCGACA ACCGCCGCTG GGCCGCGGCG GAGCCGATCT ACATCACGCT GTACCAGTAC
CTGGACCACA GCGGTTTGGC GGAGGAGGCC CGTGGCTGGT CGGACCGCGC CCGGCGCGCG
GTCGAGGGTC CCGGCGGGGC TCTGTCCGCC GGGAAACTGC CCACGCTCCG GCTGGAGGAC
CTGCGAGACC CGCGGGCGCT GTCCCCGGCA AAGCTGGCGG AATACCGGTC GGTTCTTTCC
CTGTGGAGGG CCGTCCTGCC GGCCTTAGCC TTGTGGAACG TGGTCGTGAT GTCGCAGGCG
AACACGCTGA CCCGGGCCGG GGACCTCGAC GCCGCCGAAG CCCGCTACCA GGAGATTCTC
CAGGTGCAGG AGAACACGCC GGGGCTGTCG CCGGCAGCAC GTGCCCTGAC CTACGGCAAG
CTCGGATTCG TCGCAGAAGA GCGAGGGCGG TTCGCTCGGG CGGACGAGAT GTACCGCAGG
GCCCTGGTCA TCCAGGAGGA GGGGGGCGAC CGGCAGGGGA TCGCGCAGTC ACTTCGTCAC
CTGGGCAGCG TGGCGCTGGA TCAAGGACGC TGGGACGAAG CCGCGCAGCA CTACCGGCGG
TCGCTCGCGA TCTTGAAGGA GACGCCCGGA GACCTGCACA GCACCCAGCT CGTCTATGGC
AAGCTCGGCG ACCTGGCGCT GCTTCGCGGC CATCTCGACG AGGCGGAGGA CTGGCACGCC
GAGTCGCTGG CCCTAGCGGA ACAGCGACAC GACTGGGACG GCATGGCGCG CTCCCTGCAC
AGCCTTGGAA TGCTTTCCTT CAAGCGCGGT CAACTGGCGG AGGCGGAACG CTGGTACCTT
CGCTGCCTGG CGGCGGCCGA GAGAGTCCGT GACCGGCCTG GCATGGCGGC CGTCTACCAT
CAGCTCGGCA TGGTGGCACT GAGACGCGGC CAGGACGAGT CGCGGGAATG GTTCCGCCGG
TCCCTGGCCC TCCACGATGA GCTGGGTGAC CGTAGCGGCG TTGCCAGGGA TCACCGCATG
CTCGGGATGA CCGCGATCGC GGGTGGCCAG GCGGACGAGG CCAAACGGCT GCTGTTGGAG
TCGCTCGCCG CCGTGGTGGA CATTGGTGAG CAGAAGAGCA TCATCGAGTG CTACCACCTG
CTCGGAATGC TGGCGCACCA GCAGGGCTGG TGGGACGAAG CCGAACGGTG GCTGCGGAAG
TGCCTCGCCC TGGAGGAGAA GATGGGGCAC GAGGCCGGCG TCGCAGTCTG CTGCCTGCAA
TTGGGCGTAC TCGCACAAGC CCGGAACGAC GACGCCCGAG CTTTGGGATG GACGGTTCGC
GCTCTGGCCG TGGCCCGGCG TTTCCCGCAG TTGGCGGTCC GACTGGCATC CGGCCCGGTG
CTGAGCACAG TGACGAGTCG GCTCGGGATC GGGGCGGTGG AGGAGTGCTG GGAGCGCGTG
ACCGGCGAGT CCTTGCCCGG CGATGTCCGC CTTCTTGCCC TGGAGCACGC CCACGACGAA
CCTCTAGCGG ACAGGTGA
 
Protein sequence
MADAELLVDL DSGRLSTTIG LGSPIDAAEL EDLRWYLEDY LQTPFGVYSD RGSRIAGQLA 
DWGRALFDSV LEAVRVGRED ADFPVRGLSE IVVRSEIPER LGLPWELMRA PGASVPLVLD
GIGVTRCLVA KSPDDSVDAA GDRLRVLMVI SRPASVRDVG YQMIARPLLR SMALAHGEVD
LEVLRPPTLE ALAARLRGAR EAGAPFQVVH FDGHGATDRG ALAFERPGGG ADYVPAGRLA
GVLAAADVPV AVLNACQSGA IGKRLEAAVA TGLVLGGVDA VVAMSYRVYA VAAAEFMTAF
YGQLLTGGTI SEAVRAGRSR MAQHPGRPSP KGELPLEDWA VPVYYRRREV RFPQLRAVPP
VSGGGPDRDE LKSEAEFVGR DDLFCTLEMA ARTDRVLVLH GPGGTGKTEL AKAFGRWWRD
TGGVDRPDGV FWHSFEPGVP SLGLGGAIRE TGLRLYGPEF ALRDPAARRD LVLEQLRARR
LLLIWDNFES VATMPGDVAS PLDETACVEL KDFLREAAHG QSTILVTSRT PESWLGDVRR
LAVEGMLPHE AIEYADQVLA PFPAAAGRRA DRSFGELLEW LEGHPLSMRL ILPHLATTEP
AALLAGLHGT QPLPIAEEGA AAGENARTRS LTASVGYSVV HLSPAARRLL VGLSLLRGVA
DAEILGAFSA QADTPQRFRG VSADEWDMVL DEAARLGLLT TRADGGYGIH PALPAYLAAQ
WQAEEPMIYP GARAAADSAL LAAWAAACQL WGKTMVAEAS AEAYAFIDRN RRTMGRMLGY
ALDNRRWAAA EPIYITLYQY LDHSGLAEEA RGWSDRARRA VEGPGGALSA GKLPTLRLED
LRDPRALSPA KLAEYRSVLS LWRAVLPALA LWNVVVMSQA NTLTRAGDLD AAEARYQEIL
QVQENTPGLS PAARALTYGK LGFVAEERGR FARADEMYRR ALVIQEEGGD RQGIAQSLRH
LGSVALDQGR WDEAAQHYRR SLAILKETPG DLHSTQLVYG KLGDLALLRG HLDEAEDWHA
ESLALAEQRH DWDGMARSLH SLGMLSFKRG QLAEAERWYL RCLAAAERVR DRPGMAAVYH
QLGMVALRRG QDESREWFRR SLALHDELGD RSGVARDHRM LGMTAIAGGQ ADEAKRLLLE
SLAAVVDIGE QKSIIECYHL LGMLAHQQGW WDEAERWLRK CLALEEKMGH EAGVAVCCLQ
LGVLAQARND DARALGWTVR ALAVARRFPQ LAVRLASGPV LSTVTSRLGI GAVEECWERV
TGESLPGDVR LLALEHAHDE PLADR
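
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the text can be measured or analysed. The following is a minimal sketch of that clean-up together with a simple GC-fraction calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, used here as a small example.
first_blocks = clean_sequence("GTGGCGGACG CAGAGCTGCT")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.7
```

Applying `clean_sequence` to the complete gene or protein text and taking `len()` of the result is one way to compare the printed sequences with the Gene Length and Protein Length fields.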