Gene Franean1_2033 details

Gene Information

Locus tag: Franean1_2033
Symbol:
ID: 5670434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2442856
End bp: 2445183
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 71%
IMG OID: 641240954
Product: N-6 DNA methylase
Protein accession: YP_001506376
Protein GI: 158313868
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
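The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information record.
start_bp = 2442856
end_bp = 2445183


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2328 775
```

Both results agree with the Gene Length and Protein Length fields of the record.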


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.599209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.628554
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGTC CCGAGTGGGT GTCGGCCGCC GAGATCGCCC GGGGTGCCGG GGTGAAGCCG 
GCCGCGGTCA GCAACTGGAG ACGGCGGCAC GCGGACTTTC CGGCTGCCGA GCGGCACGGT
GGCCGGGAGG TCTTCCGGGT TGGGGATGTC GCCGCGTGGC TCGACGGCCG TCCGGTCGCC
GCCCCCGATC AGCGCCCCGG CGAGGAGCCG GGAGCCACCT ATGGCCACCG CTTCCGCCAG
GCCGTCGGGG CCACCGAACC GGTCGCCGGG CCGTCGGCTG CTGCCGGCGT CCCCACGGGC
ACCGCCGGCC GACTGTGGTG GGCGCTGGCC GACGCTTACA GAGGCAAGGT CGGGATGTCC
GACGACCTGA TGCTGTTGAC GGGCCTCGCG TTCGCGTTTC TCCACCTCCG ATTCCACCGG
GCACAGCGAT GGACGGAACT CACCGCAGCG GCTGCGGACG TAGAGCCGGG CACGGTGGTC
CGGCTCGTCG AGTCCGCGCT GCGCGCGAGC GGGGAGGCGC CGGCCTCGAT CGCCGACATG
TCGGCTCTGC TCGGAGACAC GTCCGACGTC CGCCGGCTGG TACCGCTGAT ACAGATCCTG
GATCGACTCG AGCCGGTGCC CGATGCCGGC TCCGGCGACG TCGCGCCGCC CGCCGGCCGC
ATCGCCGACG AGCTGCTGGC CCACGCCGCG ACCGTCGGCG GCTGGCGGTC CGCCAACGTT
GTGACGCCCC CTTCCGTGGT CCGCACGGCG GTCCGTCTGA CGGACCCCGT CGCTGGCGAC
CGGGTTCACG ACCCGTTCTG CCGCGCCGGT GAGTTCCTCG TCGGCGCCGC CGACCACATC
AGATCCCGTG GCACCGGCAG CCCGAAGCTG ACCGTCAGCG GACAGGAGAT CAACCCCTCC
CTCCGGTGGC TTGCCCGGAT GAACCTGCTT CTGCACAACC TCGGCGCCGA GGACCTGCGG
GCCGGGTGGG CCCTTTCTTC TCCCGACCCA CAGCCGGGCG GCCCGTTCGA GGTGGTCCTG
GTCAACCCGC CTTTCAACGT GTCGGGCTGG CGGGACGGCG ATCAGAATCC CGATTCATCC
TGGCGCTACG GCGTGCCGCC CGGCCACAAC GCGAACTACG CCTGGCTGCA GCACGCGCTG
GCCTGCCTGG CCGAGGGCGG TCGGGCCGTG GTCGTGATGC CCGCGGGCGC CGGTTCCTCC
GCGAATCTGC AGGAGAGCGC CATCCGGGCC GCCATGGTCG AGGAGGGCGT CGTCGACGCG
GTGGTTGCTC TGCCACCCCG GCTCTTCGTC AGCACCTCGA TCCCGGTAAC GCTCTGGGTA
CTGCGCTGGC CGAGCCCCGG CCACGACGAC GTGCTGTTCG TCGATGCCCA CGGGGCCGGA
AGGATCGTCG AGCGGAACCG TTCGGAACTG CGCGACGAGG ACGTCGACCA CATCGCCGAG
GCGTACCGGA ACCGAGCGGC GCGGTCCACG GGGACGGTGC TGTCCCGGCC CGTCGACAGG
CGGCGCATCC GGGAGAACGG GTATGCGCTC AGTCCGGCCC GCTATCTCAC GGCCACGACC
GAACCCGTCG ATCCCCTACG GGCCCGGGTC GGGATCGAGC AGCTCCGTCG CGACCTGCGC
GAGCTCCACC AGCGGGCAGC CGGGGCGCAT GAGCACGCGG AGCGTCAGCT GGACGAGGTG
GGCGACCTGT TCGGCGCGGC CACCACTGGC TGGCGGCGAC TGCCCCTGGG CGACGTGTGC
GACGTACTCG CGGGTTTCTC CGGCGCGGTC AGGACGGAAC GCGGGCTGCC TTCCGGTATT
CCGGTCGTCA AACCGAGGAA TCTCGTGGAC AACCGCATCT CGCCGGAAGG CGTCGACTAT
GTCGCGCCCG ACGTGGCGGC GAGAATGGAA CGGTACCGGC TGCGGGCGGG TGACATCGTC
TGCGTGCGAA CCGGCCAGCT CGGCCGGCAG GCCCTGGTGA CCGAGGAGCA GAGCGGTTGG
CTGATCGGCA CGTCCTGCCT ACGCCTACGC CCGGACGAAT CCGTTGATCC TCGTTATCTG
GTCCACTTTC TGGCCCTTCC CCAGATCAGT GAATGGCTGC TCGGCCACTC CACCGGCTCG
GCGATCCGGG TGTTGACCGC CGCGACTATG CGTGGGCTTC CCCTCGTCCT TCCGGACCGT
CACCAGCAGG GCCGCATCGG CTCGGCGGCA GGTTCGCTGG ACGATCTGGT AGCGGTGCAC
GACCAGATCC GTCAGGTCAG TTCCGCGCTC CGCGACGCGC TTCTCCCGTT GTTCCTCCAG
GATCCGACAC CGCCGGGCCC TGTCCCCGAG GAGGGATCGA AGTCGTAA
 
Protein sequence
MTSPEWVSAA EIARGAGVKP AAVSNWRRRH ADFPAAERHG GREVFRVGDV AAWLDGRPVA 
APDQRPGEEP GATYGHRFRQ AVGATEPVAG PSAAAGVPTG TAGRLWWALA DAYRGKVGMS
DDLMLLTGLA FAFLHLRFHR AQRWTELTAA AADVEPGTVV RLVESALRAS GEAPASIADM
SALLGDTSDV RRLVPLIQIL DRLEPVPDAG SGDVAPPAGR IADELLAHAA TVGGWRSANV
VTPPSVVRTA VRLTDPVAGD RVHDPFCRAG EFLVGAADHI RSRGTGSPKL TVSGQEINPS
LRWLARMNLL LHNLGAEDLR AGWALSSPDP QPGGPFEVVL VNPPFNVSGW RDGDQNPDSS
WRYGVPPGHN ANYAWLQHAL ACLAEGGRAV VVMPAGAGSS ANLQESAIRA AMVEEGVVDA
VVALPPRLFV STSIPVTLWV LRWPSPGHDD VLFVDAHGAG RIVERNRSEL RDEDVDHIAE
AYRNRAARST GTVLSRPVDR RRIRENGYAL SPARYLTATT EPVDPLRARV GIEQLRRDLR
ELHQRAAGAH EHAERQLDEV GDLFGAATTG WRRLPLGDVC DVLAGFSGAV RTERGLPSGI
PVVKPRNLVD NRISPEGVDY VAPDVAARME RYRLRAGDIV CVRTGQLGRQ ALVTEEQSGW
LIGTSCLRLR PDESVDPRYL VHFLALPQIS EWLLGHSTGS AIRVLTAATM RGLPLVLPDR
HQQGRIGSAA GSLDDLVAVH DQIRQVSSAL RDALLPLFLQ DPTPPGPVPE EGSKS
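
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input to keep the example short.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGACGAGTC CCGAGTGGGT GTCGGCCGCC GAGATCGCCC GGGGTGCCGG GGTGAAGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the cleaned length would be expected to match the 2328 bp reported in the record, and the GC fraction should be close to the listed 71%.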