Gene Franean1_3767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3767 
Symbol 
ID5672132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4466428 
End bp4468122 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content68% 
IMG OID641242648 
ProductN-6 DNA methylase 
Protein accessionYP_001508068 
Protein GI158315560 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCGC GGAGGAAGTC CGCTCAGGCC GGACCGCCCA CCATGGCCCA GTTGCGCGAG 
ACGCTGTGGA AGACGGCCGA CAAGCTGCGC GGCTCGATGG ACGCCGCCCA GTACAAGGAC
TTCGTCCTCG TCCTGATCTT CCTCAAGTAC GTCTCGGACG CGTTCGCCGA GCGGCGCGAG
CAGATCCGGC AGGACGTGCT CGCCGACGGG ATCGACGAGT CCCGGGCCGA GGAGTTCCTC
GACGACGTCG ACGAGTACGC CGGGCAGGGT GTGTTCTGGG TGCCGGGGCG GGCCCGGTGG
GAGCATATAG CGGCGAATGC GAAAAGCGCC GGCATCGGGG AACTGCTGAA CGCGGCGATG
GACGCCGTGA TGAAGACGAA CCCGGCGCTC ACCGGCGTCC TCCCGCGGAT CTTCAACGGC
GAGGGCGTCG ACCAGCACCG GCTCGGCGAG CTCGTCGACC TGCTCGGTGA CGCCCGCTTC
ACCGGCCACC GGGCGACCGA GCGGCCCCCG TCCACGCCTA CTGGCGAGGA CGGTGCGCTT
TTCGGCGAAT CCGCCGCCGG AGTTCCGACG GAAGCGGCGA CCCGGCCGGC GCGGGACGTG
CTGGGGGAGG TGTACGAGTA TTTTCTGGAG AGGTTCGCCC GCGCCGAGGG CAAGCGCGGC
GGTGAGTTCT ACACCCCGGC CAGCGTGGTG CGGTTGCTTG TCGAGGTGCT GGAACCCTAC
GAGGGCCGGG TGTACGACCC GTGCTGCGGT TCGGGCGGCA TGTTCGTCCA GGCGGAGAAG
TTCGTCGTCG CGCACCGAGG GCTCACCCAC TCCGGCGACA TCGCCGTCTA CGGCCAGGAG
TCGAACGAAC GGACCTGGCG GCTGGCGAAG ATGAACCTCG CCATCCACGG GATCACCGGC
GACCTGAGCG CCCGGTGGGA CGACACCTTC CGCAACGACC GGCATCCGGA CCTGCGGGCC
GACTTCATCC TGGCGAACCC GCCCTTCAAC ATGTCCGACT GGGCGCGCAC CGTCGACGAC
CAGCGCTGGC GGTACGGGAC CCCACCGACC GGCAACGCGA ACTTCGCCTG GCTGCAGCAC
ATCATCGCCA AGCTGGGCTC CCGCGGGACC GCCGGCGTGG TGATGGCGAA CGGCTCGATG
TCGTCGAAGC AGTCCGGCGA GGGTGAGATC CGCGCCGCGC TGGTCGAGGC CGACCTGGTG
GCCTGCATGA TCGCGTTGCC GCCACAACTG TTCCGCACCA CCCAGATCCC GGCCTGCCTG
TGGTTCTTCG CGAAGGACAA GGGCCAGCTG GGCGCCCGCT GGCTCGCCGA ACGGCGCGGC
GAGACGCTGT TCATCGACGC CCGCGACATG GGCACGATGA TCGACCGCAC CGAGCGCATC
CTCACCGACG GTGACCTCGA GAAGATTACC GACACCTACC GTGCCTGGCG TGGCGCGAAG
TCGGCCCGCG ACAAGGGCCT GGCCTACGAG AACATTCCCG GCTTCTGCTA TTCGGCCTCC
ACCGAGGAGA TCCGCACCCA CGACCACGTC CTGACCCCGG GCCGCTACGT AGGCGCCGCC
GAAGCCGACA TTTCCAACGA CGAGCCGATG GCCGAGAAGA TCGAGCGTCT CGCCAAGGAA
CTCTTCATCC ACTTCGAAGA GTCCGCTCGC CTGGAAAAGG AAGTTCGCAG CCAACTGGAG
CGCCTCGATG CCTGA
 
Protein sequence
MPPRRKSAQA GPPTMAQLRE TLWKTADKLR GSMDAAQYKD FVLVLIFLKY VSDAFAERRE 
QIRQDVLADG IDESRAEEFL DDVDEYAGQG VFWVPGRARW EHIAANAKSA GIGELLNAAM
DAVMKTNPAL TGVLPRIFNG EGVDQHRLGE LVDLLGDARF TGHRATERPP STPTGEDGAL
FGESAAGVPT EAATRPARDV LGEVYEYFLE RFARAEGKRG GEFYTPASVV RLLVEVLEPY
EGRVYDPCCG SGGMFVQAEK FVVAHRGLTH SGDIAVYGQE SNERTWRLAK MNLAIHGITG
DLSARWDDTF RNDRHPDLRA DFILANPPFN MSDWARTVDD QRWRYGTPPT GNANFAWLQH
IIAKLGSRGT AGVVMANGSM SSKQSGEGEI RAALVEADLV ACMIALPPQL FRTTQIPACL
WFFAKDKGQL GARWLAERRG ETLFIDARDM GTMIDRTERI LTDGDLEKIT DTYRAWRGAK
SARDKGLAYE NIPGFCYSAS TEEIRTHDHV LTPGRYVGAA EADISNDEPM AEKIERLAKE
LFIHFEESAR LEKEVRSQLE RLDA