Gene Francci3_2674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2674 
Symbol 
ID3904898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3156969 
End bp3159209 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content74% 
IMG OID637879999 
ProductN-6 DNA methylase 
Protein accessionYP_481765 
Protein GI86741365 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.559823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.558712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGGC TGCCGGTGAA GACCAGCGGC CGGGACACGG CGAGCTCCGC GGCTGTCACG 
GCCACGGCCA CGGCCACGGC CACGGCCGCA GACACGGCCA CGGTCACCGC CACCGACATC
GCCCGGCTGG CCGGAGTCGG GCGGGCCGCG GTCAGTAACT GGCGGCGCCG GTTCGCCGAC
TTTCCCGCAC CCGTCGGGGG GACGTCGACC AGCCCGCTGT TCGCGCTGGC CTCCGTGGCG
GACTGGCTCG CCCGGCATGA CCGACCGTTC CAGGTCGAGG CGGCCGACCA GGTGTGGCAG
CGGATCCGGG GGGCCGTCGA TGATCCACGC CTCGGTGAGG TGACCGGCCA TCTCGGGGGT
TTCCTCATTT ATCGTCAGCG GGATCCCGCG GGCGCCACGG CACTGCTCGG GGAGAGCGAC
GCGGTGGCGG CCGTATCCCT GGAACCGGCG ATCGCCCGCG CCGTGCCGGA GCTTCCCGGT
GGTTTCCCCG ATACCTGGGA GACCGGGTGG GTGGCGATCG CCCGGGTCGC CGCCGAGGCG
GCGGACCGGC ACGGCCACGC GGAGCTCTTC GACGCGCTGC GCGCCCGCTA CCGCGAGGTC
TGCTCCCGGC AGGTGGCGGA GCCTCCGCGG GCGGTCGGGG AGCTGATGGT GACCCTCGCC
GGCCTGCGCG GCCGGTCGGG GGCCGCCACG GTGCTCGACC CGGCCTGCGG GATCGGCGGG
CTGTTGGAGG CCGCCCGGGC CGCGGGGGCG GGCCGCCTGC TCGGCCAGGA TGTCAACCCG
ACGATGGCGC GGGTGAGCGC CGTCGGACTG CTGCTGCGGG GTGGTGACGC GCGGATCGTC
GCGGGCGATT CGCTGCTCGC CGGCACCTTC GCCGGGGAAC GGGCCGACGC GGTCCTGTGC
GCCCCGCCGT TCGGGCAGCG TTCCTGGGGC TACGACGAGC TGCTCGGCGC GCCGTGGTGG
CGCCACGGTG TTCCACCGCG GGGGGAACCC GAGCTGGCCT GGGTGCAGTA CTGCCTGGCG
CACGCCCGGG ACGGCGCGCA GGTGCTGGTC ATCATGCCGG CGGCCGCCGC GTCGCGCCGT
GCCGGGCGCC GCATCAGGGC GAACCTGCTG CGCGCCGGGG AGTTGCGCGC CGTGCTGGGG
CTGCCGCCCG GTCTGTTCCC GGCGGGATCG GCCCCGGATC TGTGGGTGCT CCGCCGCGGG
GCCGGCCCCG ATGCCGGTGC CGGTTCCGAT GCCGGTGCCG GCGGGGCCGG GGCGACGCCC
GCTCAGGTGC TGCTGGGCCT GGCGCGCGAT GAGATCGCCG TCGTCGAAGC GGCCTGGGCG
TTGTTCGTCG CCGCTGGCGG GTTCGGCGGC GACACCGCCG CCCAGGCGGA CCAGGTGGCC
GACGACGCCG ATCTGCCGGC GGGATTCCGG CGGATATGCG TTGCGGACCT CCTCGACGAC
GAGGTCGACG TGAGCCCGCG ACGCTACATC GACCATCCTC CGGAAAGAAC CGCTGTCGGT
GAATTTCCCG CGGTCGGTGA ATTCCCCGCT GTCCGGGAGG ATTTCCTGGC GACGGTGGCC
TCGTTGTCCT CGGCGGTTCC CCTCCTGACC CCACCGAGGG GATCCGGACC GGCGGACCGG
CCGGCTTCGT TGAGCGTCAC GTCGAGCGTC ACGATCGGTG AGTTGGCAAA AGCCGGATCG
TTGACGATCC TGATGGCCCC GTTGCAGACC AGGGCCGGGG TCGGTGACCT GCCGTTGCTC
ACCGCGGGCG ATGTCCGGCA TGGGCGGGGC CCGTCGAGCC GGACGGAGGA ACTGGCCGGA
ATGCTCGTTC TCCGGCCCGG CGACGTCGTC TGTGTGACGG CGTCGGGGGA GAGCATGGCC
CGAGTCGTCG AGGACGCGGG CGCCGTTCTC GGGCCGCGGG TGGTGCTCCT GCGGGGCGAT
CCCGACCGGC TCGATCCGTA CTTCCTGGCC GGGATCCTGC GGGCGGCAGG CCATCGGGCG
CACGGGCGAT CCGGTGGTGG CGCGGATGCC CAGGCCGCCG GTCGCTCGGG TGGATCGTCG
CCGCGCGCCG ACCCGCGGCG GGTCCGGATA CCGGTTCTCC CGCTGGCCGA ACAGCGTTCC
CGGGGGCTCG ACTTCCAGCG CCTCACGATG GTCGAGAAGT CGCTGCGCAA CGCGGTGGAA
CTGGCCGAGA CCATCGTCCG GCTCGGTCAT CTCGGAATCG CCGACGGAAG CCTGCGGGGC
GGGGAGACAG GTCCCGGGTA A
 
Protein sequence
MNGLPVKTSG RDTASSAAVT ATATATATAA DTATVTATDI ARLAGVGRAA VSNWRRRFAD 
FPAPVGGTST SPLFALASVA DWLARHDRPF QVEAADQVWQ RIRGAVDDPR LGEVTGHLGG
FLIYRQRDPA GATALLGESD AVAAVSLEPA IARAVPELPG GFPDTWETGW VAIARVAAEA
ADRHGHAELF DALRARYREV CSRQVAEPPR AVGELMVTLA GLRGRSGAAT VLDPACGIGG
LLEAARAAGA GRLLGQDVNP TMARVSAVGL LLRGGDARIV AGDSLLAGTF AGERADAVLC
APPFGQRSWG YDELLGAPWW RHGVPPRGEP ELAWVQYCLA HARDGAQVLV IMPAAAASRR
AGRRIRANLL RAGELRAVLG LPPGLFPAGS APDLWVLRRG AGPDAGAGSD AGAGGAGATP
AQVLLGLARD EIAVVEAAWA LFVAAGGFGG DTAAQADQVA DDADLPAGFR RICVADLLDD
EVDVSPRRYI DHPPERTAVG EFPAVGEFPA VREDFLATVA SLSSAVPLLT PPRGSGPADR
PASLSVTSSV TIGELAKAGS LTILMAPLQT RAGVGDLPLL TAGDVRHGRG PSSRTEELAG
MLVLRPGDVV CVTASGESMA RVVEDAGAVL GPRVVLLRGD PDRLDPYFLA GILRAAGHRA
HGRSGGGADA QAAGRSGGSS PRADPRRVRI PVLPLAEQRS RGLDFQRLTM VEKSLRNAVE
LAETIVRLGH LGIADGSLRG GETGPG