Gene Francci3_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4014 
Symbol 
ID3906975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4799751 
End bp4802201 
Gene Length2451 bp 
Protein Length816 aa 
Translation table11 
GC content65% 
IMG OID637881343 
ProductN-6 DNA methylase 
Protein accessionYP_483093 
Protein GI86742693 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCCGT TCCGGGGTGT TCAGGGTCGT GGGTGGCAGG CTGTCCGACA TGTCAGGGCT 
GTGGCTAGCC TTGCCGACAG GGGCGACCCG TGGCACGGCC GGCCGACGGT TGGATCTGAG
ACGAGCAGGG GGACGGTGTT GGGGCGCAAG CTGACGTTGC CGCAGCTGGA ACGGCATCTA
TACGCGGCAG CGGACATCCT TCGCGGCAAG ATGGATGCCT CGGAGTTCAA GGAATACATC
TTCGGGATGC TGTTCCTCAA GCGTGCGTCC GACGAGTTCG AGGTGGCCGA GAAGCGAATC
ATCGCCCAGC TCATTGCCGA TGGGCGCAGC CGCACGGACG CCGAGCGGCA GGCCACCCTC
CGAGCACGGT ATGGGGATAC CCTCTACGTT CCAGAGAAGG CCCGGTGGGC TTGGCTACGA
GACCAAATTC ACCACAATGT CGGCGATGCC TTGAACAAAG CTCTAGAACT GTTGGAGCAT
CACAATAGCA CCGCACTCGA AGGTGTCGTC CAACATATCG ACTTCACCCG GACAGTCGGA
CAATCAAGCA TCCCGGACCG CAAATTGCGC GATCTGATCG CGCACTTCAA CACGGTCAGA
CTACGGAACG AGGACTTCGA GTTCCCCGAC CTCCTGGGTG CAGCGTATGA GTACCTGATC
GGGGAGTTCG CTGACTCGGC CGGCAAAAAG GGTGGCGAGT TCTACACCCC ACGGGCGGTC
GTGCGGATGA TGGTAGCCCT CGTCGACCCG AAGCCGGGCA TGGAAGTCTA CGACCCGTGC
TCCGGCTCGG GCGGCATGCT GATCCTCGCC CGGGACTGGG TAGCCGAGCA CGGCGGTGAC
CCGCGGAATC TACGGCTGGT CGGCCAGGAG TACAACGGCG GCGTCTGGTC GATCTCGAAG
ATGAACCTGC TGCTCCACGG CATCCCCGAC GCGGACATCA GAAACGGGGA CACCCTCGCC
GAGCCGATGC ACGTCTCCAG CGGAGAACTG GAGCGGTTCG ACCGAGTGCT ATCCAACCCG
CCGTTCTCCC AGAACTACAG CCGCGAAGGG ATGGATCGGG AGAACCGATT CCGGTGGGGC
TGGGCTCCCG AGGGCGGCAA GAAGGCCGAC CTGATGTTCG TCCAGCACAT GGTCGCGGTG
CTGCGCGCGA ACGGCGTCGC CGCGACCGTC ATGCCGCACG GCGTGCTCTT CCGCGGCGGC
ACTGAGCGCG ACATCCGGAC GGCCCTCCTC GACGACGACG TCATCGAAGC CGTGATCGGT
CTGGCACCGA ACCTGTTCTA CGGGACGGGT ATCCCCGCCT GTGTTCTGGT GCTACGAGCG
CCAGGGTCGA AGCCGGCCGA GCGCGCCGGC AAGGTGCTGT TCGTCAACGC GGACGCCGAG
TTCCGCGCCG GCCGGGCGCA GAATTACCTG ATGCCGGAAC ACGTCGAAAA GATTGTCGCG
GCGTATCACG GGTTCACCGA CATCCCCAGC TACGCGAAGG TCGTCACCCG GGAGGAGCTG
CGCGCGGCCG ACGACAACCT GAACATCCGC CGGTATGCCG ATAACGCGCC GCCCCTGGAG
CTACAGGACG TCCGCGCCCA CCTGCACGGT GGAGTGCCGC GTGCCGAGGT TGCGGCGAAG
GCTGGTCTAT TCGCGGCCCA TGGATTCGAC CTCGGTGAGG TGTTCGTCGA CCGGGACGCC
GACTACCTCG ACTTCGCCGA CGGCGTCACG AAAAGCGATC TTCGTCGGCT CGTCGAGGGT
CACCCGGGGG TGCTCGCCCA GGAGCGTGAG GCCGTGGAAG CGCTCTCGGC TTGGTGGGAG
GGCAACTGCG AACTTTTCGA TGGGCTGGCT GCGGAACGTG GGTTGCACAC CACGCGAACA
GCCCTGTTGG AGAGCTTCTC AGCGGTGCTG ACCCCCGTTG GCCTGCTTGA CCGTTTCCAG
GTCGCCGGCG TCTTCGTCCG CTGGTGGGAC GCCGTCCAGT TCGACCTGCG GACCCTCGCC
GCGAACGGCT ATGACGGGGT GCTCGACGGC TGGGTGACCA CCATCGTCAC TGCCGCCGAG
GACGCACAGT CGAAGACCGA CCCTTTCGAC CACCGGCTGG TCCGGGCGCT CCTCGCGGAT
TTCCTGGACG AGCTGGCGAC AGTAGAGGCC CAGCGCGCCG AGCTCGACGC GAAGATCAAG
GCTGCTACGG CGCCAGTGGA CCAGGGCGAC GAGGATGCTG GCGAGGAGAG CACCGATACC
GAGCGGCTCT CCCCGGCGGA GCTGATTTCC CTGAAACGGG AGTTGACCAG GGTGAGGCGC
CGCCACCGCG AGCTGACACG TGAGATAGTC ACCCGACTGG AGAAGGCCCG CGCAGAACTA
ACCATCGCCG AGGTCCGTGA CCTCGTCCTC CGTCTGACCT ACGACCTCCT TGCCGAGCAT
CTTGCTGAGT ATATCACGCT GAGTATATCA CGGCCCGCCG TATCCAGGTG A
 
Protein sequence
MRPFRGVQGR GWQAVRHVRA VASLADRGDP WHGRPTVGSE TSRGTVLGRK LTLPQLERHL 
YAAADILRGK MDASEFKEYI FGMLFLKRAS DEFEVAEKRI IAQLIADGRS RTDAERQATL
RARYGDTLYV PEKARWAWLR DQIHHNVGDA LNKALELLEH HNSTALEGVV QHIDFTRTVG
QSSIPDRKLR DLIAHFNTVR LRNEDFEFPD LLGAAYEYLI GEFADSAGKK GGEFYTPRAV
VRMMVALVDP KPGMEVYDPC SGSGGMLILA RDWVAEHGGD PRNLRLVGQE YNGGVWSISK
MNLLLHGIPD ADIRNGDTLA EPMHVSSGEL ERFDRVLSNP PFSQNYSREG MDRENRFRWG
WAPEGGKKAD LMFVQHMVAV LRANGVAATV MPHGVLFRGG TERDIRTALL DDDVIEAVIG
LAPNLFYGTG IPACVLVLRA PGSKPAERAG KVLFVNADAE FRAGRAQNYL MPEHVEKIVA
AYHGFTDIPS YAKVVTREEL RAADDNLNIR RYADNAPPLE LQDVRAHLHG GVPRAEVAAK
AGLFAAHGFD LGEVFVDRDA DYLDFADGVT KSDLRRLVEG HPGVLAQERE AVEALSAWWE
GNCELFDGLA AERGLHTTRT ALLESFSAVL TPVGLLDRFQ VAGVFVRWWD AVQFDLRTLA
ANGYDGVLDG WVTTIVTAAE DAQSKTDPFD HRLVRALLAD FLDELATVEA QRAELDAKIK
AATAPVDQGD EDAGEESTDT ERLSPAELIS LKRELTRVRR RHRELTREIV TRLEKARAEL
TIAEVRDLVL RLTYDLLAEH LAEYITLSIS RPAVSR