Gene Francci3_0192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0192 
Symbol 
ID3903219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp225565 
End bp227316 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content66% 
IMG OID637877523 
ProductN-6 DNA methylase 
Protein accessionYP_479312 
Protein GI86738912 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.341004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCC TCGGTAGTTT CATCTGGTCG ATCGCCGACC AGCTTCGGGG TCCCTACCGC 
CCCAACCAGT ACGGCAACGT GATCCTCCCG CTCACGATCC TGCGCCGGCT CGACTGCATC
CTCGAGCCCG ACCGGGAGAC GGTCCGCGAG CTCGCGCGGA CGTTCGACAA CCCGAACCGG
CTGAAGATCG AGGTGAAGAA GGCGACCGGC AGGCCGTTCT ACAACACTTC CAACTACGGA
TTCAGCAACC TCCTGGCCGA CGCCGACGGG CTGGCGGACA ACCTGGCCGA CTACCTCGAC
CGATTCTCGG CGGACGTCGA CGTGTTCGAG TACTTCGACT TCAAGAAGGA GATCCTCGCG
CTGGAGAAGG CGGGGCTCCT GCGCGAGATC ATCACCTCGT TCAAGGCGAT CGATCTGCAC
CCCAAGGTGG TGTCGAACGC CGACATGGGC GATGCGTTCG AGTACATCAT CCGCAAGTTC
AACGAGGCCG CGAACGAGAC CTCCGGCGAC CACTACACCC CACGCGACGC GATCCGGCTG
CTGGTCGACC TGCTCTTCGC CGAGAAGGAG GCCGACCTGT CCGAGGCCGG CATCGTGCGT
ACCCTCTACG ACCCGACCGC GGGCACCGGC GGCATGCTCG CCCTGGCCGA GGAGCACCTG
CTCGCGCAGA ACCCGGACGC GAACCTGAGC CTGTATGGCC AGGAGTACAA CCCGCAGTCG
TACGCGATCT GCAAGTCCGA CCTGCTCGCC AAGGGCCACG ACGCGACCAA CATCGCCTTC
GGTAACACGC TCACCGACGA CGCCTTCAAG GGCAGGAAGT TCGACTTCTG CATGTCCAAC
CCGCCCTACG GCGTCGACTG GAAGCAGTAC GCCAAGAAGG TCACCGAGGA GCGCGACGAG
GCGGGCCCGT ACGGCCGGTT CGCCCCCGGC CTGCCGGCGA CCTCGGACGG GCAGATGCTC
TTCCTGCTCC ACCTGGCCCA CAAGATGCGG GCGCCCAAGG ACGGCGGCGG CCGGGTCGGG
ATCATCATGA ACGGCTCGCC GCTCTTCAAC GGCGCCGCCG GGTCCGGCCC CTCCGAGATC
CGCCGGTGGC TGCTGGAGAA CGACCTGGTC GAGGCGATCG TCGCGCTGCC GACCAACATG
TTCTTCAACA CCGGCATCGC CACGTACATC TGGATCCTCG ACAACACCAA GCACCCCGAC
GCCAGGGGCC TGGTCCAGAT CATCGACGGC ACCTCGTTCT GGACCAAGAT GCGCAAGAAC
CTCGGTTCCA AGGGCCGTGA GATCTCCGAC ACCGACCGCG AGAAGGTCGT CAGCCTGTAC
GTCGACTTCC TCGACGCCGA CCCCGACTAC TCCAAGGTGC TGAGCAACGA CGAGTTCGGC
TACTGGACCA TCACCGTCGA GCGACCCCTG CTTGGCGAGG ACGGCAAGCC GGTCGTCGAC
CGCAAGGGTC AGCGCAAGCC CGACCCGAAG AAGCGCGACA CCGAGAACGT CCCCTTCACC
TACGGTGGCT CGACCGCCGG CCGGGCCGGC AAGCTCGACG TCATCAACGC CTACTTCGAC
GCCGAGGTGA AGCCGCACGT CCCCGACGCC TGGATCGACT GGGCCAAGGT CAAGACCGGC
TACGAGATCC CCTTCACCCG CCACTTCTAC AGGTACGTCC CACCCCGCCC CCTCGCCGAG
ATCGACGCCG ACCTGGACAA GCAGATCGCC AAGATCCTCG ACCTCCTGCG GGAGGTCGAG
GGTGATCGCT GA
 
Protein sequence
MSTLGSFIWS IADQLRGPYR PNQYGNVILP LTILRRLDCI LEPDRETVRE LARTFDNPNR 
LKIEVKKATG RPFYNTSNYG FSNLLADADG LADNLADYLD RFSADVDVFE YFDFKKEILA
LEKAGLLREI ITSFKAIDLH PKVVSNADMG DAFEYIIRKF NEAANETSGD HYTPRDAIRL
LVDLLFAEKE ADLSEAGIVR TLYDPTAGTG GMLALAEEHL LAQNPDANLS LYGQEYNPQS
YAICKSDLLA KGHDATNIAF GNTLTDDAFK GRKFDFCMSN PPYGVDWKQY AKKVTEERDE
AGPYGRFAPG LPATSDGQML FLLHLAHKMR APKDGGGRVG IIMNGSPLFN GAAGSGPSEI
RRWLLENDLV EAIVALPTNM FFNTGIATYI WILDNTKHPD ARGLVQIIDG TSFWTKMRKN
LGSKGREISD TDREKVVSLY VDFLDADPDY SKVLSNDEFG YWTITVERPL LGEDGKPVVD
RKGQRKPDPK KRDTENVPFT YGGSTAGRAG KLDVINAYFD AEVKPHVPDA WIDWAKVKTG
YEIPFTRHFY RYVPPRPLAE IDADLDKQIA KILDLLREVE GDR