Gene Francci3_0236 details

Gene Information

Locus tag: Francci3_0236
Symbol:
ID: 3906540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 276160
End bp: 277170
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 72%
IMG OID: 637877565
Product: HemK family modification methylase
Protein accession: YP_479354
Protein GI: 86738954
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2890] Methylase of polypeptide chain release factors
TIGRFAM ID: [TIGR00536] HemK family putative methylases
            [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific
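
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
start_bp = 276160
end_bp = 277170
gene_length_bp = 1011
protein_length_aa = 336


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1011
print(expected_protein_length(gene_length_bp))  # 336
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.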


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.92387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.953323
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTGCTC TCCTCCCGCG GTCGCGCGGT TCGCTCGAGG TAGGCGATCG GGAGGGTGCC 
CTTCGGCATC CCGAAAGGCT TCCCGTGTCG ACATCCGGCC CTTCTCCTTC CTTTCTCTCC
CCTTCTCGTC CGTCCTTTCC CTCCTCTTCT CGTCCGTCTT CCTCACCGTT TTCGTCCCTG
CCGCGGTCCG TCGTCGTCAC GCGGCTGCGG GCCGCCGGCT GCGTCTTCGC CGAGGACGAG
GCACGGCTGC TCGTCAATGC GGCCCGGACC CCGGGCGAAC TCGCCGCGAT GGTGCGGCGG
CGCGTCGCCG GTCTGCCCCT GGAACACGTG CTCGGCTGGG CGGAGTTCCA CGGCCTGCGG
ATAGCCGTGG ACCCCGGGGT CTTCGTGCCC CGCCGCCGCA CGGAGTTCCT TGTCGATCAG
GCGGTCGAAC GGGTTGCCGG CCGGTCCCGG CCGGTCACTG TCGTCGATCT GTGCTGTGGC
TCGGGGGCGA TGGGGGTCGC GCTGGTGGCA GCCCTGCCCG GGATCGAAGT ACACGCCGCC
GACATCGAAC CGGCCGCGGT GCGGTGCGCC CGCCGCAACC TCGCCTCCGC CGGTGGCCAG
GTCTACGATG GTGATCTTTA CGAGCCGCTG CCAGCCGTCC TACGGGGCCA CGTGGACCTG
CTGGCCGCGA ACGCCCCCTA TGTACCCACC GACGCGATCG AGCTGATGCC GCCGGAGGCT
CGCGAGCACG AGCCGCGGGT GGCGCTCGAC GGCGGGGCGG ACGGGCTCGA TGTTCTGCGG
CGGGTGGCCG CCGAGGCGCC GCGGTGGCTG GCCCCGGGCG GTCACCTGCT GGTCGAGACC
GGCGAGCGGC AGGCGGCATC GATCGTCGAA GCCAGCGCCC AGGCCGGTCT GATCCCCACG
GTCGCCAGTT CCGCCGACCT GAACGCCACC GTCGTCATCG CCACCAGGCC GGCCGCGGCT
CCCGTGTCCG CGGTGGGCGC GGCAGCTGGA CAGCTGCCGC CAGTCGGGTA G
 
Protein sequence
MGALLPRSRG SLEVGDREGA LRHPERLPVS TSGPSPSFLS PSRPSFPSSS RPSSSPFSSL 
PRSVVVTRLR AAGCVFAEDE ARLLVNAART PGELAAMVRR RVAGLPLEHV LGWAEFHGLR
IAVDPGVFVP RRRTEFLVDQ AVERVAGRSR PVTVVDLCCG SGAMGVALVA ALPGIEVHAA
DIEPAAVRCA RRNLASAGGQ VYDGDLYEPL PAVLRGHVDL LAANAPYVPT DAIELMPPEA
REHEPRVALD GGADGLDVLR RVAAEAPRWL APGGHLLVET GERQAASIVE ASAQAGLIPT
VASSADLNAT VVIATRPAAA PVSAVGAAAG QLPPVG
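
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 (as listed in the record), which uses the standard amino acid assignments and allows alternative start codons such as GTG to be read as methionine at the first position. The helper names (`translate_cds`, `gc_fraction`) are hypothetical, and the sketch is not the method used by the source database.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGGGTGCTC TCCTCCCGCG GTCGCGCGGT"
print(translate_cds(first_blocks))  # MGALLPRSRG
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the complete protein and the GC content field to be checked in the same way.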