Gene Francci3_3719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3719 
Symbol 
ID3903820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4451603 
End bp4452619 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content76% 
IMG OID637881045 
ProductHemK family modification methylase 
Protein accessionYP_482800 
Protein GI86742400 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.836815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCC CGCCGGCCGC GCAGGGGCGC TCACGGGGCA CGGAGCAGGA GCAGCCGGGC 
CGGGGTGCGT CCCCGCATCC GGGCTCCACG GATCCCCTGA GCCCGGTGCG CCACGCGGCC
GACCTGACAC CGCTGGGCGC GTGGTTGGCG GCGGCGACCG ACCGGTTGCG TGCGGCCGGG
GTGGCGAGCC CCCGCGCCGA CGCCGAGCAG CTCGCCGCCT TCGTGCTCGC GGTGCCCCGC
GGCCGGCTGG CACTGCTCGA CGACGTCACG GCGGCGGCGG CGCGGCGGCT GGACGAGCTT
GTGGCCAGGC GCGCGCAGCG GGTGCCGCTG CAGCACCTGA CCGGTGTCGC CGGCTTCCGC
CACCTCGATC TCACCGTCGG CCCCGGGGTC TTCATCCCGA GGCCCGAGAC GGAGTCCGTC
GTCGAATGGG CGCTTACAGA GCTCACCGGC TCCGCCGGGG CCCGGCGGCC GGGACCGTTG
TGCGTCGATC TGTGCGCGGG CTCGGGGGCG ATCGCGCTGT CCCTGGCGGC GGAGCTGCCC
GGCGCGACGG TGCACGCCGT GGAGGTCGAC CCGGCGGCGG TGGTCTGGCT GCGACGCAAC
ATCGCCGGCA CCGGTCTTCC CGTGACGGCG CACGCAGCCG ACATCGCCGC GGCGCTGCCC
GAGTCGCTCA CCCGACTCGC CGGGACGGTC GACCTGATCA TCAGCAATCC GCCGTACCTC
CCCGACGCCG ACCGCCACAC GGTGGAGCCC GAGGTGGGCG AGCATGATCC GGCCCGCGCC
CTGTGGGGGG GGCCCGACGG GCTCGACGTG GTGCGTACGG TTGTCGGGGT GGCTGCGCGG
CTCCTGCGGC CAGGCGGTCT CCTCGTCATC GAACACGCCG ACGGCCACGG GGTGTCGGCG
CCCGAGCTGC TTCGCGCCGA CGGTCGCTGG TCGCACGTGG CGGATTATCG GGACCTGGCC
GGCCGGGACC GGTTCGTCGC CGGCCGGCGC GGGGCGTCCC GTGAGCCCCG ATCCTGA
 
Protein sequence
MTGPPAAQGR SRGTEQEQPG RGASPHPGST DPLSPVRHAA DLTPLGAWLA AATDRLRAAG 
VASPRADAEQ LAAFVLAVPR GRLALLDDVT AAAARRLDEL VARRAQRVPL QHLTGVAGFR
HLDLTVGPGV FIPRPETESV VEWALTELTG SAGARRPGPL CVDLCAGSGA IALSLAAELP
GATVHAVEVD PAAVVWLRRN IAGTGLPVTA HAADIAAALP ESLTRLAGTV DLIISNPPYL
PDADRHTVEP EVGEHDPARA LWGGPDGLDV VRTVVGVAAR LLRPGGLLVI EHADGHGVSA
PELLRADGRW SHVADYRDLA GRDRFVAGRR GASREPRS