Gene Franean1_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1011 
Symbol 
ID5669425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1193479 
End bp1194483 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content76% 
IMG OID641239940 
ProductHemK family modification methylase 
Protein accessionYP_001505373 
Protein GI158312865 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGGCC CGGCCGCGTC CACGACGCGG GCCGACCACG CGGACGCCAC GCTCGCCGTC 
GAGCTCGCCG CCGCGACGGC CCGGCTGGCC GCGGCCGGTG TCGCCAGCCC CCGCGGCGAC
GCCGAGCAGC TGGCGGCGCA CGTGCTCGGG GTGTCGCGCG GCCGGCTCGC GCTGGTCACC
CGGGTCGAAC CGGCCGCGGC CGGCGAGCTG CGCGCGCTGG TGGAACGGCG GGCGAGCCGG
GTCCCACTGC AGCACCTGAC CGGCCTCGCC GGCTTCCGCC ACCTGGACAT CGCCGTCGGG
CCCGGAGTGT TCATCCCCCG CCCGGAGACC GAGTGGGTGG CCGAGTGGGC GATCGCGGCC
CTGCGATCGC CCGACGCGGT CGTGGCTGGA CGTCCAATCT GTGTTGATCT TTGTGCGGGT
TCGGGGGCGA TCGCCCTGTC GGTGGCCGAC GAGGTGCCGA ACGCCGAGGT GCACGCGGTG
GAGCTGGAGC CGGCGGCGCT CGGCTGGCTG CGCCGCAACG TCGAGCGCAC GGGCCTGCCG
GTGCGGGTGC ACCAGGCCGA CGTCGGCATC CCGCGGTCGC CAACAGACGC GGGCAGGCCG
GTCGCGCCGG TCGGCACGGT CCTGACTGAC CTCGCGGGAC GGGCCGACGT CGTCATCAGC
AACCCGCCGT ATCTGCCCGA TCATGAACGG CCGAGGGTCG AGCCCGAGGT CGGCCGGCAC
GACCCGCCAG CGGCCCTGTG GGGCGGGCCC GACGGCCTGG ACGGGCCGCG CGCGGTCGTG
GCCGCCGCCG GGGGACTCTT GCGGCCAGGC GGTTTACTGG TCATGGAACA CGCGGACGGA
CATGGCCAGA CGGTGCCCGC GCTGCTCGCC GGTGAGGGCT GGTGGGCTGG TTCGTGGTCC
GAAATCGTGG ATCATCCCGA TCTCGCCGGG CGGGACCGGT TCGTCACCGC CCGCTGGAAC
CCGCCGGGGC CGCGCCCGCC GCGCGGCGCC GGAGAGGACG TGTGA
 
Protein sequence
MTGPAASTTR ADHADATLAV ELAAATARLA AAGVASPRGD AEQLAAHVLG VSRGRLALVT 
RVEPAAAGEL RALVERRASR VPLQHLTGLA GFRHLDIAVG PGVFIPRPET EWVAEWAIAA
LRSPDAVVAG RPICVDLCAG SGAIALSVAD EVPNAEVHAV ELEPAALGWL RRNVERTGLP
VRVHQADVGI PRSPTDAGRP VAPVGTVLTD LAGRADVVIS NPPYLPDHER PRVEPEVGRH
DPPAALWGGP DGLDGPRAVV AAAGGLLRPG GLLVMEHADG HGQTVPALLA GEGWWAGSWS
EIVDHPDLAG RDRFVTARWN PPGPRPPRGA GEDV