Gene Franean1_5691 details


Gene Information

Locus tag: Franean1_5691
Symbol:
ID: 5674017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6909713
End bp: 6910903
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 67%
IMG OID: 641244544
Product: DNA (cytosine-5-)-methyltransferase
Protein accession: YP_001509947
Protein GI: 158317439
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
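
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that adds no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6909713
end_bp = 6910903
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and is not counted as a residue.
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("codons - 1 matches Protein Length:", expected_protein_aa == protein_length_aa)
```

All three checks hold for this record: the span is 1191 bp, which is 397 codons, giving 396 residues plus a stop.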


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCCGGT TCCGGCTTTA CCCGGCGTCC GAGCAGGCTG CTGTAATGGA GGCCCACTGC 
GGCCACGCAC GGTTTGTGTG GAATCTTGCC GTAGAACAGC AGTCGTGGTG GACGCCGCGA
CGCGGGCCGG CGCCGGACTA CCACGAGCAG TCCCGGCAGC TCACCGAAGC GCGACGCGAG
TTCCCGTGGC TCACCGAAGG GTCGCAGACC GTTCAGCAGC AGGCGTTGCG GGATTTTGCA
CAGGCCATGG ACAACTACTT CCGGGGCAGT CACCGGAAGC CGACGTTTCG GAAACGTGGC
CGGTCGGAGG GGTTCCGGAT CGTCGCTGTC AAATCCTCGG ATATTCGCAC GGTGAACCGG
CGCTGGTCCG AGGTGAGGGT TCCCAAGGTC GGCTGGGTGC GTTTCCGTCG CTCGCGGACT
GTCCCGAAGG CGAAGTCCTA CCGGGTCACC AGGGACCGGG CTGGCCGGTG GCATGTGGCG
TTCGCCGCGA TCCCCGAACC GATCGACGCA CCGGGTACCG GTGCGACCGT CGGTGTCGAC
CGTGGGGTGG CTGTGTCGGC GGCGCTGTCG ACGGGCGAAC TGCTGTCCTG CCCGAGACTG
CCGCCCACAG AGGCGCAGCG GCTGGTCAGG CTGCAACGGC GACTTGCCCG GGCGAAGCGC
GACAGCAACC GGCGCAGCCG TCTCAAGGCC CAGATCGCCC GGGTGAAGGC CCGTGAGGTG
GACCGGCGGA AGGACTGGGT GGAGAAGACC AGTACCGACC TTTCCCGCCG ATTCGACCTG
ATCCGCGTCG AAGACCTGAA GGTCAGGAAC ATGACCCGCT CGGCGCGGGG CACCGGGGAG
GCACCGGGCA GGAACGTCCG CCAGAAAGCC GGCCTGAACC GGGCCATCCT GGCAAGCGGT
TGGGGCCTGC TGGTGCAGCG CCTCGAGGAC AAGGCCCCCG GCCGGGTCGA GAAGATACCC
GCCGCGTACA CCTCTCAGTG CTGTTCTGCC TGCAGGCATG TCGCTACCGA GTCGCGTGAG
AGCCAAGCAC GATTCGCCTG CGTCGCCTGC GGATATGAGG ACAACGCCGA TGTGAACGCG
GCTAGGAACA TCGCGGAGGG ACACGCCGTG ACTGCGCGGG GAGGCATCGG ACTGCCGAAG
CCCATGAACC GCGAACCTCA ACTAACCGCA CCTCCTCCGG TCACTGCGTG A
 
Protein sequence
MSRFRLYPAS EQAAVMEAHC GHARFVWNLA VEQQSWWTPR RGPAPDYHEQ SRQLTEARRE 
FPWLTEGSQT VQQQALRDFA QAMDNYFRGS HRKPTFRKRG RSEGFRIVAV KSSDIRTVNR
RWSEVRVPKV GWVRFRRSRT VPKAKSYRVT RDRAGRWHVA FAAIPEPIDA PGTGATVGVD
RGVAVSAALS TGELLSCPRL PPTEAQRLVR LQRRLARAKR DSNRRSRLKA QIARVKAREV
DRRKDWVEKT STDLSRRFDL IRVEDLKVRN MTRSARGTGE APGRNVRQKA GLNRAILASG
WGLLVQRLED KAPGRVEKIP AAYTSQCCSA CRHVATESRE SQARFACVAC GYEDNADVNA
ARNIAEGHAV TARGGIGLPK PMNREPQLTA PPPVTA
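
The gene and protein sequences above are printed in blocks of 10 characters, and the record lists translation table 11. The sketch below shows one way to join such blocks, compute a GC fraction, and translate a coding sequence. It makes several assumptions: the helper names (`clean_sequence`, `gc_fraction`, `translate_cds`) are hypothetical, `START_CODONS` holds only a common subset of the start codons allowed by table 11, and an initial start codon is read as M. Only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text):
    """Join a block-formatted sequence (spaces, line breaks) into one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating GTG or TTG is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence (start of the CDS).
head = clean_sequence("GTGTCCCGGT TCCGGCTTTA")
# Last line of the gene sequence; it begins at base 1141, so it is in frame.
tail = clean_sequence("CCCATGAACC GCGAACCTCA ACTAACCGCA CCTCCTCCGG TCACTGCGTG A")

print(translate_cds(head))  # MSRFRL
print(translate_cds(tail))  # PMNREPQLTAPPPVTA
```

The two translated fragments agree with the start and end of the listed protein sequence. Passing the full 1191 bp gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content.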