Gene Franean1_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1456 
Symbol 
ID5669860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1751686 
End bp1753077 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content52% 
IMG OID641240376 
Productputative RNA methylase 
Protein accessionYP_001505802 
Protein GI158313294 
COG category[L] Replication, recombination and repair 
COG ID[COG1041] Predicted DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.270738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGGG GCGCTACCGG GGGTGCGCCT GCACCATGGC GCTGTGAGGG ATGCTGGATC 
CGGCGTGGGA GGACATGTAA CTTGGGGCTA ATCGAAGAGC CGGATGATCT AGATATCTCT
AGCGACAGTG AGCGTCTGCT GGTCCACAGC CTCTTCCGTT TTCCGGCCAA ATTTCATCCG
CCGGTGGCTC AAGCTCTTAT CCGTAATTTC TCGGAGCCGG GAGACCTGGT TCTAGACCCC
TTCTGCGGAA GTGGAACGCT TCTAGCTGAG GCGGCCCGGA TGAGGCGGAG ATCTATTGGT
ACTGACGTCG ATCCTGTCGC AGTGTCGGTG AGTTCTGCCA AAACTGGGCT CCTAGTGGAA
AGTGAACTAC TCGAGGCTGC TTCAAGTTTG TTGGCCGCAC TTGACGAGAT TGCACCGCCG
CAGGAGATAT ATGAATCACG GAAATTCGAA GATATCTCAC AGGAATCCTT GAAAGAAAAT
CTTGATCGGG AATCCTTGTG GGTTCCCGAT ATTCCTAATC TGGATCATTG GTTTCGTCGG
TATGTCACTC TTGACCTGGC CCGCATCTAC AGGTGTGTAA TAGATCTTCA GTGTAGTTCT
GCGATCAAGA GCTATTTGCT GGTTGTATTT GCTTCCGTCA TTCGCAACGC GTCAAACGCG
GACCCGGTCC CGGTGTCTGG TCTTGAAGTC ACAGCACATA TGAAAAGGCT CGATGCGGCA
GGAAGAGTTG TCAACCCATA CTTTCTCTTT CGGAAAGCAA TGGGAAAGGC TGTTGCGGCT
TCGAAGGAGT ATGGCGAGCA AGTCTCTCCG TACTTTGAGC CAAGGATCTG GGAAGCGGAC
GCTACTTCCT TAGACCTCGA CTCCGAAATG TGTGATTTGG TAATTACTAG TCCGCCGTAT
CATAATGCTG TCGATTACTA TCGGCGTCAT CAGCTTGAGA TGTTCTGGCT GAGGCATACC
CGATCTCAGA AAGAAAGACT AGACCTACTT CCGAAATATA TTGGCCGCCA TCGAATACCT
CGGAAGACCC CGATTCTCGC AACTGACGAG AAGTTGCCAG CACTTGCGGC ACACTGGGAA
AGCGAGATGG CGCAAGCGAG TATTCAGCGC GCAGTGGATT TTCGTCACTA CGCACTCAGT
ATGCGAAACG TATTTAGGCG ATTGTCGCAG GCGGTAAAGA TCGGGGGAAA GGTTGTTCTG
GTTGTAGGCC GAAGCAGTTG GAACGGCGAC CGCATCCCGA CCGATGATCT TTTTGTCGAA
CTTGCTTCCG AGAACTTCTC CTCGGTAAGA CTAATGTCAT ACCCAGTGAA AAATAGGTAC
ATGAGCTACG CTCGGAGGAA TAGCGCTAGC ATCGACCGTG AATATGTGCT GGTGTTGCAA
AGAAAGATCT GA
 
Protein sequence
MRRGATGGAP APWRCEGCWI RRGRTCNLGL IEEPDDLDIS SDSERLLVHS LFRFPAKFHP 
PVAQALIRNF SEPGDLVLDP FCGSGTLLAE AARMRRRSIG TDVDPVAVSV SSAKTGLLVE
SELLEAASSL LAALDEIAPP QEIYESRKFE DISQESLKEN LDRESLWVPD IPNLDHWFRR
YVTLDLARIY RCVIDLQCSS AIKSYLLVVF ASVIRNASNA DPVPVSGLEV TAHMKRLDAA
GRVVNPYFLF RKAMGKAVAA SKEYGEQVSP YFEPRIWEAD ATSLDLDSEM CDLVITSPPY
HNAVDYYRRH QLEMFWLRHT RSQKERLDLL PKYIGRHRIP RKTPILATDE KLPALAAHWE
SEMAQASIQR AVDFRHYALS MRNVFRRLSQ AVKIGGKVVL VVGRSSWNGD RIPTDDLFVE
LASENFSSVR LMSYPVKNRY MSYARRNSAS IDREYVLVLQ RKI