Gene Franean1_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1080 
SymbolmnmA 
ID5669494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1286221 
End bp1287288 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content76% 
IMG OID641240012 
ProducttRNA-specific 2-thiouridylase MnmA 
Protein accessionYP_001505442 
Protein GI158312934 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0482] Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain 
TIGRFAM ID[TIGR00420] tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.997084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGGA TGAGGGTGCT CGCGGCCATG TCGGGCGGCG TCGACTCGGC GGTGGCCGCC 
GCCAGGGCCG TCGACGCCGG CCATGACGTG ACCGGCGTCC ACCTGGCGCT GTCGCGCTCG
CCCGCCTCGG ACCGGGTCGG TGCCCGTGGG TGCTGCACGC TCGAGGACGC CCGCGACGCC
CGCCGTGCCG CCGACGTGCT CGGCATCCCC TTCTACGTCT GGGATCTCGC GGAGCGTTTC
GAGACCGAGG TCATCGACGA GTTCGTCGCC GACTACTCGG CCGGGCGCAC GCCCAACCCC
TGCGTGCGCT GCAACGAGCG CATCAAGTTC GCCGCGGTGC TGGAGAAGGC GCTCGCGCTC
GGCTTCGACG CGGTCGTCAC CGGGCACCAC GCCCGGCTGG ACGAGGACGG CACGCTGCGC
CGCTCGGTGG ACCCGGCGAA GGACCAGTCG TACGTGCTCG GCACCCTGCG GCCCGAGCAG
CTCGCCGCGG CCCGGTTCCC GCTCGGCGAC TCGACGAAGG CGCAGGTCCG CGAGGAGGCC
GCGCGCCGTG GCCTCGCCGT CGCCGACAAG CCGGACAGCC ACGACATCTG TTTTGTGGCC
TCCGGGGACA CCGGCGCCTG GCTGCGCGAG CGCCTGGGGC CGCGCCCGGG CCCGGTCGTC
GACGCGGAGA CCGGCGAGAC CCTCGGCGAG CACGACGGGG CGTACGCCTT CACAGTCGGG
CAGCGCCGGG GGCTCGGGCT GGGACGTCCC GCGCCGGACG GCCGTCCGCG CTACGTGCTG
GAGATCTCCC CGGCGACGTC GACGGTGACC GTCGGCCCGT CCAGCGCGCT GGACATGCGC
AGCCTGGTCG CGGAACGGAT CGTCTGGCCC CATGACGGGC CGGTGTCGTG CACCGCGCAG
GTCCGGGCGC ACGGCGGGGT GGTCCCCGCG GTGGTGACGG CCCGGGGCGA CGAGCTCGAC
GTGCGGCTGG CGGAGCCGGT GCGCGGCACC GCGGCGGGCC AGGCCGTGGT CTTCTACGAC
GGCGACCGGG TGCTCGGCGG CGGGCGCATC CGCTCCCTGG CAGCCTGA
 
Protein sequence
MSRMRVLAAM SGGVDSAVAA ARAVDAGHDV TGVHLALSRS PASDRVGARG CCTLEDARDA 
RRAADVLGIP FYVWDLAERF ETEVIDEFVA DYSAGRTPNP CVRCNERIKF AAVLEKALAL
GFDAVVTGHH ARLDEDGTLR RSVDPAKDQS YVLGTLRPEQ LAAARFPLGD STKAQVREEA
ARRGLAVADK PDSHDICFVA SGDTGAWLRE RLGPRPGPVV DAETGETLGE HDGAYAFTVG
QRRGLGLGRP APDGRPRYVL EISPATSTVT VGPSSALDMR SLVAERIVWP HDGPVSCTAQ
VRAHGGVVPA VVTARGDELD VRLAEPVRGT AAGQAVVFYD GDRVLGGGRI RSLAA