Gene Franean1_5097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5097 
Symbol 
ID5673432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6100453 
End bp6101952 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content78% 
IMG OID641243948 
ProductUDP-N-acetylmuramate--alanine ligase 
Protein accessionYP_001509362 
Protein GI158316854 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01082] UDP-N-acetylmuramate--alanine ligase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.393339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.052337 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCGC GAGATCTCCC GGAGGGCTGG CGGCGGGTCC ACCTCGTCGG CATCGGCGGG 
ATCGCCATGA GCGGTCTGGC ACGGTTGCTG GTCGCGCGGG GCGCGACGGT CTCCGGCAGC
GACGCGGTCG AGTCCCGGCG GCTGGCGTCG CTGCGCGCGC TCGGCGTCCC GGTCACGGTG
GGCAACGGCC CCGACAGGTT CGACGCCTCG CGCCTCGACG GGGTCGAGCT GGTCGTCGTC
GCGCCAGCGG TGCCGGGCGA CGACCTGGAG CTCGCCGAGG CGCGCCGGCG CGAGCTGCGG
GTGCTGACCC GCTCCGCCGC GCTCGCCGGG CTGATGGCCG GGCACCGAGG CGTGGCGGTC
GCGGGCTCGC ACGGCAAGAC CACCGTGGCG ATGATGCTCA CCGCCGCCCT GCAGGCCTGC
GGCGCGGACC CGACGTTCGC GGTCGGCGGG GATCCGGGCG AGGCCGGCTC GCACACCCAC
GCGGGCAGCT CCGAGCTGAT GGTCGTCGAG GCGGACGAGG ACGCCGGCGC GTTCTGGCAG
CTCCAGCCGT ACGGCGCGGT GCTGACAGGG GTGGCCGCCG AGCACCTCGA CCACTACCGG
ACGATGCCCG CCCTCGCCGC CTCGTTCGCG ACGTTCCTGC GCCGGGTCGA CCCGGGCGGC
TTCCTGGTCG CCTGCGTGGA CGACGCGGCC GGGTGGGCGC TCGCCACGGC GGCGGCCGAC
CACGCCGACC GCTGGCGCCG GGCCGGCGCC AGCGCCAGCG CCAGCGCCCC GGACGGTGGC
CGGCCGTGGC TGACGGGGTA CGGGTTCGGG CCGTCGGCGG ACGTCCGGCT CGTCGCCGAG
GAGATCTCGA TCGCGGGCAC CAGCGCGGAG GTCGTCGTCC ATGGCGTCCG GCTGGGCCGG
CTGTCGCTGC GGGTCCCCGG CCGGCACCAC CTCCTGGACG CCGCGGCGGC CCTGGCCGCC
GGGATCGCCC TGGGCGCCCC GCCGGCCGGC CTGCTGGCCG GGCTGACCGA GTTCGCCGGG
GTCCGCCGCC GCTTCGAGTC GCTCGGGTCG GCGGGCGGGG TGCGGGTGGT CGACGACTAC
GCCAACCACC CGGACCGGGT GGCCGCCGCG GTCGAGGCGG CGCGGGCGGC TGCCGGCGGC
GGCCGGGTCG TCGTCGCGTT CCAGCCGCAC CTGTACAGCC GCACGGCCCT GCTCGCCGAC
CGGTTCGGTG CCGCGCTGGG CTCGGCCGAC GCGGTCGTGG TGATGGACGT CTACGGCGCC
GGGGAGCAGC CCGAACCGGG CGCGGGCGGT GCCCGGGTCG CCGCGGCCAC CCGGTCCGGG
GCGGCGCGGG TCGGGGCGGC GCGGGTCGTG TACGAGCCGT CGTGGTCGGC CGTCCCGGGC
GTGCTGATGG ACCTGGCACG GCCGGGCGAC CTGGTCATGA CGCTCGGCGC CGGGGACGTG
ACGCAGGTCG GCCCGGAGCT GCTGCGTCTG CTGGCCGAAC GGTCGGCCCT GCCGGGCTAG
 
Protein sequence
MTARDLPEGW RRVHLVGIGG IAMSGLARLL VARGATVSGS DAVESRRLAS LRALGVPVTV 
GNGPDRFDAS RLDGVELVVV APAVPGDDLE LAEARRRELR VLTRSAALAG LMAGHRGVAV
AGSHGKTTVA MMLTAALQAC GADPTFAVGG DPGEAGSHTH AGSSELMVVE ADEDAGAFWQ
LQPYGAVLTG VAAEHLDHYR TMPALAASFA TFLRRVDPGG FLVACVDDAA GWALATAAAD
HADRWRRAGA SASASAPDGG RPWLTGYGFG PSADVRLVAE EISIAGTSAE VVVHGVRLGR
LSLRVPGRHH LLDAAAALAA GIALGAPPAG LLAGLTEFAG VRRRFESLGS AGGVRVVDDY
ANHPDRVAAA VEAARAAAGG GRVVVAFQPH LYSRTALLAD RFGAALGSAD AVVVMDVYGA
GEQPEPGAGG ARVAAATRSG AARVGAARVV YEPSWSAVPG VLMDLARPGD LVMTLGAGDV
TQVGPELLRL LAERSALPG