Gene Franean1_5095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5095 
Symbol 
ID5673430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6097951 
End bp6099600 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content78% 
IMG OID641243946 
ProductUDP-N-acetylmuramate--alanine ligase 
Protein accessionYP_001509360 
Protein GI158316852 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0773] UDP-N-acetylmuramate-alanine ligase 
TIGRFAM ID[TIGR01082] UDP-N-acetylmuramate--alanine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.279001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.025809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGAC CGCACTCCGA CGACGCCCCG GCCGCCCCAG CAGGCCCGGC CGCCGCGCCG 
CCGCCCGTCG TCCCCGATCC CCGCGACCGC CCCGAGCGGG CCCGGCGGGT CCACTTCCTC
GGCATCGGCG GCGCCGGGCT GTCACCCCTC GCCCAGATCC ACCTGGCCGC CGGCGGCCAG
GTCTCCGGCA GCGACCAGGA GTACTCACCG CGGGTCGCCA TGCTGCGTGA GCTGGGGGTC
CCCGTCCGGG TCGGCCCCGC TGCCGACGCG GCGGCGCTGG CCGCCGAGCT GTCCGCGGCG
GACGTCGTCG TGGCCTCCAG CGCGCTGCGT GACGACCACC CGGAGATCAT CGCCGCCCGC
GAGCTCGGCG TCCCCGTCCG GCGGCGCTCG GACTGGCTGC CGGAGCTGAC CGCGCCCTAC
CGGCTCGTGG CGGTCGCCGG CTCACACGGC AAGACCACCA CCGCGGCGAT GCTGACCCTG
GTGCTCGCGG CCGGCGGCGC CGACCCGACC GCGGTCATCG GTGCCGATGT GGCCCAGTTG
GGCGGCAGCG CCGTGACCGG ACGGGGCGAC GTGTTCGTCC TGGAGGCGGA TGAGTACGGC
GGCGCGTTCG CCGGGCTGGA CCCGGAGCTC GCGGTGATCA CCAACGTCGA ATGGGAGCAT
CCGGACCTCT TCCCGGACGA GGCCTCGGTG CGCGCGGTGT TCGCCGACTT CGCGGCCCGG
GTCCGCCCGG GCGGGCGGCT GGTCGTCTGC GGCGACCATC CCGGCGTCGC CGCGGTGCTC
GCAGAACTCG ACCGGCGGGC CGAGCCCGAC CGGCGCGCCG ACGCGCCGCG GGCCCGGGTG
GTCGACTACG GCTTCGCCCC CGGGCGGACG TGGCGGGCGG TGGACTACGT CCCGCTGCCC
GGCGGCGGCT CCACCTCGAC CGTGCTGCAC GACGGGAACC GGGTCGGCGA GCTCACCCTG
GCGCTGCCCG GCCCCCACAT CGCCCTGGAC GCCCTCGCCG CCCTCGCCAC GGCCGCCGAG
CTGGGGGTGC CGCCCACGGA CGCCCTGAGG ACGCTGGGGA CGTACGCGGG CGCGGCCCGG
CGCTTCGACG TCGTGGGCCG GGCGAGCGCG CCCGGCGGCA CCCGCGTGGA GATCGTCGAC
GACTACGCGC ACCACCCGAC CGAGATCCGC GCCACCCTGC GCGCGGCCCG CGACCGCGCC
GCCGGCCGGC AGGTCTGGGC GGTCGCGCAG CCGCACACGT TCAGTCGCCT CGCCGCGCTG
CTCGACGACT TCGCGACGGC CTTCGGCGAC GCCGACCGCG TCTACGTGAC CGACGTGTAC
GCCGCGCGCG AGACCGACAA CCTGGGCCTG CACGCGTCCG ACCTCGCCGG GCGGATCAGG
CGGCCGGCCG CCGCCGGCTA CGTGGCCTGG CCCGACCTGC CGGGCCGGCT CGTCGCCGAC
CTCGCCGACC TCGGCGGCCG CGGCGGCCGC GGGGACCTCG GCGACGCGGC GGCCGGCGGC
ACGCACGGCG TCATGCTGCT CACCCTCGGC GCCGGAACGA TCACCACGCT CGGCCCGCGC
CTGCTGGACC TGCTCACCGC CGGCGGCACC GCCGCACTTT CCGGGAGCAG CCGGGCGGGC
GACCAGGACG CGACGGGCCT GCCGACCTGA
 
Protein sequence
MTGPHSDDAP AAPAGPAAAP PPVVPDPRDR PERARRVHFL GIGGAGLSPL AQIHLAAGGQ 
VSGSDQEYSP RVAMLRELGV PVRVGPAADA AALAAELSAA DVVVASSALR DDHPEIIAAR
ELGVPVRRRS DWLPELTAPY RLVAVAGSHG KTTTAAMLTL VLAAGGADPT AVIGADVAQL
GGSAVTGRGD VFVLEADEYG GAFAGLDPEL AVITNVEWEH PDLFPDEASV RAVFADFAAR
VRPGGRLVVC GDHPGVAAVL AELDRRAEPD RRADAPRARV VDYGFAPGRT WRAVDYVPLP
GGGSTSTVLH DGNRVGELTL ALPGPHIALD ALAALATAAE LGVPPTDALR TLGTYAGAAR
RFDVVGRASA PGGTRVEIVD DYAHHPTEIR ATLRAARDRA AGRQVWAVAQ PHTFSRLAAL
LDDFATAFGD ADRVYVTDVY AARETDNLGL HASDLAGRIR RPAAAGYVAW PDLPGRLVAD
LADLGGRGGR GDLGDAAAGG THGVMLLTLG AGTITTLGPR LLDLLTAGGT AALSGSSRAG
DQDATGLPT