Gene Sfum_3468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3468 
Symbol 
ID4458205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4235017 
End bp4236357 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content64% 
IMG OID639704240 
ProductUDP-N-acetylmuramoylalanine--D-glutamate ligase 
Protein accessionYP_847574 
Protein GI116750887 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.694079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0198474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGCA GGGCGCTGGT CGTCGGCATG GGAGTGTCCG GCCGGTCGGT GTGCGAGTTG 
CTCTTGCGCA ACGGAGTCGA GGTCGTCGCG ACGGACCTCA GGCCGCTCGA CCGGTTCGGC
GGCACCCTGG ACGAGCTGCG TGCGAAAGGC TGCCGGTTAA GGTTGGGGGA ACATCACCCG
GATGATTTCC TGAACGTCGA CCAGATCATT GTGAGTCCGG GAGTGCCGTC GCTGCTCGAA
CCGCTGCGTG AAGCCCGCCT CAGGGGTATC GAGATCGTGG GCGAATTCGA ATGGGCCTGG
CGCCAGGTGG ACGCGCCCGT GATCGCGGTC ACCGGGACCA ACGGCAAGAC GACGACCACC
GCTCTTATCG GGGAAATGAT CAAAGCGTCC GGCACGCGCG TGTTCGTGGG GGGCAACATC
GGGACGCCGC TGAGCCGGTG GCTGCTGGAC GGAGACCGGG TGGACTGCAT GGTGCTCGAA
GTGAGCAGTT TTCAACTGGA CACGGCGTTC CTGTTCAGGC CCGAGGTGGG AGTCTTGCTG
AACGTGACCG AGGACCACCT GGATCGATAT CGCGATTTTG AAGAGTACAC GGAATCCAAG
CTCTCGATGT TTGGGCGCCA GGAGTCCACG GATGTCGCGG TGATCAACCT GGACGACCCG
GTCTGCGGCT CGAGGCCTTT CAACGGAAAG GGCAGGCTTC TGACCTCGAG CCGGAATGAT
CCGCGCACGC ACGCCCATGT CGAGGACGGG CGGATCGTCG TGAACGTTCC GTGGAAGCCG
GAGTTCCGTC TCGACCTGGC GGATCTGCGG CTCAAAGGGG TCCACAACGA GGAGAACGTG
CTCGCCGCCA TTCTTGCATG CCTGGCCATG GACGTAGTCC CCGAGGCCGT CGCGCGGGCC
GCCGGGACCT TTGGCGGCCT GCCCCACCGT GTCGAATGGG TTCGAGCGGC CGGGGGCGTC
GATTACTACG ACGATTCCAA GGGGACCAAC GTCGGCGCCG TCGTCAAGGC GATCGAAAAT
TTCGATCGAC CCGTTCTCCT CTTGTTGGGG GGAAGGGACA AGCTGGGCTC CTACGCTCCC
ATTGCCGAGC GGATGAGGAC CAGGGGCAAG GGGGTGTTCG TGTTCGGAGA ATCGGCTCCA
CGAATCCACG CAGAACTGCG CGACAAGGTT CCCATCCGGT TGTTTCCCGA TTTGGAGGGT
GCGTTCTCGG CCGCCGTGGA ACGGGCGCAG GCGGGAGACA TCGTGCTCCT TTCCCCGGCC
TGTTCGTCTT TCGATCAGTA CGAGAGCTAC GCGCAGAGGG GGGACCATTT CAAGAAACTC
GTAGCCGCCC TTCCGGGGTA G
 
Protein sequence
MPGRALVVGM GVSGRSVCEL LLRNGVEVVA TDLRPLDRFG GTLDELRAKG CRLRLGEHHP 
DDFLNVDQII VSPGVPSLLE PLREARLRGI EIVGEFEWAW RQVDAPVIAV TGTNGKTTTT
ALIGEMIKAS GTRVFVGGNI GTPLSRWLLD GDRVDCMVLE VSSFQLDTAF LFRPEVGVLL
NVTEDHLDRY RDFEEYTESK LSMFGRQEST DVAVINLDDP VCGSRPFNGK GRLLTSSRND
PRTHAHVEDG RIVVNVPWKP EFRLDLADLR LKGVHNEENV LAAILACLAM DVVPEAVARA
AGTFGGLPHR VEWVRAAGGV DYYDDSKGTN VGAVVKAIEN FDRPVLLLLG GRDKLGSYAP
IAERMRTRGK GVFVFGESAP RIHAELRDKV PIRLFPDLEG AFSAAVERAQ AGDIVLLSPA
CSSFDQYESY AQRGDHFKKL VAALPG