Gene Sfum_3471 details

Gene Information

Locus tag: Sfum_3471
Symbol:
ID: 4458208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4239056
End bp: 4240093
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 60%
IMG OID: 639704243
Product: UDP-N-acetylenolpyruvoylglucosamine reductase
Protein accession: YP_847577
Protein GI: 116750890
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0812] UDP-N-acetylmuramate dehydrogenase
TIGRFAM ID: [TIGR00179] UDP-N-acetylenolpyruvoylglucosamine reductase
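
The coordinate and length fields in this record are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(4239056, 4240093)
print(length_bp)                     # 1038
print(protein_length_aa(length_bp))  # 345
```

Both results match the Gene Length and Protein Length fields listed for Sfum_3471.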


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.306486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0427708
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACA ACAAGCGCGG GTCCAGTTAT TTGAATTCAC TCCCGGAAGG GTTGAGCGCA 
GAGGAATGCG ATCGTTATTC GGCTGTTGTT TCGCTGTGCT GGAATGTTCT GGGGGAGCTT
CAGGACGTCG AATTCAAATG GCACGAACCG CTTGCATACC ACACCACGTT TCGAGTGGGA
GGGCCCGCGG CGTGTCTTGC GCGTCCGCGT TCGGAGAGCG CCTTGCTGGC GCTCCTGGAA
AGAGTGCGGG AGAATTCCGT GCCCTATGTG GTGCTTGGCG GTGGAAGCAA CGTGCTCGTA
ACCGACGGGC CAATCCCGGC GCTGGTGATT CAACTCATCC ACGTGGCCGC GGGCCTTGCC
TTCAACAAGG GACGCTCGAG CTCGCGACCT CTCGTGGTTG TCGGAGCCGG CGTCCCGATT
TCTCGTTTGC TCAGGTTCTG TGTCCGCAAC GAACTGGGCG GCCTGGAATG CCTCGTGGGG
ATTCCAGGCT CGGTCGGAGG GGCGGTGGTG ATGAACGCCG GGACGGCTGA AGGAACCATC
GCCGAAGCCC TTGAATGGCT CGACGCCCTG GATGGAGCGG GGCAGAGACA ACTCGTTTTC
AAAGCGGACC TGCCGGCCGG GTACCGGAGC ATGGGGCTGC CCGAAGCCTG GCTGATCCTG
GGAGGTGCCT TCCGCCTTCA TGTCTCTTCG GGCCGATCGC TCAAAAGGGA AATGCGGTCG
CTGATGGTCC GCCGGAAAGC GACACAGCCG CTGGGGCGGC CTTCCGCCGG CTGCGTGTTC
AAGAATCCCG TCGAAGCTCC CGCCGGGGCG CTGATCGAGC GGGCGGGATT GAAGGGATTT
CGGATGGGAA ACGCGCAGGT TTCCGACAAG CATGCGAACT GGATCATCAA TCTGGGCAGT
GCCCGGGCCC GGGATATCCT GGCCCTGATC AGCCTGGTGG AAAATGAAGT TTTTGGAAAA
TTCGGCGTGC GTTTGGAGCG AGAAATTCGT ATACTCTCGC CAGAAAAAAA TTCCCTGAAT
CAGATGCTGA ACTCATGA
 
Protein sequence
MIDNKRGSSY LNSLPEGLSA EECDRYSAVV SLCWNVLGEL QDVEFKWHEP LAYHTTFRVG 
GPAACLARPR SESALLALLE RVRENSVPYV VLGGGSNVLV TDGPIPALVI QLIHVAAGLA
FNKGRSSSRP LVVVGAGVPI SRLLRFCVRN ELGGLECLVG IPGSVGGAVV MNAGTAEGTI
AEALEWLDAL DGAGQRQLVF KADLPAGYRS MGLPEAWLIL GGAFRLHVSS GRSLKREMRS
LMVRRKATQP LGRPSAGCVF KNPVEAPAGA LIERAGLKGF RMGNAQVSDK HANWIINLGS
ARARDILALI SLVENEVFGK FGVRLEREIR ILSPEKNSLN QMLNS
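
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that step, together with a GC-fraction calculation of the kind that underlies the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence above, used only as a small example.
fragment = clean_sequence("ATGATCGACA ACAAGCGCGG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.55
```

Applying the same two helpers to the full gene sequence would give a value comparable with the GC content field, subject to how the database rounds it.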