Gene Sfum_3341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3341 
Symbol 
ID4458334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4093633 
End bp4094913 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID639704113 
Productglycosyl transferase, group 1 
Protein accessionYP_847449 
Protein GI116750762 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTTC TGGTCGCACA AATAGGGGCG AGACGCCACT ACGCGGTACC CAGGGTTCTG 
CATGGGGCAG GCATGCTGGA TAGGTTGGTA ACCGACGCCT GCGGGGACCT GTTTCCCTGG
TCGCTGGCTG ATCTCGTTCC GGCGTTTCTT CGCAGAGACG CTCTCTCGCG ACTGGCAGCC
CGTTCCTCCG GTTTGCCCCG TGGAAAAGTG ACGGGCCTTT TCATCTTCAC CGTTTCCGCT
TTGCTCGGTC CCTCGCGACG CAAACCCGGA GGGAACAACA TCTCCTGGTG GCTGCATCGC
AACAGACGGT TCAATGAGCT CGTGATCGGG AAAGGCTTTG GCTCTGCTGA CGGGGTATAT
GTCTTCAATG GAGCGGGGTT GGAGGTCTTG AAGGAGGCGA AAAGACTGGG CCTTTTCCGC
GTGTTGGACC AGACCTCCGC CCCGATGCGA CACGATGCCG AGTTGTTGAT CTCGGAATCC
GAACGGTGGC CGGACTGGTC GACCGAAGCC GACTCCATCT ACAATTGGCT TCCAATGGCC
GAGCGCGAAG AAGAAGAATG GCAACTGGCG GACCGGATTA TCTGCGGGTC GGAATATGTA
GCGGAAGGCA TTCGGGTATT GGGGGGCCCC GCAAAGAAAT CCCGCGTGGT CCCTTACCCA
TGCGATCATG GCATGCAGCC ACCGGCAACT GAACATGTCA CGCCGGCGGA AAAGCAGGAA
AAGCGATGCG GACAGCGGAG CAACAATTTG ATGCAAAGCC GCACGCGATC TACAGCCGGG
TTGCACATCC TTTTTGCCGG CACACTCAAC TTGCGCAAGG GGCTCCCCTA TCTTCACGAG
GCCATACGGC TATTCGGAGC TGGCCGCGCG AAGTGGCGGA TTGTGGGTCC ACCTGCGATC
AGTCGGCAAG CTCTTGCCAT GCTCGGGAGG GTGGCGGAGG TACGAGGGGC AGTGCCGCGC
TCGGAAATGG TTGAGCACTA CGCGTGGGCA GACATCCTGG TGTTGCCCAC CATCTCCGAG
GGGTCCGCCA ACGTTTGTTA TGAGGCACTT GCCCGAGCGG TGCCGGTGAT CACGACACCC
AATGCCGGCT CCATAGTCCG CGATATGATC GATGGCTTCA TCGTCCCTGT TAGATCCCCA
GCTTCGATCG CCGAGAAATT GGCTCTCCTG GAAGGAAATC CCTCATTATT ACAGGAAATG
TCACTTCGGG CTCTTGAACG ATCACGAGAA TTCACGGTGA ATACATATGC ATTCTATTTG
GAAAATGCCA TAAAATACTG A
 
Protein sequence
MRLLVAQIGA RRHYAVPRVL HGAGMLDRLV TDACGDLFPW SLADLVPAFL RRDALSRLAA 
RSSGLPRGKV TGLFIFTVSA LLGPSRRKPG GNNISWWLHR NRRFNELVIG KGFGSADGVY
VFNGAGLEVL KEAKRLGLFR VLDQTSAPMR HDAELLISES ERWPDWSTEA DSIYNWLPMA
EREEEEWQLA DRIICGSEYV AEGIRVLGGP AKKSRVVPYP CDHGMQPPAT EHVTPAEKQE
KRCGQRSNNL MQSRTRSTAG LHILFAGTLN LRKGLPYLHE AIRLFGAGRA KWRIVGPPAI
SRQALAMLGR VAEVRGAVPR SEMVEHYAWA DILVLPTISE GSANVCYEAL ARAVPVITTP
NAGSIVRDMI DGFIVPVRSP ASIAEKLALL EGNPSLLQEM SLRALERSRE FTVNTYAFYL
ENAIKY