Gene Sfum_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3337 
Symbol 
ID4458330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4088882 
End bp4090135 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content64% 
IMG OID639704109 
Productglycosyl transferase, group 1 
Protein accessionYP_847445 
Protein GI116750758 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.809351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCC TGCACGTCGT CAGGGGGGTG AACCAATCCT CGGGCACCAC TCACGCCATT 
CTTCCCATGG CCGAGGAACA GGCGCGGCGA GGACACGAGG TGTGGCTCTA TCACGTCCGC
AAACCTCCCG GCGTGCTCGA GGCGGCTCCC GACCCCACAT TTGTGAAGAC CCGTGTCTTC
GATCTGTCCC TTCCCTTCGA CCACCCCGGA TTCTCCACAA GCTTTGCCCG CGCGGTGTCG
CGGGACATCG GCCGCTTCGA CGTGGTTCAC ATCCAGGCCG TGCGGAATTT CGCGACCTGG
TGGACCATGC GTTGCGCTGC CGGAGCGGGT GTCCCTTACA TCGTCGCTCC CCAGGGGTCG
TATGAAGACT GGAACCTGGG CCGACGGTCC GTGAGAAACC GTTTATATGA CCGGTTCTTC
GAAATCCCTC TCCTGAACCG CGCGGCGCGG GTCCATTGCC TGACCCCGCG CGAAGTTGAG
CAGGTCCGGG CGATGGGGGT CACCGCGAAG TGCGTGGTCA TTCCCAACGG CGTGCTCGTC
GACGGAGCTC TGCCAGGGCC GGGACTGGAA CCAAATCCCA AACGAAACGC TTACAAGAAA
ACGGACGGAT TGCGGCGGCT CCTTTTCCTC GGTCGCATCC ACCCCAAGAA GGGACTCGAC
CTGTTGCTGC CCGCCTTTGC GCAGGCGGCT GAAAAGCTGC TCGACCTGCG GCTGGTGATC
GCGGGGTCCG ACAACGGGAG CGGGGAACTC GTCAAGACGA TCGCCGCCGC GGAAGCGCTC
TTCCCCGGCC TGGTGGACTC CTCCTCTGCC CGGCCAACCC CGGCGTCCGC CCGCCGTCAC
GGTGCCGCGA TCCGCGAGGA GAATCGTGCC CCGGCTCGTA TCGTGTTTCT GGGAGAGGTG
AAAGGCCGGG CGAAAGAAGC CTGCTTCGCC CTGGCCGACG CCTTCATCCT GCCTTCGTAT
TCCGAGGGAT TGCCCGTGGC GGTGCTTGAG GCCCTCGCAC ACGGGTTGCC CACCATCGTG
ACCGACGGCT GCAACCTCCC CGAAATCGCC CGGGAAGGAG CCGGAGTGCA GGCCGATACC
ACGCCCGACG GCGTGGCGGA TGCAATTTTG AGGCTGTTCG CGGATTCAAC CGCGCTGGCT
TCCTGTGCCC AAGCCGCACG AAACCTTGCG CTGGAACGCT TCTCCTGGCC GAAGATCGTT
GACCGCCTGC TTGCGGTCTA CGCGGCTCTG CCAATAGCTC ATTCATTTGG TTGA
 
Protein sequence
MKILHVVRGV NQSSGTTHAI LPMAEEQARR GHEVWLYHVR KPPGVLEAAP DPTFVKTRVF 
DLSLPFDHPG FSTSFARAVS RDIGRFDVVH IQAVRNFATW WTMRCAAGAG VPYIVAPQGS
YEDWNLGRRS VRNRLYDRFF EIPLLNRAAR VHCLTPREVE QVRAMGVTAK CVVIPNGVLV
DGALPGPGLE PNPKRNAYKK TDGLRRLLFL GRIHPKKGLD LLLPAFAQAA EKLLDLRLVI
AGSDNGSGEL VKTIAAAEAL FPGLVDSSSA RPTPASARRH GAAIREENRA PARIVFLGEV
KGRAKEACFA LADAFILPSY SEGLPVAVLE ALAHGLPTIV TDGCNLPEIA REGAGVQADT
TPDGVADAIL RLFADSTALA SCAQAARNLA LERFSWPKIV DRLLAVYAAL PIAHSFG