Gene Sfum_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3123 
Symbol 
ID4458533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp3836924 
End bp3838171 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID639703894 
Productglycosyl transferase family protein 
Protein accessionYP_847231 
Protein GI116750544 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00523086 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCCCTG AAAGGGATCG GCCCTCGGGT CCGCGAATCC GCCGGCCGCA TCGATTGAAC 
CCATGGCATG AAATGCAGGC TCCGATGCTC ACCGATTGTG TCGTTTTTTT CCTGTCCCTG
CTGTCCCTTG CCGGGTGCGC CTATTACCTC ACCTGCATCG TTGCCGCACG GCGGTTCTTT
TCCCGCCCGC GCTCCGCTGC GCCGGTCTCC GCGGTCCGAC CCGCCTCGAT TCTCATTCCT
CTGTGCGGCG CCGATTTCCA GGCCTATGAC AACTACGCTT CGTTCTGCCG CCTGGACTAC
CCCGAGTTTC AGCTCGTGTT CGGCGTCCAG GACCCCATGG ACAGTTCGAT CCCGGTGGTG
GAACGGCTGA AGGAGAACTT CCCTCACTGC GACATTCATC TGGTGATCGA CTCGAAGGCC
ATCGGGACAA ACCCCAAGGT GAGCAATTTG AACAACATGC TCGCGGCCGC CCGGCATGAA
TTGATCGTAA TTGTGGACAG CGACATCCGG GTGGAAGCGG ACTACCTGTC CACCCTGGTT
CCGGAGCTCG CGGACGAGCG TATCGGCCTC GTCACCTGCC TTTATCGCGC CGGGGCGACC
CCGAATTGGA CGTCATTGCT CGAAGCGGTC GGGATAACGG GCGAGTTTGC CCCGGGTGTT
TTGGTCGCCG ACTTCACCGA AGGCATCCGG TTCGCTTTCG GCGCCACCAT GGCGACCACG
AAAACCAGGC TGTCTTCCAT CGGGGGGTTT GCGGCCATCG CCGATTACCT CGGGGACGAC
TACATGCTGG GAAACCTGCT TTGGAGAGAG GGATACGAAA TCAGGCTGGG AAGGCCGGTG
GTGGAGACGA TGCCGCCTCC GCTCAGCTTC CGCTCGATGC TCAATCATCA AATCCGCTGG
TCGCGAAACA TCCGGGCCTG CCGTCCCATG GGCCATTTCG GGACGGCGAT CACTTACGGC
ACGGTTCCGG CGCTGCTCAA CTTCATTTTC TTCGCCTCGC CCTTTACTTT CCTGCTGCTG
GGCGCCGTTG CGGCCCTGCG GCTTTTTACC GGGTGGTACG TCGGGGTGCG CGGCCTGCGG
GATCGGATAC TGGAAAAGAA CCTCTGGCTG CTGTTGCCTC GGGACCTGTT GGGATTTGGC
GTCTGGTGCG CCAGCCTGAC CGGATGGAGC GTGGAATGGC GAGGACGGCG CTACCGTTTG
CAGAAAGACG GCAGGATGCT GCCGCCGGGC GAGAGACCGG AGCCTTGA
 
Protein sequence
MFPERDRPSG PRIRRPHRLN PWHEMQAPML TDCVVFFLSL LSLAGCAYYL TCIVAARRFF 
SRPRSAAPVS AVRPASILIP LCGADFQAYD NYASFCRLDY PEFQLVFGVQ DPMDSSIPVV
ERLKENFPHC DIHLVIDSKA IGTNPKVSNL NNMLAAARHE LIVIVDSDIR VEADYLSTLV
PELADERIGL VTCLYRAGAT PNWTSLLEAV GITGEFAPGV LVADFTEGIR FAFGATMATT
KTRLSSIGGF AAIADYLGDD YMLGNLLWRE GYEIRLGRPV VETMPPPLSF RSMLNHQIRW
SRNIRACRPM GHFGTAITYG TVPALLNFIF FASPFTFLLL GAVAALRLFT GWYVGVRGLR
DRILEKNLWL LLPRDLLGFG VWCASLTGWS VEWRGRRYRL QKDGRMLPPG ERPEP