Gene Sfum_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1199 
Symbol 
ID4460471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1483746 
End bp1485533 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content61% 
IMG OID639701966 
Productglycosyl transferase family protein 
Protein accessionYP_845327 
Protein GI116748640 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.230311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGGA TGAAGAACGA GGACGAGCTC GAAAGACAGC CGGTTCCAAC TCCGGCGGCC 
GGGGCGGGCA AAGACCATGG CGTCCGGCTG GCCATGGCGG TGCTGATCCT GGTCGTGATT
GCGGTCGTGA TGGTTCGCCT CAAGGCCGCT CCCGTGCCTC TGGAACGCGA TGAGGGCGAG
TATGCCCTCA TGGGCCAGTT GATTCTCGAG GGCGTGGCAC CCTACCGGGA GGCGGGAAAT
ATGAAACTTC CCGGCACCTA CTATGCTTAT GCCGCAATCC TGGCCCTTTT CGGGCAAACC
ATCACCGGCA TTCATGTCGG GCTCATGGTG GTGAATCTGC TCACGGCCGG CATACTCTTC
CTGATTGCAT CGCGCCTGCT CGGTCGTGTC GAGGCGGCCC TGGCCGTAGC GGCCTTTCTC
ATCATGTCGG TTGACCTCTC GGTGCTCGGT CTTTTTGCGC ATGCGACCCA TTTCGTGATC
ATGTTCGGCC TGGCCGGAGT CTGGCTGCTC CAGGAAAGCT CCGGGTCGAA AAGAGAGGTG
TTGCTGCTTT GGGGCGCGGG TCTTTGCCTT GGTCTGGCGG TACTCATGAA ACAGTCCGGG
GCGCTTTTCG CGCTGTTCGG CTGTCTATGG GTCCTGTTTG ACGGACGCCG GACCGGATCG
TGGAAACGCT CCCTCGTCAG GTCCGGGGTC CTTGCGTGCG GAATCGTCCT GCCCTATGCG
GCTTTCCTGG TTGTGACGGT CGTCCGCGGC ACATTCGACA GATTCTGGTT CTGGACCGTC
GACTACGCGC AATCCTACGT ATCTCTCGTG GGCTACGGGT TGGGTCTTGA GCTCTTCCGG
GTGGGAATCG TTCCCATCAT CCGAAGCAAC CCCGCGATAT GGTCGATGGC TCCGGTGGGC
ATGATCGGTC TCTGTGTGGC GAAGGGCAGT CGCAGAACGG GACTCTTTCT GATAACGTTT
TTCCTGTTCT CGTTCCTGGC CGTGTGCCCG GGCCTGTACT TTCGGCAGCA CTATTTCGTG
CAAATCCTGC CGGCAGTGGC GCTGGGCGCC GGCGCCGCAT TGCGGATCCC GCGGGGACTT
TTCGCAAGAG TTACAAGGCG GCCGCTGACC GCGGAGGCCG TCGTCTTCCT GGCGATCGCC
GCGTTTTCGC TGACGGGAAT CCTATCCATG TGGCACGGCC TGTCCACATT GACGCCGGAG
CAGTTCAGCC GTTCGGTCTA TGGTTCCAAC CCGTTCCCCG AATCCGTCGC CGTCGCCGAA
TACATCGCAA AGAACACCGG CGCCGAGGAG CGCATCGCCG TCCTGGGGTC AGAGCCCCAA
ATCTATTTCT ACGCCCATCG GAGGCCGGCG ACCGAGCACA TTTACATGTA CGGGCTGATG
GAACCTCAGC CCTTCGCGCT GCGCATGCAG GAAGCCCTGG TCGCACAGGT GGAGAAGAGC
GAGCCGCAAT TCATCGTGCT GGTGTTGGTT TCCACCTCCT GGCTCCAGCG CGGCGTCTCG
GAGAAGAAGG TGTTCGGCTG GATGAGGGGC TACCTCAACA CATACTACCA AGCCGTCATG
ACGGCCGAAA TCCAGTCGGA CCGCACCGTA TGGTCGGTCG ATGAGGACGT TCGGACCTCC
CAAGGGAGGC GCGGATCGAG TCGGTTGATC GTCTACAGAA GGAAACCTGC CGACTCTTCT
CACAACAGAG AGGGAGGCTC CGGCCTGGTG CGGCGTCCTG CGGGATGGGG AAGCGATTCA
TACCGGGGTG CGTTCAAGAG TACGCAACAC CTGCCCGGCG GGAGGTAA
 
Protein sequence
MVRMKNEDEL ERQPVPTPAA GAGKDHGVRL AMAVLILVVI AVVMVRLKAA PVPLERDEGE 
YALMGQLILE GVAPYREAGN MKLPGTYYAY AAILALFGQT ITGIHVGLMV VNLLTAGILF
LIASRLLGRV EAALAVAAFL IMSVDLSVLG LFAHATHFVI MFGLAGVWLL QESSGSKREV
LLLWGAGLCL GLAVLMKQSG ALFALFGCLW VLFDGRRTGS WKRSLVRSGV LACGIVLPYA
AFLVVTVVRG TFDRFWFWTV DYAQSYVSLV GYGLGLELFR VGIVPIIRSN PAIWSMAPVG
MIGLCVAKGS RRTGLFLITF FLFSFLAVCP GLYFRQHYFV QILPAVALGA GAALRIPRGL
FARVTRRPLT AEAVVFLAIA AFSLTGILSM WHGLSTLTPE QFSRSVYGSN PFPESVAVAE
YIAKNTGAEE RIAVLGSEPQ IYFYAHRRPA TEHIYMYGLM EPQPFALRMQ EALVAQVEKS
EPQFIVLVLV STSWLQRGVS EKKVFGWMRG YLNTYYQAVM TAEIQSDRTV WSVDEDVRTS
QGRRGSSRLI VYRRKPADSS HNREGGSGLV RRPAGWGSDS YRGAFKSTQH LPGGR