Gene Sfum_3360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3360 
Symbol 
ID4457642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4116263 
End bp4118191 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content58% 
IMG OID639704132 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_847468 
Protein GI116750781 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACAGC AATTGCGGAA TCCGAAATTT TACTCAATGC TCATCATTGA CGCATTTCTT 
TTCGCCGTGG CCTATTACGG AGCCTACCTG CTCCGCTTTG AATTCACTCT GCCTCCCAAT
CACCTGCCCC TCATCAATGC TCTCGTTCCT TGGGTCGTCG GCATGAAGCT GGGCGTTTTC
TGCCTGTTCG GCCTCTACAG GGGGATGTGG CGGTTTTCGG GACTGGAGGA TTTCTGGCGT
CTGGGACAGG CGTCCGTTCT TTCCATGCTT CTGATTGTCG CTACGGTGGC TTACACTGAC
AACTTCACCG GATATCCCCG ATCCGTCTTT CTTCTGGACT GCGTTTTGAC GTTTCTGCTG
ACCGGCGGTC TGCGTGTATG CATTCGGTCC TACTATATCG CCAGGAATAC GCCGCGGGGG
ATCCGGGCAT TCTCCCTGCC CAGGCTGAAT TACGTTGAGA AAGAGCGCAA GCAGATTCTC
ATCATCGGCG CCGGCGGGTC GGGAGAAAAG ATGCTCCGCG AGATCTTCGA CAATCCGCAG
CTCCATTATC ACGTGGTCGG GTTCCTGGAC GACGATCCGG GCAAGCGGGG CCGCACGGTG
CATGGGGTGC GGGTACTCGG GCCCGCGGAC CATTTGTCCA GAGTCCTGGA GCAGAACAAT
ATCGAACAGG TATTTATCTC GGTACCTTCC GCGACCGGAG CTCAGATGCG CCGGCTCATC
GATATCTGCA AGGGATGCGG GATTTCCTAC AAGACGCTTC CCGCCATCGG TTCGATCATG
AACGGCAACG TGAGCATCAA GTCTCTGCGG GACGTCAATT ACGAGGATTT GCTGCGCCGT
CCTCCGGTCA GCCTGGAAAC CGATGCCATC TCAGGCTACC TGACCAACCG CACGGTAATG
GTGACGGGGG CGGGCGGGTC CATCGGCTCC GAACTCTGCC GCCAGGTCGC CCAGTTCAAG
CCGCAGTTGC TGATCCTGGT CGATGCCAGC GAGTCCAACC TGTTCCATAT GCAGATGGAA
TTGCAGCATG AGAGGGAATT TCGCAACTAT CAGTGTATCT TGGGTCAGGT GCAGCAGCGT
CTGCTGATGG AGAGCGTGTT CCGGAAGTAC CGCCCGGACG TGGTGTTCCA CGCCGCGGCT
TACAAGCACG TCCCGATGCT GGAAAGGAAT CCCTGGGAAG CCGTGTTCAA CAACGTGCGC
GGCAGCCAGG TAATGATGAA GCTGTCCAAG GATTACGGGG TGAAACGTTT CGTCCTGGTT
TCCACCGACA AGGCGGTCCG GCCGACGAAC GTGATGGGAA CCAGCAAGCG CCTCACGGAG
CTCATCCTGC AGTCCTTTCA GGGCAATGGG ACAAAGTACA TGGCCGTGCG ATTCGGCAAT
GTGGTCGGGT CCTCGGGTTC CGTCATTCCC CTCTTCCGCC GCCAGATCGA GCAGGGAGGC
CCGGTCACCG TGACTCATCC CGAGGTCACC CGGTACTTCA TGACGATTCC CGAAGCGGCT
CAACTCATCC TGCAGGCCGG AGCGCTCGGG GAAGGCGGAG AAATCTTCAT CCTGGAAATG
GGAACCCCGG TAAAGATCGC GGACATGGCA CAGGACCTGA TCCGGTTGTC GGGGAAGCAA
CCGGGCAGGG ACATTGAAAT CCAGTTCACC GGCCTGCGAG AGGGCGAGAA GCTTTACGAA
GAGCTGATCA CCCTCGATGA AGGCATCGTA AACACCAGGC ACGAAAAGAT CCTGGTGCTG
CGCCCCAACG GAAATGGAAA CGGCAACGGC CATCATGCGG GAAACCACGA TGCTTTTCGG
CAGTGGCTGG ATCGGGAATT GGAAGAGCTT TGCGCCATTG CGCGCAAGCA TGACTCATAC
GCCATCAAGC GCAAGCTCAA GCAACTGGTT CCGGAATACA CCCCCCAGGA TGCGGAGTGC
GTCCTGTAA
 
Protein sequence
MIQQLRNPKF YSMLIIDAFL FAVAYYGAYL LRFEFTLPPN HLPLINALVP WVVGMKLGVF 
CLFGLYRGMW RFSGLEDFWR LGQASVLSML LIVATVAYTD NFTGYPRSVF LLDCVLTFLL
TGGLRVCIRS YYIARNTPRG IRAFSLPRLN YVEKERKQIL IIGAGGSGEK MLREIFDNPQ
LHYHVVGFLD DDPGKRGRTV HGVRVLGPAD HLSRVLEQNN IEQVFISVPS ATGAQMRRLI
DICKGCGISY KTLPAIGSIM NGNVSIKSLR DVNYEDLLRR PPVSLETDAI SGYLTNRTVM
VTGAGGSIGS ELCRQVAQFK PQLLILVDAS ESNLFHMQME LQHEREFRNY QCILGQVQQR
LLMESVFRKY RPDVVFHAAA YKHVPMLERN PWEAVFNNVR GSQVMMKLSK DYGVKRFVLV
STDKAVRPTN VMGTSKRLTE LILQSFQGNG TKYMAVRFGN VVGSSGSVIP LFRRQIEQGG
PVTVTHPEVT RYFMTIPEAA QLILQAGALG EGGEIFILEM GTPVKIADMA QDLIRLSGKQ
PGRDIEIQFT GLREGEKLYE ELITLDEGIV NTRHEKILVL RPNGNGNGNG HHAGNHDAFR
QWLDRELEEL CAIARKHDSY AIKRKLKQLV PEYTPQDAEC VL