Gene Sfum_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3068 
Symbol 
ID4458618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp3776590 
End bp3777840 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content60% 
IMG OID639703839 
Producthypothetical protein 
Protein accessionYP_847176 
Protein GI116750489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00012558 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTGAGCG AATCTAGCCT TCAGAGCTGG CAGATCGAGC AACAGTGCCC TCAGTGCGGC 
GCTCCCCTTG TACTGGAGGA GACGGATCGC GTGCTGTCGT GCGAGTTCTG CCGGGTGAAG
CTCACCATCT CGTTTCCCGG CCATTGCCGG TGCTACCTTC CCCCCGCGCG GAGTCGTCCC
GACCTGCTCT TCATCCCGTA CTGGAGATTC AAGGGAATGG TCTTTTCCTC CACGGAGGAA
GGGGTCCGCG CGCGGATCGT CGATTCCAAC CAACTGGCGG TGAATTCGCT AAACCTTCCT
CCGTCCCTAG GTTTGAGGCC GCAGGTGATG CGGCTGAAAT TCGCGGCACC GAAACCGGAA
GGCCGTTTCA TCAAACCCGG CGCGCCCGTC AGGGTATTTT CCATGAAAGT GGATGATACG
TTTCAGGGCG CCTCGGAAAG GATTTCCCGG GGGCTCTACG AAGCGTTCAT CGGTGAAACG
GTGAGCGCGA TCTACACGCC GGTGTTCGTC GAGGGCCGCA CCGTCCTGGA CGCCGTCCTC
AACCGCCCGC TCTGCGCGGC CGATGTCGAG GACGCCCCGG AGCTCGACCC GGATTCCTCC
TTCGACTGGC CGATCACGTT CTCATCGACC CTGTGTCCGC ACTGCGGCTG GGATTTGGAG
ACCGAGAGAG ACGGCCTGGT TCTGCTCTGC AGAAACTGCG GCTCCGCCTG GGAGCACCTC
CGGGGCGAAC TGAAAAGGAT CTCCTTTTTC CTGCCCCTCG ATTCGAACCC TTCGGCGATT
CATTTGCCTT TCTGGAAGCT TCGTACGCGT TTTGAAGGAT TGCAGCTCGA TTCCTTCGGG
GATCTGGTGC GGCTCGCCAA TATGCCCAAA ATAGCCCGGG CCGGTGAAGC CGACCGCGAT
CTGCACTTCC TGATCCCCGC CTTCAAGATT CAACCCCGGG TCTTCCTGCG GCTCGCCAAG
CTTCTCACCT CGAATTCGTT GGACGAGGGG CCGAGCGCGG AACCGGGCAG CCTGCTCCTG
CACCCGGTGA CCCTACCTGC CGCCGAAGCC GTCGAAAGCA TCAAGACCAT CCTGTTCGCC
ATGGTCGCCC CGAAAAGAGC GTTCTTCCCT CTGCTTGGCA GAATACGGCC ACGGGTGGAA
GAATACGGCC TGACCTACCT GCCCTTTGTT CCGAAGGGCC ACGAGCTCGT TCAGCCCGAT
TTGCAGACCA GCATAAACAA GAATGTACTC AATTGGGGAA GATTGATTTA G
 
Protein sequence
MVSESSLQSW QIEQQCPQCG APLVLEETDR VLSCEFCRVK LTISFPGHCR CYLPPARSRP 
DLLFIPYWRF KGMVFSSTEE GVRARIVDSN QLAVNSLNLP PSLGLRPQVM RLKFAAPKPE
GRFIKPGAPV RVFSMKVDDT FQGASERISR GLYEAFIGET VSAIYTPVFV EGRTVLDAVL
NRPLCAADVE DAPELDPDSS FDWPITFSST LCPHCGWDLE TERDGLVLLC RNCGSAWEHL
RGELKRISFF LPLDSNPSAI HLPFWKLRTR FEGLQLDSFG DLVRLANMPK IARAGEADRD
LHFLIPAFKI QPRVFLRLAK LLTSNSLDEG PSAEPGSLLL HPVTLPAAEA VESIKTILFA
MVAPKRAFFP LLGRIRPRVE EYGLTYLPFV PKGHELVQPD LQTSINKNVL NWGRLI