Gene Sfum_4083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_4083 
Symbol 
ID4457572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4966435 
End bp4967565 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content60% 
IMG OID639704853 
Productsolute binding protein-like 
Protein accessionYP_848183 
Protein GI116751496 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0424858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0912112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCCTT ACATTCGACT ATTAGGAAAG AATCGGTTCA AAACGAGCTT TACTTTTCTG 
ATGGTGTTAT GTGCCGTGGC GTGCGCGGGC ATCGCCCGGG GTGCTTCAGG CCCCGTTGGA
AAGGCCGCGG TCTTGAAAGG CGCCGTGTTC GTGGAGCGGG AAGGAAGGAG CATTGCGGCG
AAAGCCGGCG AATCCGTCTT TCTCAAGGAC AAGTGGCAGA CGAAGGGGGA TGGTTCCGTG
GAGATCGTCT TCCTTGATGA AAGCCGGGTA AAAATGGCCC CGGGATCGGT GATGGAGATC
ACTGAGTATT TGTACGATCC TTCGCAAAAA AGCCGTCAGG GGCTTCTGTC CATGATGTCG
GGGAAAGCGC GTTTCGTTGT GCAGGATTTG CAGGATTTCA AGGAAAAGCG GTTCCGGGTG
CAAGGTCAGA CGGCGGTTGT GGGCACCCGC GACACGGACT TCGTGGTGCG GGTGCGTTCG
GGTTCGGCGA AGGAAAGCAT ATGCAGGGAG GAACTGCTGG AAGCTCTGTG CATCGAGAAT
GTGATCATAG CCGTCAACCG CACTACCCCC GATAAGGGGG CCGTCATCAC CACCAACATG
ATCACTCAGG TCTGCGGGAA GAATCCGCCC ACCCCGCCCC GGTTCGCGAC TCCCGCCGAG
CGTGCCGATC TGCTGAAGGG GCTGGAGGAA ATCGGCTCCA GGAAGCTGCC TCGCGCCGAG
ACCGGCATCG GCGTGCCCGA AACGAGCGGA GGGGAGACTT CGACCGGTCT CACGCACACC
CCTCCGCCTG AAGTGATCGT GCCGCCATTC ATTTTCCCGT CCACGACGAC AACGACAACG
AGCTCATCGA CCACCACGAC GTCGACCACG TTGCCGTGGC AAGTGCCGAG GACCACTACG
ACCACCTCGA CCACTTCGAC CACGATGCCG ACGACGACGT CCACCACAAG CACAAGCACG
TCCACGACGA GCACATCGAC GACCAGTACG TCGACAACGA GTACCACGAC GACGAGCACG
TCCACAACGA GCACCTCGAC GACGAGCACG TCGACAACCA GCACCTCGAC GACGACCACA
CTCCCGCAGC CTCCACAGCC TCCCATCAGA GGTGGTCCGC GGGGCAGGTG A
 
Protein sequence
MLPYIRLLGK NRFKTSFTFL MVLCAVACAG IARGASGPVG KAAVLKGAVF VEREGRSIAA 
KAGESVFLKD KWQTKGDGSV EIVFLDESRV KMAPGSVMEI TEYLYDPSQK SRQGLLSMMS
GKARFVVQDL QDFKEKRFRV QGQTAVVGTR DTDFVVRVRS GSAKESICRE ELLEALCIEN
VIIAVNRTTP DKGAVITTNM ITQVCGKNPP TPPRFATPAE RADLLKGLEE IGSRKLPRAE
TGIGVPETSG GETSTGLTHT PPPEVIVPPF IFPSTTTTTT SSSTTTTSTT LPWQVPRTTT
TTSTTSTTMP TTTSTTSTST STTSTSTTST STTSTTTTST STTSTSTTST STTSTSTTTT
LPQPPQPPIR GGPRGR