Gene Sfum_2157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2157 
Symbol 
ID4459546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2638140 
End bp2639348 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content58% 
IMG OID639702923 
Productradical SAM domain-containing protein 
Protein accessionYP_846274 
Protein GI116749587 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACA TTTCCCTCTT GATAAAGCCT TCTTCAGCCC GGTGCAATCT TCAATGCACG 
TACTGTTTTT ACAGCCGCGT CAAGGACCTG TATCCCGAGC CCCACACGCT GATGACCCTG
GAGACCGCGG AAACACTGAT CCGAAAAACT CTGGAGCTCG GGCTGCGGGA GAACAGCTTC
TGTTGGCAGG GGGGCGAACC GACCCTCATG GGAATCGGTT TCTTCCAAGA GGTTGTACGA
CTTCAAAGGC GATATGCGAC CCCCGGGCAG ATCATCGCAA ATTCGCTGCA AACCAACGGG
ATTCTCCTGG ATGACAGGTG GGCGGAATTT CTTGCCGGAA ACTCCTTTCT CGTGGGCCTC
AGCCTGGATG GACCGCGGGA GTGCCACGAC CACTACCGCA CGACTCCATC GGGCAGGGGC
ACCTTCGATC GGGTCATGAG CGCGGCAAAA TACCTTGAAG AACGGGGGGC GGAATTCAAT
ATCCTGACTC TGCTCTCGGA TCGGAACGTC AATCGGCCCG ACGAACTGTA CCGGTTCTTC
CGCAGACACG GCTTTTCTCA CCTGCAATTC GTCCCGTGCG TCGAGACGGA CCATTCCACG
GGGAGACCTT TTCCTTACTC CGTCGCAGTT CGAGACTTGG GCAGGTTCCA TTGCATTCTG
TTCGATCTCT GGATGAAAAG CGGCTTCTAT GACGTCTCGA TCCGTACCTT CGAGGAAATA
CTGATCGCCT TCATCGACGG TGTCGGCACC TCCTGCGTCA TGCGCCCGCA ATGTTCCTCT
TATCTCGTCG TCGAACACAA CGGAGACGTT TATCCCTGCG ATTTCTTCGT GTACCCCGAA
TGGAAGCTGG GAAACATCAA CGAAGACTCG TATTCCCGGA TCTTCGCCAA TCCGCTCAGG
AAGAAGTTCG CAGTGATCAA ATCGGCCTTG CCGGAACAGT GCCGATCCTG TCGCTGGCTG
GACCTGTGCC GGGGCGACTG CCCCAGGTTC AGGCCGCCGG CGGAAAATGG GGAAGCGCAC
CCGAGCGTGT TGTGCGAGGC GAGGAAAATG CTGTTCCAAC AAATGGAGCC GCACTTGCCG
CGCATCAGGG AAGAAGCGCT TCGGATTCGA CGAAGACGGG AGGGCGCCGG GGTCACCGGC
GGTGTGCGTA GAAACGACCG GTGCCCGTGC GGCAGCGGCA TGAAATACAA ATCCTGCTGC
GGCCGCTGA
 
Protein sequence
MDDISLLIKP SSARCNLQCT YCFYSRVKDL YPEPHTLMTL ETAETLIRKT LELGLRENSF 
CWQGGEPTLM GIGFFQEVVR LQRRYATPGQ IIANSLQTNG ILLDDRWAEF LAGNSFLVGL
SLDGPRECHD HYRTTPSGRG TFDRVMSAAK YLEERGAEFN ILTLLSDRNV NRPDELYRFF
RRHGFSHLQF VPCVETDHST GRPFPYSVAV RDLGRFHCIL FDLWMKSGFY DVSIRTFEEI
LIAFIDGVGT SCVMRPQCSS YLVVEHNGDV YPCDFFVYPE WKLGNINEDS YSRIFANPLR
KKFAVIKSAL PEQCRSCRWL DLCRGDCPRF RPPAENGEAH PSVLCEARKM LFQQMEPHLP
RIREEALRIR RRREGAGVTG GVRRNDRCPC GSGMKYKSCC GR