Gene Sfum_3707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_3707 
Symbol 
ID4457994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp4522794 
End bp4523918 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content62% 
IMG OID639704480 
Productmolybdate metabolism transcriptional regulator 
Protein accessionYP_847812 
Protein GI116751125 
COG category[K] Transcription
[P] Inorganic ion transport and metabolism 
COG ID[COG1476] Predicted transcriptional regulators
[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.487138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCATGG GATCGGTCAA AGAGCGGGTG GTCTGTAACC TCAAGTCGGC GCGAAAAGCC 
AGGGGGTTGT CCCAGAGCGA ATTGGCCGGT AGAGTGGGGG TCAAGCGGCA GGCGATTTAT
GACATGGAAA GCGGGCGGTA TCTGCCCAAC ACCGCCCTGG CGCTTTACAT CGCAAGAGAG
CTCGGCTGCA GGGTCGAAGA CCTGTTCGTC CTGGAGGAAT CGGAAGAGGA ACAACCCGTC
ACTTTGGTCG AGAAAGCGGG TGCGGCTAAC CCCAGGGTTG CCGTGGCCAG TGTGCGCGAG
CGACTCGTTG CTTACCCCGT GGACGGAAAA TGGCTGTTGA GCGACGGATT CCACTCCGCG
GACGGCCTGC TCCTGGCCGA CGGCTGCAAT GTGCGTCTCT TCCAGGGGCG CAGGGCGCTT
GAGAAGAAGA TATTCCTTTT CGGCTGCGAT CCCGCATTCG CCATCTTGAG CGCGCATGCC
TCCAGATGGA TGGCGGATGC ATTCGTGCAG TGCCGTTTTG CATCCAGTTA CCTGGCACTG
GCGAGGCTTT CCGCCGGGCA CGCCCACATC GCGGGCACTC ACATGCACAA CCGGGAGTCG
GTCGAATCCA ATGTGGTGCT GGCCAAGACG GCGCTGGCGG GCACAGGGGC CATGGTGGTC
GCTTTCTCGA TTTTCGAGGA AGGTCTGATG GTTGCCGCGG GAAATCCGCT CGACATCCGG
GATGTCGGCG ACCTGGCTCG AAAGCGGATT CGATTCGTCA ACCGTGAGCC CGGAGCCGCT
TTGCGTTCTC TTCTTGACGA GCGCCTGATG CAGGTGGGGC TCTCGGGCGA GGCCGTCAAT
GGTTACGACC GGCAGGGATC GAGCCACAAC CAGTGCGCCC AGATGGTCGC TCTCGACATG
GCCGATGCCG CCCTCGGGCT GCGAGCCGTC GCGGCCGCAC ACGGACTGGG CTTCGTTCCC
ATCGAGTCCG TTCGGTGCGA CCTGGTCATT CCCCACGATT TCCTGGATCT GCCGGCCGTC
AAGATTCTGC TCGAAGTGAT GCAGACGCGC GCCTTGCGGG AGGAATTGAG CGCCCTTCCC
GGTTACGGGT CTTCCTGCAC CGGCAAAGTC ATCGGACAGG TATAA
 
Protein sequence
MSMGSVKERV VCNLKSARKA RGLSQSELAG RVGVKRQAIY DMESGRYLPN TALALYIARE 
LGCRVEDLFV LEESEEEQPV TLVEKAGAAN PRVAVASVRE RLVAYPVDGK WLLSDGFHSA
DGLLLADGCN VRLFQGRRAL EKKIFLFGCD PAFAILSAHA SRWMADAFVQ CRFASSYLAL
ARLSAGHAHI AGTHMHNRES VESNVVLAKT ALAGTGAMVV AFSIFEEGLM VAAGNPLDIR
DVGDLARKRI RFVNREPGAA LRSLLDERLM QVGLSGEAVN GYDRQGSSHN QCAQMVALDM
ADAALGLRAV AAAHGLGFVP IESVRCDLVI PHDFLDLPAV KILLEVMQTR ALREELSALP
GYGSSCTGKV IGQV