Gene Sfum_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1447 
Symbol 
ID4460585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1791700 
End bp1794786 
Gene Length3087 bp 
Protein Length1028 aa 
Translation table11 
GC content54% 
IMG OID639702216 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_845573 
Protein GI116748886 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.722583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000177894 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGATGTGGT ACCGGACAGA ACGACCGATA TGGCTCCGCC TCGGGGTAGG AGTTCTTCTC 
CCCGTCATTG CTGCCGTCAT CCGGTGGCAC TTCCTCGAAA CTCTCGGGTT CCGCGCCACC
TTTACCACCT TTTACCCGGC CGTCGCAGTC GCTGCACTGT ATGGAAGATT CGGCGCCGGT
TTCATCGCGA CAGCTGTCTC GGCCCTCCTT GCAGACTATT TCTGGATTGA GCCGGTTGGT
CAGTTCGCCA TTGCTCCCTT CGCAGACCGG TTGAGCGTTG CGGTCTTTTT AGCCAATGGG
ACCTTGATCT CCTGTTTGGC CGAAGCGGCA TATCGTGCCC AGGCCCGGGC ATTCAAAACG
GAGCAGCAAT CGAGGCTCTC CGCCGAGCGC GAAAAGGCGG CTGTCGAGCT CCAGCAAAGT
GAGAGGAAGT ACCGCGAGTT GGTACAAAAT GCTAACAGTG CCATTGTCCG CTGGAACAGT
GGCGGCACGA TCGCCTTTTT CAATGAATAC GCTCAAACAT TTTTCGGCTA TCGTGAAGAT
GAAATCCTCG GCAAGCACGT GAGTGTCCTG TTGCCTGAAA CGGAGTCCGC CGGAACGGAT
TCGACTGCAC TTGCCCTTGA TGTGGTAAGA TATCCGGAAC GCTACGTTTC AATCATCAAC
GAGAATATCC GTCGGGATGG CAGCCGAGTA TGGATGGCAT GGACCCACAA AGCCATATTC
GATGAGAGCG GCGCCGTTTC GGAAATCTTG GCGGTCGGGA CTGATATCAC TGCGCGCAGA
CAGGCCGAAG AGAAACTGAA ACACGCCAGT GAGTTTGCCA CGGCCATTCT TGATACAGTT
GACGGACTGA TCATAGTACT CGATCGGAAT GGTCGAATCG TCCGATTCAA TGCTGCCTGT
GAACGCCTGA CCGGCTGGCG GGTCGACGAG GTGATTGGGC GATGCTTTTG GGAATTCCTG
GTTCCGGAGG AACAGCGTCG CGGTGTTCTC CAAGTATTTC AAGGTCTGAC CACGGGGAAA
TTCCCCAGCC GTCACGAGAA TGAATGGATT CTCCGGGATG GGAGTCGACG TTGGATAACC
TGGGCCAACT CTTGCCTGTT GGATGCGAAA GGTGACATCG TGCATGTTAT AGGTACCGGG
ATCGACATTA CCGAACGCCA ACGGGCTGAA CAAGCACTGC GCGAAAGCGA GAGACTGGCG
CGGGACCAGG AAGCTCAAAT AATGGCAATC TTGGATGCAG CCCCCGCCAT GATCTGGACT
GCTCATGACA GGGAGTGCCG GAGCATATCG GGGAACCGGG CGGCACGTGA GTTGTCACGA
GTGGACGAGA CCGCCAATAT GTCCAAAACC GGGCCGGAGC GAGAGCATCT TGCCCACTAC
AGCATCTACA AGGATGGCAG GGAGCTCGCT CCCGAAGAGC TGCCGATCCA GGTTGTTGCG
TCCACAGGTC GGGAGTTCAG AGACTATTCC CTCGAGTTTG TTTTGAAGGA TGGTCCACGA
TATTCGCTCC TGGGCAACGT CATCCCCCTC ATCGACTCGG AAGGCAGCCC GAGCGGCGCG
ATCGCCGCCT TCATGGACAT CACCGAACGA AAGCTGATGG AAGAGGAGCT TCGCAAATCG
AGGGATGAGC TCGAAATCAG AGTCCGCGAG CGAACGGAGG AACTGGCGGC GGAACGGCAG
CTGTTTTTCG ATGTCCTGGA GACACTCCCG GTCTATGTTT GTCTTCTGAC ACCCGAATAT
CATGTTCCTT TCGCTAATAA AGTCTTTCGT GATCGTTTTG GCGCATCCAA GGGCCTTCGC
TGCTTTGAAC ATCTGTTCGG GCGCAGCGAA CCTTGTGAAA TTTGTGATAC ATATGCGGTT
CTGAAGACTT CCACTCCCCA CCGTTGGGAG TGGGAAGGCC CCGACGGCCG CAGCTATTCC
GTCTTCGACT TCCCATTCAT CGACGCCAAG GGTTCCCAAT TGATCCTTGA GATGGGAATT
GACGTCACCG AGCGCAAGCA GGCCGAGGAA GCACTAAAAA CCGAGAGGCA GCGGCTCTTT
GACGTGCTTG AGACATTGCC GGTAATGATC TGCCTATTGA CGCCTGACCA CCATGTCGCT
TTTTCAAACC GCAGCTTTCG CGAGAAGTTT GGCGAATCGC AGGGACGGCA CTGCTACGAA
TACCGTTTCG GGCTTAACGA ACCCTGCAAT TTTTGCGAAT CGTACAGAGT GGCGGAAACG
GGCCGGTCTC ACCACTGGGA AGTGACCGTC CCCGACGGCA CCATAATCGA TGCTCACGAC
TTTCCGTTCA TCGACGTGGA CGGCTCTCCC ATGATCCTTG AAATGGACTT CGACATTACG
GAGTTCAGGG AAGCCGAAAG GGAACTCAAA GCGACAGTGG CCAAGCTCGA ACAGTTGAAT
CAGGAACTGC AGGAATTCGC TTTTATCGCT TCCCATGATC TTCAAGAACC CTTGAGAAAG
ATCCAGACTT TCGGAGAAAT GCTTGTTAGA AGAAAGAAGG ATTCCCTCGA CGCCGAAGGC
AAAAACCTCT TGGAGCGAAT CATAAAGGGC GCAAACCGTA TGGCTGAGCT TCTACATGCC
CTGCGCACAT ACTCCAGAAG CGGCACAAGC CAGTTGATTC ATAAGCCCGT GTCCCTTTCA
GAGGTTGCCA GGGGGGCCGC GAGTGCCTTG GAATATTGGA TTTCCAAGAC CAATGCGAAG
GTGGAGATTG GTGACCTGCC GACAGTCGAT GCCGATGAGT CGTTGCTACA TCAGCTTTTT
CAAAACCTGA TCTCGAATTC AATAAAGTAC CGAAAGGAAT CGGAACCCCC TGTTATAAAA
ATTTCCGGGA GTGTGATCGA TCACCAATGC CGGATAAGCG TACAGGACAA CGGCATAGGG
TTCGATCCGT GCTACTCCGA TCAAATCTTC AAGCCTTTCC AGAGGCTTCA CGGCAAGGAT
TCGCCATATA GCGGCACCGG AATGGGTCTT ACCATTTGCA GGAAAATTGT GGCCAGGCAT
AATGGCGAAA TCACCGTCGA GAGCGAACCC GGGCGAGGCT CCACTTTCAA AGTGACCCTT
CCTTTGAAGC AGCAGAAAAG GGCCTGA
 
Protein sequence
MMWYRTERPI WLRLGVGVLL PVIAAVIRWH FLETLGFRAT FTTFYPAVAV AALYGRFGAG 
FIATAVSALL ADYFWIEPVG QFAIAPFADR LSVAVFLANG TLISCLAEAA YRAQARAFKT
EQQSRLSAER EKAAVELQQS ERKYRELVQN ANSAIVRWNS GGTIAFFNEY AQTFFGYRED
EILGKHVSVL LPETESAGTD STALALDVVR YPERYVSIIN ENIRRDGSRV WMAWTHKAIF
DESGAVSEIL AVGTDITARR QAEEKLKHAS EFATAILDTV DGLIIVLDRN GRIVRFNAAC
ERLTGWRVDE VIGRCFWEFL VPEEQRRGVL QVFQGLTTGK FPSRHENEWI LRDGSRRWIT
WANSCLLDAK GDIVHVIGTG IDITERQRAE QALRESERLA RDQEAQIMAI LDAAPAMIWT
AHDRECRSIS GNRAARELSR VDETANMSKT GPEREHLAHY SIYKDGRELA PEELPIQVVA
STGREFRDYS LEFVLKDGPR YSLLGNVIPL IDSEGSPSGA IAAFMDITER KLMEEELRKS
RDELEIRVRE RTEELAAERQ LFFDVLETLP VYVCLLTPEY HVPFANKVFR DRFGASKGLR
CFEHLFGRSE PCEICDTYAV LKTSTPHRWE WEGPDGRSYS VFDFPFIDAK GSQLILEMGI
DVTERKQAEE ALKTERQRLF DVLETLPVMI CLLTPDHHVA FSNRSFREKF GESQGRHCYE
YRFGLNEPCN FCESYRVAET GRSHHWEVTV PDGTIIDAHD FPFIDVDGSP MILEMDFDIT
EFREAERELK ATVAKLEQLN QELQEFAFIA SHDLQEPLRK IQTFGEMLVR RKKDSLDAEG
KNLLERIIKG ANRMAELLHA LRTYSRSGTS QLIHKPVSLS EVARGAASAL EYWISKTNAK
VEIGDLPTVD ADESLLHQLF QNLISNSIKY RKESEPPVIK ISGSVIDHQC RISVQDNGIG
FDPCYSDQIF KPFQRLHGKD SPYSGTGMGL TICRKIVARH NGEITVESEP GRGSTFKVTL
PLKQQKRA