Gene Sfum_2115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2115 
Symbol 
ID4459574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2587182 
End bp2588162 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content55% 
IMG OID639702882 
Productsigma-70 region 2 domain-containing protein 
Protein accessionYP_846233 
Protein GI116749546 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000423948 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000794083 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCCAAAAT CCAAGAAAAG CAAACGAGGC GGAGTCAGCC CGGATCGCGG CGATCCGCAC 
GAGGCGGGAG AAGAATCCCG TTCCGGCTCC ACTTTTCCGG CGGAGAGGCC GCAGGCCGTT
TCGTTTGACC CGTTCCGCAT CTACCTCGAC GAGATCAAGC GATACCCCCT TCTCAGCAGG
GAGGAGGAAA CGGATCTGGC CATCCGGTAC CGTGAGAAAG GCGATATTGA AGCCGGCTAC
AAGCTGATCA CCGCCAACCT GAGACTGGTT GTAAAGATAG CCATGGATTT TCAGAGGTAC
TGGATGCAGA ACCTCATGGA TTTAATCCAG GAGGGGAACG TCGGCCTGAT GCAGGCCGTC
AAAAAATTCG ACCCGTACCG CGGATATAAG TTTTCCTACT ATGCTTCGTT CTGGATCAAG
GCCTACATCA TCAAGTTCAT CATGGACAAC TGGAAGCTTG TCAAGATCGG CACCACCCAG
GCACAGCGGA AGCTGTTCTT CAACCTGCGC AAGGAGAAGG AGCGGTTGGA AGCGCAGGGA
ATCGAAGCTT CGCCGAAGCT CCTCAGTCAC CGGCTGGACG TCAAGGAATC CGAAATCATC
GAGATGGATC AACGGCTGAA TAGCTGGGAG ATCTCTTTGG ACAGCCCGCT CAAGGAAGAT
TCCGAAGACA CGCACAAGTC TTTCCTCCCC TCCGACGATC TTCCCGTCGA CGATCAGATC
GCAGACCGGG AAGCGAAGGC TATCCTGCAT GACAAGCTCC TGCTCTTTCG GGAACAGCTG
AAAGGCAAGG AGGCGGTCAT TTTCGACAAA CGGCTGCTGA CCGAAGAGCC CATGACCCTG
CAGGAGATAG GAGACCGTTT CGGAATCAGC CGCGAACGGG TCCGGCAGAT TGAAAGTCGT
TTGAAGAAGA AGCTCAAGGC ATACCTGGAA GAAGAAATCG AGGACATCGA TCTGCTTCAG
GAGAGCATGG TCGAAGTCTG A
 
Protein sequence
MPKSKKSKRG GVSPDRGDPH EAGEESRSGS TFPAERPQAV SFDPFRIYLD EIKRYPLLSR 
EEETDLAIRY REKGDIEAGY KLITANLRLV VKIAMDFQRY WMQNLMDLIQ EGNVGLMQAV
KKFDPYRGYK FSYYASFWIK AYIIKFIMDN WKLVKIGTTQ AQRKLFFNLR KEKERLEAQG
IEASPKLLSH RLDVKESEII EMDQRLNSWE ISLDSPLKED SEDTHKSFLP SDDLPVDDQI
ADREAKAILH DKLLLFREQL KGKEAVIFDK RLLTEEPMTL QEIGDRFGIS RERVRQIESR
LKKKLKAYLE EEIEDIDLLQ ESMVEV