Gene Sfum_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1107 
Symbol 
ID4461077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1376172 
End bp1377563 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content60% 
IMG OID639701872 
Producthypothetical protein 
Protein accessionYP_845235 
Protein GI116748548 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.655158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGGA ACCCCTTGAA GACCCGCTCC GGCCGGCCCG GACTTCCGAG ATTGCTCCTG 
ATCGTTGCCT CGCTTCTCCT GCTCTGCGTC TTGAGCCTCC CGGCCGTTGC ATCGGCGGAA
TGGGTGCGGC ATTACGGCGG TGAGTATCGG GACGAAGCCA AGGCCATCGC CGTGGACTCC
GCCGGCAGCG TCTACGTCAC GGGCACATCG GCGAATTACG CAGAGGAGAA TTTCACGTCC
CTCGATTATG CCACCGTCAA GTACGACACG AATGGAAACC GAAAGTGGGT GCGGCGATAT
GACGGCCCGA AGCACTACAT TGATCAAGCC GCGGCGATTG CCGTGGACCG CGACGGGAAC
GTCTACGTCA CGGGCACATC GATGGGACTT CGTTCAGGCT ACGATTATGC GACCATCAAG
TACGACACGA ACGGAAAGCC GCAATGGGTG AGGCGATACG ACGGCCCCGC GGGCATGAAC
GACACGGCCA CGGCTCTGGC CGTGGATGCC GCCGGCAACA TCTATGTGAC GGGCAAGTCG
GAAAATTACA CCTACCTCGA CTATGCGACC GTCAAGTACG ACGCGGATGG AAATCAGAGG
TGGGTGGCTC GCTATAACGG CCCGAAGAGC TCCGATGACG AGGCGACGGC CATCGCCGTA
GACCGCAACG GGAACGTCTA CGTGACGGGC GCGTCGGTGG GAATGCAATC AGGCTACGAT
TATGCCACCG TCAAGTACGA CGCGAAGGGA AACCGAAAGT GGGTGAGGCG ATACAACGGC
CCCGGCAACA AGGATGACAA GGCCGCGGCC ATTGCAGTGG ACCGGAGCGG AAACGTCCAT
GTCACGGGAG GTGCGGTTTG GCCCGATAAT TATGGCGGCC TGAATTATGC CACCATCAAG
TACGACACGA ATGGAAACCG AAAGTGGGTC AGACGCTACA ACGGCCCATG GAATGAGACG
GACAGAGCCA AGGCCATCGC GGTGGACGCC GCCGGCAACG TCTATGTGAC GGGTGAAGCG
GGGACAAATA ACTTATTATT CTTTGATTAC GTAACCATCA AGTACGACCC CGACGGCAAC
CGGCAATGGA TGAGACGCCT CGTGGGGCCG GACGGATGGA GCGACAGCCC CTCCGGCATG
GCCGTGGACC CCGCCGGCAA CGTCTGCGTG ACCGGCCAGG TGGAAGGCGA TCTGGGGCAC
TTGCATTATG GAACCGTCAA GTACGATACG AATGGGGTCC GGCAATGGGT GAGGTTCTAC
GCGGAGATCT CCGGCGCGGC TGCAGCCGTG GCCGTGGACG GCGGCGGCAA TGTCTACGTC
ACGGGCCAAT CATACTCGCG CGACAAATAC CAGGACTATG CCACCATCAA GTACAACGCG
AACGGGGACT GA
 
Protein sequence
MRRNPLKTRS GRPGLPRLLL IVASLLLLCV LSLPAVASAE WVRHYGGEYR DEAKAIAVDS 
AGSVYVTGTS ANYAEENFTS LDYATVKYDT NGNRKWVRRY DGPKHYIDQA AAIAVDRDGN
VYVTGTSMGL RSGYDYATIK YDTNGKPQWV RRYDGPAGMN DTATALAVDA AGNIYVTGKS
ENYTYLDYAT VKYDADGNQR WVARYNGPKS SDDEATAIAV DRNGNVYVTG ASVGMQSGYD
YATVKYDAKG NRKWVRRYNG PGNKDDKAAA IAVDRSGNVH VTGGAVWPDN YGGLNYATIK
YDTNGNRKWV RRYNGPWNET DRAKAIAVDA AGNVYVTGEA GTNNLLFFDY VTIKYDPDGN
RQWMRRLVGP DGWSDSPSGM AVDPAGNVCV TGQVEGDLGH LHYGTVKYDT NGVRQWVRFY
AEISGAAAAV AVDGGGNVYV TGQSYSRDKY QDYATIKYNA NGD