Gene Sfum_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2022 
Symbol 
ID4459653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2473983 
End bp2475734 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content60% 
IMG OID639702788 
Productthaumatin, pathogenesis-related protein 
Protein accessionYP_846140 
Protein GI116749453 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.431794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACA TTCCCACTGC ACATCCGAAA ATGCCGTCCC ACAGTCTCAA GGCGGGTTTC 
CTGGCGGTCG TCCTGGCGGT CCTGCTCAGC GCCTCCCACG CCTTCGCCAC GCAGGTGCTT
TATGTCATCG GGGGAACCAC TCCGACCGTG GGGGAACTGA TCCTCAAGGA CCGCCTGGTA
AAAAGGGGAT TTGTGGTTGT GGTTCGAGAT ACCGGGAATG TCGCCGTCGC GAACGCCCTC
GACAAGGATT TGATCATCAT CTCCGAGTCG GTGGTGTCCA CAGGGCTGAA GACGCTTCTG
CGCGACGTGC ATACCCCCAT CATCTGCTTT GAACCCTACC AGTTCGACGA CCTGGGCATG
ACGGATTCCA AGCCGGGCGA ATCGTACGGA TCGGTCGAAG AGCAACAGAA CCTGGTGATC
GCCAGGCCGG GTCATCCCCT GGCCGCCTCG CTCGACGGCG TCGTGGAAGT CGCCGAACGG
GACATCAGGA TGGGATTCGG CACGCCGGGG ACGAACGCCA TCCCCATCGC CACCCTGATC
AACTCTCCGG ACATGTACGC GATTTTCGCC TACAAGGCGG GGGCGCGGAT GCCTGGGCTC
GCGGCGCCGG GCATCCGCAT CGGCTACTAT CTGCCGCGGA ACGCCCCGAA CGTGATGACG
GCCGAAGGGT GGAAACTGTT CGATGCCGCG GTGACTTGGG CCATGACCCC GCAATTGCCC
GCCATACCGC CCACTCCGCC CGGCAAGAGG ACGCTCGTTT TTTACAACAA CTGTTCCCGG
AAGATCTGGG TGGGCGCGTC CGGAACGGTT CCGGATTGTT CCGCGTGCAA CTGCGCCAAG
CAAAGCTGCC CTGAAAAACC CTGGAAGGGT ACAAGCACCG GTTTCGAGCT TCTTCCGACA
GCGAACAAAA ACACCAAGAT CATCCAGGTT CCAAACAACC TGCAATCGGC GGAATTCTGG
GCGCGCACCG GCTGCAGATG GGAGACGAAC CCCAGCTGGG AGGGGCCGCG GTTCATCTGC
GACACCGGCG ATTGCGGCAA CGCATCTGCG GGATTCAGGG TGCCGTGCAA CGGGGGCACC
AAGGCTCCCC CCGCGAACGC CCTTGAAGTC ACTTTCAATC CGTCCACGGG ATTCTTCAAC
GGAGTCGCCG TGATAAGAAC CGACACCTAC GATCTCACCA ACGTGGACGG TTACAGCCGC
GCGATCAAGG TGGAACCCCT GAAGGGAAGG TACAAGAAAG TCAGCCCCTA CAACGGACTT
CCAAAGTACA ACTGCGGGAA AGCGCAGTGC ACCTTCGACA TGAGGAAGTG CCCTCCGGAG
CTGAGCGCGG TCGACGGGAG CGGGAAGAAA GTCTGCTGGA GCCTCTGCAA GGCGGTGATG
GACCCGATCC AGCGTGAAAA GCATTCGGTG TTGAAGGCGA TCTACAACAA CCCCGACAAG
CGAGCCCTGG TGTGCTGCGC CTGCGACTGC GGGGCCGGGT GCGGGTGCGG AGACATTCAC
TGCAAGTACG GTTGTTCGCC CTACAACAAG AATCTGCCCA CCCCTCACGG AGGCATCTGC
CATTACGAGA AATGGCCCAA ACCGAATGCG ACGTGGTGCA AGAATGCCGG GCTGAGCGAA
GCGAACTGCA ACTATCAGGC GATTTACAAG AGCCAGTGCC CCGATGCATA CAGTTGGCAA
TTCAATGACA ACAGCAGCAC CTTTCAGTGC AAGGACGCCG ATTACCTGAT CACCTTCTGC
CCCAGCATGT AG
 
Protein sequence
MPNIPTAHPK MPSHSLKAGF LAVVLAVLLS ASHAFATQVL YVIGGTTPTV GELILKDRLV 
KRGFVVVVRD TGNVAVANAL DKDLIIISES VVSTGLKTLL RDVHTPIICF EPYQFDDLGM
TDSKPGESYG SVEEQQNLVI ARPGHPLAAS LDGVVEVAER DIRMGFGTPG TNAIPIATLI
NSPDMYAIFA YKAGARMPGL AAPGIRIGYY LPRNAPNVMT AEGWKLFDAA VTWAMTPQLP
AIPPTPPGKR TLVFYNNCSR KIWVGASGTV PDCSACNCAK QSCPEKPWKG TSTGFELLPT
ANKNTKIIQV PNNLQSAEFW ARTGCRWETN PSWEGPRFIC DTGDCGNASA GFRVPCNGGT
KAPPANALEV TFNPSTGFFN GVAVIRTDTY DLTNVDGYSR AIKVEPLKGR YKKVSPYNGL
PKYNCGKAQC TFDMRKCPPE LSAVDGSGKK VCWSLCKAVM DPIQREKHSV LKAIYNNPDK
RALVCCACDC GAGCGCGDIH CKYGCSPYNK NLPTPHGGIC HYEKWPKPNA TWCKNAGLSE
ANCNYQAIYK SQCPDAYSWQ FNDNSSTFQC KDADYLITFC PSM