Gene Sfum_1501 details


Gene Information

Locus tag: Sfum_1501
Symbol:
ID: 4460552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 1858213
End bp: 1860102
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 63%
IMG OID: 639702268
Product: TPR repeat-containing protein
Protein accession: YP_845625
Protein GI: 116748938
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
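
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1858213, 1860102)
print(length_bp)                  # 1890
print(protein_length(length_bp))  # 629
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed for Sfum_1501.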


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.961258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATT ACCGGATCGG AGTTTTCACA TCGAGAATAT TGGGGACAGG CCGGAGGCGG 
GGCGCGGCGC GCCGCAATCC CCGCGCCTGC GGCGCTTTCC TTGCCGTTGC GGTGGCCCTC
TGCGCGTGGG GACTTCCGGG ATGCTCCTCA AAGAACGCGG GCGACGGCGT TTCCCTTCAG
AAACACTATG CCAAGCAGAT CGAGAAGCAG AATGCAAACA TCCCCGAGGA GATAAAAAAA
GCCGAAAAGC TGCCGGAAAT GAGCGAGGCC GACTACGAGA GGATGGGTGA CACGTTTGCC
AGGCAGGGGG AAACCACCAA GGCTTTCATC AGCTACGAGA AGGTCCTGCG CAAGGAGCCC
GATCTCACCC GGGTCCGTCT GAAGCGAGGC ATGATGTTTC TTGCCAGGGA CATGAACGAT
GAGGCGATCA GGGACTTTCA ACAGGTCCTG GCGAAGGAAC CCGGCAACGC CCAGGCCTAC
GAGGGGATCG GGCACGCGCT CTTCAAGCGC AGGCGTTACG ATGAGGCGGA AAAGAACTTC
CGGGAGGCGG TGAAGCTGAA CAACAGCCTG TGGATGTCCC ACAACTTCCT GGGCATCGTC
TACGACTACA AACAGCGTCC CGAGCTGGCC GTGCCGGAAT ACCAGGCGGC CATCGCCGTC
CGACCCGACG AGGGACTGCT CTACAACAAC CTGGGAATAT CCTACGCGAT GATGGGGGAC
TTCGAGAAAG CCGCCGCCGC ATTCCAGGAA GCGATGCAGA AACGGCCCAC GAACCCGAAA
ATCAGCAACA ACCTCGGCAT TATACTGTGC AGGCTCGGAC GGCAGTCCGA AGCTTTGGCC
GCGTTCGGAA AGGCGGGTGA TGAAGCCCAG GCCTACAACA ACCTGGGATG CGCCTACATG
ATGGACGGCG AATTCGAAAA AGCGGCGCGG GCGTTCGAGA GAGCCGTCAG TCTGAGAACC
ACCTATTATG CCCAGGCCAG CGACAACCTC AAGAAAGCCG AAATGGCCCT GCATTCCAGT
GGTGCCCTGG GCCGGCAGAC GCCGCCCCCC GGCTCTCCGG TGCCCGTGTC CGGCGCCCCG
GCGGTTTCCA GGACCCCGAT TCTTCAGGAA CTCAAACGCG AGCCGATCAA GGAGCAGAAC
CTCATCGAAC CGGTCACGGC CGTGAAAGGG ACGAAAGCCG CCGAACCGGC GGACTCCGCC
AAGGTGAAGA GCGTCTCCAC GCCCGCGCCG GCACCACGGG AGCAGGCTCC CCCCAATCCC
GTGACGATGG GGAAAGAAAC GAGCGCCGTC GAACCGACGA CCGGGAAGGA AACGAACCCC
GTCAAACCGG AGGATCCCGC GAAGGCGAAG AGTTCCCCCG AACCCGCGCC GGCACCGCGG
GAGCAGGCTC CTCCCAATCC CGTGACGATG GGGAAAGAAA CGAGCGCCGT CGAACCGACG
ACCGGGAAGG AAACGAACCC CGTCAAAACG GAGGATCCCG CGAAGGCGAA GAGTTCCCCC
GAACCTGCGC CGGCACCGCG GGAGCAGCCT CCTCCCAATC CCGTGACGAC GGGGAAAGAA
ACGAGCGCCG TCGAACCGAC GACCGGGAAG GAAACGAACC CCGTCAAAAC GGAGGATCCC
GCGAAGGCGA AGAGTTCCCC CGAACCCGCG CCGGCACCGC GGGAGCAGGC TCCTCCCAAT
CCCGTGACGA TGGGGAAAGA AACGAGCGCC GTCGAACCGG CTGCCGGGAC CGGCAAAGAG
AAGCCTGACG CTGAGGCGAA GGAAACCGCC GTCCGGGAAA AGAATGTCGC TGCACCTGCG
TCAGACCTGA AAAAGTCGGG GGGTGCGGAA TCGGGAGCGG TCGGGGCACC ATCCCCACCG
GCTCCCCCGG GGGTTGCAGC CAACCCGTGA
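
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source page) clean a block-formatted sequence and compute its GC fraction, which is one way to recompute a figure such as the GC content field in Gene Information.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full block to check the whole gene.
first_line = "ATGATCGATT ACCGGATCGG AGTTTTCACA TCGAGAATAT TGGGGACAGG CCGGAGGCGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

The value printed for a single line will differ from the whole-gene figure, since GC content varies along the sequence.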
 
Protein sequence
MIDYRIGVFT SRILGTGRRR GAARRNPRAC GAFLAVAVAL CAWGLPGCSS KNAGDGVSLQ 
KHYAKQIEKQ NANIPEEIKK AEKLPEMSEA DYERMGDTFA RQGETTKAFI SYEKVLRKEP
DLTRVRLKRG MMFLARDMND EAIRDFQQVL AKEPGNAQAY EGIGHALFKR RRYDEAEKNF
REAVKLNNSL WMSHNFLGIV YDYKQRPELA VPEYQAAIAV RPDEGLLYNN LGISYAMMGD
FEKAAAAFQE AMQKRPTNPK ISNNLGIILC RLGRQSEALA AFGKAGDEAQ AYNNLGCAYM
MDGEFEKAAR AFERAVSLRT TYYAQASDNL KKAEMALHSS GALGRQTPPP GSPVPVSGAP
AVSRTPILQE LKREPIKEQN LIEPVTAVKG TKAAEPADSA KVKSVSTPAP APREQAPPNP
VTMGKETSAV EPTTGKETNP VKPEDPAKAK SSPEPAPAPR EQAPPNPVTM GKETSAVEPT
TGKETNPVKT EDPAKAKSSP EPAPAPREQP PPNPVTTGKE TSAVEPTTGK ETNPVKTEDP
AKAKSSPEPA PAPREQAPPN PVTMGKETSA VEPAAGTGKE KPDAEAKETA VREKNVAAPA
SDLKKSGGAE SGAVGAPSPP APPGVAANP
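
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a bioinformatics library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; alternative start codons are not handled, which is an assumption that is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATCGATT ACCGGATCGG AGTTTTCACA"))  # MIDYRIGVFT
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence should yield the full translation under the same assumptions.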