Gene Sfum_2005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_2005 
Symbol 
ID4459671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp2454394 
End bp2455821 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content64% 
IMG OID639702771 
Productprotease Do 
Protein accessionYP_846123 
Protein GI116749436 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTCG CGGCCTTCTT TGTGGGCGGT TTGGCGGCTT CATCGGGATT GAATCCCGGT 
AGACTTCTCG CGGTGCCGGA TGCGGCCCAG GCAGCGCCCG CTGAAGGGGA TCAGCCAGGA
GTAACCCCAA CCAGCCCGTT CGCCACGCTG GCGGCAAAAC TCACGCCCGT GGTGGTCAAC
GTCAGGGTGA CCAAAATCGA ACGGGCGGAG TTCCCCGATT TTGAAGGACC GGAGCAGCCG
TTCGGAGACT TTTTCAGGCA TTTTTTCGGC GACCGGCGGG GATTTCCGAA TGTCCCGGCG
CAGGGCGCAG GTTCGGGAGT GATCATCCGC GGCGACGGGT ATGTCCTGAC CAACAATCAC
GTGGTTGAAG GCGCCAGGGA AGTGACCGTG ACGCTTTCCG ACAAGCAGGA ACACAAAGCG
CGAATCGTCG GGCGGGATGC CAAGACCGAC CTCGCGCTTC TCAAAATTGA AGCGGGCAAA
AGCCTGCCTG CCGCCAGCCT GGGCGATTCC GACCAACTCA AGGTCGGGGA TTGGGTGATG
GCCATCGGCA ACCCGTTCGG TCTCAGTGAA ACGGTCACTT CCGGGATCGT CAGCGCCAAA
GGCCGCGTCA TTGGGGCGGG CCCCTATGAC GACTTCATCC AGACCGATGC CTCGATCAAC
CCGGGCAATT CGGGAGGACC GCTTTTCAAT ATGAAGGGCG AAGTCGTGGG GATCAACACC
GCCATCATCC CGAACGCCCA GGGAATCGGA TTCGCCATTC CCGTCAACAC GGCCAAGCCG
CTGATTCCTC AGCTGGAAAC CAAAGGCGAA GTGACTCGGG GGTACCTGGG AGTCAGCATC
CAGTCGATCA CGCCCGATCT TGCCTCGGCA ATGGGGCTGG GTGACGGGAA GGGAGCGCTG
GTGGCGGACG TCGTTGAAGG CGGTCCCGCC GACAGGGCCG GGATCCGGCG CGGGGACGTG
ATCCTCGCCT TTGGAGGCAA GGACGTCAAA GACAGTCACG ATCTCTCGTT CATGGTCGCC
GCGGCCCCGG TGGGCAGGGA ATCCGCGGTG ACGATCATGC GGGAGGGCGT CGAGCGGCGG
CTGGACGTCA AGATCGGAAA ACAGGAATCC GAGGAAGGGG CGAAGGAGGA ATTTTCGAAA
CAGGCTCACG GCAAATGGGG CCTCCAGCTC CGGGATGTGC CTCCCCGGGT TGCGGAAGAG
CTCGGCCTCG AGTCGGAGCG CGGGGCACTC GTGGCCGGCG TTCTCCCGGG AAGCCCGGCG
GATCGGGCCG CCCTGCGGCA GGGGGATGTC ATCCTGGAGG TCAATCGTCA GCCCGTAACA
TCGGCGAGCG AGCTCAAAGA AAGGATTGCC GGGGCGGGCG AGCGGGGTGC CCTGGTTCTC
CTCGTGCAGA GCAGTCGGGG GACGAGGTAC GTCGTGCTGA AGGGCTGA
 
Protein sequence
MVVAAFFVGG LAASSGLNPG RLLAVPDAAQ AAPAEGDQPG VTPTSPFATL AAKLTPVVVN 
VRVTKIERAE FPDFEGPEQP FGDFFRHFFG DRRGFPNVPA QGAGSGVIIR GDGYVLTNNH
VVEGAREVTV TLSDKQEHKA RIVGRDAKTD LALLKIEAGK SLPAASLGDS DQLKVGDWVM
AIGNPFGLSE TVTSGIVSAK GRVIGAGPYD DFIQTDASIN PGNSGGPLFN MKGEVVGINT
AIIPNAQGIG FAIPVNTAKP LIPQLETKGE VTRGYLGVSI QSITPDLASA MGLGDGKGAL
VADVVEGGPA DRAGIRRGDV ILAFGGKDVK DSHDLSFMVA AAPVGRESAV TIMREGVERR
LDVKIGKQES EEGAKEEFSK QAHGKWGLQL RDVPPRVAEE LGLESERGAL VAGVLPGSPA
DRAALRQGDV ILEVNRQPVT SASELKERIA GAGERGALVL LVQSSRGTRY VVLKG