Gene Sfum_3414 details

Gene Information

Locus tag: Sfum_3414
Symbol:
ID: 4458285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 4179355
End bp: 4180455
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 61%
IMG OID: 639704186
Product: peptidase M42 family protein
Protein accession: YP_847522
Protein GI: 116750835
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
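
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes inclusive, 1-based start and end coordinates and one stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the record above.
start_bp = 4179355
end_bp = 4180455

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is the stop
# codon and does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1101 0 366
```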


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.774179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0330149
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAACG ATCCTGTAAC AGCCGCCGCA ATTCACTTGC TGAAGAGCCT CGCCGAAGCT 
CCCGGCGCGC CGGGTCATGA AGACGCCGTG CGCCGTATCT TCCGGACGGA AGTGGGGGGG
GACACCACCA CGGACAAAAC CGGGAGCATC ATTTACACGA AAAAAGGAAC TTCCGAAACC
CCTCGGATCA TGCTTGCCGC GCACATGGAT GAAGTCGGGT TCGTGGTGCA GAGCGTCACG
CGGGAAGGAC TGATCCGGTT CCTCCCGTTG GGCGGCTGGT GGCCGCATAC GATCCTGGCC
AAACGGGTGA GGATAATCAC CCGCAACAAC ACGGAGATCA TCGGCGTCGT GGGGGCCAAG
CCACCCCATT TCCTGACCGA CGCCGAACGT GAAAAAGTGA TGAAAATCGA AGATATGTTC
ATCGATGTGG GAGCGCGCGA CGCGGTGGAT GTCCGGGATC GTTTCGGGAT CGAGGTGGGA
GACAGCATCG TTCCCGACAG CAGCTTCACC GTGCTGCACG ATCCCGACGT CTTCCTGTGC
AAGGCGTTCG ACAACCGGGT GGGGATGGCC GTGGTCATAC ATGCCGCCGC CATGCTGATG
TCGATGACGC ACCACAACAC GGTTTGCGCG GTGGGAACGG TCCAGGAGGA AGTCGGGGTG
AGGGGCGCCC AGACCGCGGC GCACGCCGTC AATCCCGACG CGGCCATCAT CCTGGAAGGC
ACGCCGGCGG ACGATCTGCC AGGAACGACG GAAGAGGAGC GACAGGGAAA ACTGCGGGGA
GGCGTCCAGA TCAGGCTCAT GGACCCGTCG GCGATCATGA ATCGCAAGTT CAGCCGATAT
GCCGTCGAAC TGGCCCGAGA ACACGGAATC GCGCACCAGG TCGCAGTCCG CCGAAGCGGA
TCGACCGACG CCCGTGCCGT TCATCTGACG CGGGAAGGCG TGCCGACCAT CGTTCTCGGC
GTACCCTCCC GTTACATCCA TACCCACAAC GGGCTCGTTC ACATGGAGGA CTACCTGAGC
GCACTCGACC TGGTCATGAA ATTGCTCGAA CGGCTCGACG AGGATGCCGT TCGGTCGTTT
GTTACCTACG ACGACAAGTA G
 
Protein sequence
MLNDPVTAAA IHLLKSLAEA PGAPGHEDAV RRIFRTEVGG DTTTDKTGSI IYTKKGTSET 
PRIMLAAHMD EVGFVVQSVT REGLIRFLPL GGWWPHTILA KRVRIITRNN TEIIGVVGAK
PPHFLTDAER EKVMKIEDMF IDVGARDAVD VRDRFGIEVG DSIVPDSSFT VLHDPDVFLC
KAFDNRVGMA VVIHAAAMLM SMTHHNTVCA VGTVQEEVGV RGAQTAAHAV NPDAAIILEG
TPADDLPGTT EEERQGKLRG GVQIRLMDPS AIMNRKFSRY AVELAREHGI AHQVAVRRSG
STDARAVHLT REGVPTIVLG VPSRYIHTHN GLVHMEDYLS ALDLVMKLLE RLDEDAVRSF
VTYDDK
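
The gene and protein sequences above can be cross-checked against each other. This is a minimal sketch, not the database's own pipeline. It assumes the 10-base and 10-residue blocks are separated only by whitespace, and it uses the codon assignments that translation table 11 shares with the standard genetic code (the two tables differ in permitted start codons, not in how internal codons are read). The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last stretches of the gene sequence listed above.
print(translate("ATGCTGAACG ATCCTGTAAC"))   # MLNDPV
print(translate("GTTACCTACG ACGACAAGTA G"))  # VTYDDK
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives an end-to-end check. `gc_content` on the full gene sequence can be compared with the GC content field in the record.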