Gene Sfum_1411 details


Gene Information

Locus tag: Sfum_1411
Symbol:
ID: 4461347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 1751446
End bp: 1753125
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 59%
IMG OID: 639702179
Product: extracellular solute-binding protein
Protein accession: YP_845537
Protein GI: 116748850
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
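
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes one stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1751446
END_BP = 1753125
GENE_LENGTH_BP = 1680
PROTEIN_LENGTH_AA = 559


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1680, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 559, matches Protein Length
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.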


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAAAT CGAAACGACG GGAAATCCAC GCATCGCGTG ACGGCAGGAC GATCGCGGGT 
CGGATGGCGA TCTTTCTGAG TCTGCTGCTC CTTTTCGCCT GGCAGGCCGT TTCCGGGGAT
GTCGCGGCAG CAGCCGATCC GGTCCCGGTC GACGGCGACT GGGTGATCGG CAACCTCGGC
AGCGAGCCGC CGACCCTCAA TCCGATCACG TCCACGGACT TGTCCGCATC GACGATTCAG
GAGTACATCT ATGAAACCCT GATACGTCGA AACCCCGAGA CAATGAAGCT CGAACCGCTG
CTGGCCGAGA GTTGGCAAGT CGCCGAAGAC CACCTCACCT ACACGTTTCA TTTGAGGAAA
AATATCGCCT GGGCGGACGG GCAACCGTTC ACCGCAAGAG ACATCCGCTA TTCCTTCGAC
CGCATCCGCG ACCCTGCCGT GGACGCGGCC CACCTCAGGA ACTACTACCA GGACATAGAG
CGCCTGGAAG TCCTGGACGA CCACACCGTG CGATTCCACT ATCGCATTCC CTATTTCCTG
GCGCTCCAAT TCTGCGGCGG CATTCCGATC GTTCCGGTCC ACCTGTTCAA GCCGGGAGAG
GACTTCAACA AGCACCCCAT CGCCCGGAGT CCGGTCGGCA CGGGCCCCTA CCGGCTCCTG
CACTGGCGAA CCGGGGAGGA AATCGTGCTG GTGCGCAACG AAGCCTATTG GGGAGTGAGA
CCTCATCTCG ACAGACTCGT TTTCAAAATC ATCCCTGATC CGACCGTGGC TTTGCACGTG
CTCAAACAGG GCGGTTTGGA CGTGTCCGGT CTGCTGCCCA TCCAGTGGGT CAAACAGACT
CAGAGCGAAC GTTTCCAGGA GATGTTCCGG AAACTCAAAT ACTATACGCC GAGGTACAAC
TATGTCGGTT GGAACATGAA ACGGCCCCTG TTCGCCGATC GCAGGGTGCG GGTGGCCATG
ACCATGCTGA TCGACCGGGA GACGATCCTC AGGAGAGTCC TCTTTGGATT CGGCACCGTG
GTGTCGGGCA CGTTTTACGT CAACAGCCCG GAGTACAACA GGAACATCAA GCCCTGGCCG
TATGATCCCG CGGCAGCGCT GGCACTGCTC GAGCAGGCCG GATGGAAGCG GCGCGACGGC
AACGGGCCTT TGGAAAAGGA TGGGACGCCG TTTCAATTCG AGTTCATTCT TCCGGCCGGC
TCGAAAATCG GCGAGCAGAT CGCCACCATG TTCCAGGAAA ACCTGAAGCA GGTCGGAATC
CGGATGGAGA TTCGAAAGCT CGAATGGGCG GTATTCATCC AGAAGATCGA CAGCAGGAAC
TTCGATGCCT GCACGCTTGG GTGGAGCCTG GGCTGGGAGT CGGACCCCTA CCAGATCTGG
CACTCTTCCA TGGCGGAAAA GGGATCGAAT TTCGTCGGAT TCAGAAATGA AGAGGCAGAC
CGGATCATCG AGGCGGCCCG CCAGGAGTTC GACCCCGAGA AACGCTACCG ACTCTATCAT
CGGTTCGGGG AGATTCTCCA CGAGGAACAG CCCTACACTT TTCTGTTCAC GACGGAGACC
CTTGCAGCCG TGGCCCGGCG CTTCGAAAAC GTGAAGGTCT ATGCAATGGG GCCGGAACGC
AAGGAATGGT GGGTGCCCAA GGCGCTTCAG AAGTACCCGA TCGTCAGGAA ACCGGAATAG
 
Protein sequence
MKKSKRREIH ASRDGRTIAG RMAIFLSLLL LFAWQAVSGD VAAAADPVPV DGDWVIGNLG 
SEPPTLNPIT STDLSASTIQ EYIYETLIRR NPETMKLEPL LAESWQVAED HLTYTFHLRK
NIAWADGQPF TARDIRYSFD RIRDPAVDAA HLRNYYQDIE RLEVLDDHTV RFHYRIPYFL
ALQFCGGIPI VPVHLFKPGE DFNKHPIARS PVGTGPYRLL HWRTGEEIVL VRNEAYWGVR
PHLDRLVFKI IPDPTVALHV LKQGGLDVSG LLPIQWVKQT QSERFQEMFR KLKYYTPRYN
YVGWNMKRPL FADRRVRVAM TMLIDRETIL RRVLFGFGTV VSGTFYVNSP EYNRNIKPWP
YDPAAALALL EQAGWKRRDG NGPLEKDGTP FQFEFILPAG SKIGEQIATM FQENLKQVGI
RMEIRKLEWA VFIQKIDSRN FDACTLGWSL GWESDPYQIW HSSMAEKGSN FVGFRNEEAD
RIIEAARQEF DPEKRYRLYH RFGEILHEEQ PYTFLFTTET LAAVARRFEN VKVYAMGPER
KEWWVPKALQ KYPIVRKPE
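
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last lines of the gene sequence. This is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and that an alternative start codon such as GTG is read as methionine when it is the first codon. The `START_CODONS` set is a deliberately small subset chosen for illustration, and all function names are hypothetical.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: illustrative subset of start codons; the first codon is read as M.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
FIRST_LINE = "GTGAAGAAAT CGAAACGACG GGAAATCCAC GCATCGCGTG ACGGCAGGAC GATCGCGGGT"
LAST_LINE = "AAGGAATGGT GGGTGCCCAA GGCGCTTCAG AAGTACCCGA TCGTCAGGAA ACCGGAATAG"

print(translate(FIRST_LINE))                        # MKKSKRREIHASRDGRTIAG
print(translate(LAST_LINE, first_is_start=False))   # KEWWVPKALQKYPIVRKPE
```

Both fragments match the start and end of the protein sequence listed above. Each 60-base display line holds a whole number of codons, so any line can be translated in frame on its own.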