Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sfum_1411 |
Symbol | |
ID | 4461347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | + |
Start bp | 1751446 |
End bp | 1753125 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639702179 |
Product | extracellular solute-binding protein |
Protein accession | YP_845537 |
Protein GI | 116748850 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGAAAT CGAAACGACG GGAAATCCAC GCATCGCGTG ACGGCAGGAC GATCGCGGGT CGGATGGCGA TCTTTCTGAG TCTGCTGCTC CTTTTCGCCT GGCAGGCCGT TTCCGGGGAT GTCGCGGCAG CAGCCGATCC GGTCCCGGTC GACGGCGACT GGGTGATCGG CAACCTCGGC AGCGAGCCGC CGACCCTCAA TCCGATCACG TCCACGGACT TGTCCGCATC GACGATTCAG GAGTACATCT ATGAAACCCT GATACGTCGA AACCCCGAGA CAATGAAGCT CGAACCGCTG CTGGCCGAGA GTTGGCAAGT CGCCGAAGAC CACCTCACCT ACACGTTTCA TTTGAGGAAA AATATCGCCT GGGCGGACGG GCAACCGTTC ACCGCAAGAG ACATCCGCTA TTCCTTCGAC CGCATCCGCG ACCCTGCCGT GGACGCGGCC CACCTCAGGA ACTACTACCA GGACATAGAG CGCCTGGAAG TCCTGGACGA CCACACCGTG CGATTCCACT ATCGCATTCC CTATTTCCTG GCGCTCCAAT TCTGCGGCGG CATTCCGATC GTTCCGGTCC ACCTGTTCAA GCCGGGAGAG GACTTCAACA AGCACCCCAT CGCCCGGAGT CCGGTCGGCA CGGGCCCCTA CCGGCTCCTG CACTGGCGAA CCGGGGAGGA AATCGTGCTG GTGCGCAACG AAGCCTATTG GGGAGTGAGA CCTCATCTCG ACAGACTCGT TTTCAAAATC ATCCCTGATC CGACCGTGGC TTTGCACGTG CTCAAACAGG GCGGTTTGGA CGTGTCCGGT CTGCTGCCCA TCCAGTGGGT CAAACAGACT CAGAGCGAAC GTTTCCAGGA GATGTTCCGG AAACTCAAAT ACTATACGCC GAGGTACAAC TATGTCGGTT GGAACATGAA ACGGCCCCTG TTCGCCGATC GCAGGGTGCG GGTGGCCATG ACCATGCTGA TCGACCGGGA GACGATCCTC AGGAGAGTCC TCTTTGGATT CGGCACCGTG GTGTCGGGCA CGTTTTACGT CAACAGCCCG GAGTACAACA GGAACATCAA GCCCTGGCCG TATGATCCCG CGGCAGCGCT GGCACTGCTC GAGCAGGCCG GATGGAAGCG GCGCGACGGC AACGGGCCTT TGGAAAAGGA TGGGACGCCG TTTCAATTCG AGTTCATTCT TCCGGCCGGC TCGAAAATCG GCGAGCAGAT CGCCACCATG TTCCAGGAAA ACCTGAAGCA GGTCGGAATC CGGATGGAGA TTCGAAAGCT CGAATGGGCG GTATTCATCC AGAAGATCGA CAGCAGGAAC TTCGATGCCT GCACGCTTGG GTGGAGCCTG GGCTGGGAGT CGGACCCCTA CCAGATCTGG CACTCTTCCA TGGCGGAAAA GGGATCGAAT TTCGTCGGAT TCAGAAATGA AGAGGCAGAC CGGATCATCG AGGCGGCCCG CCAGGAGTTC GACCCCGAGA AACGCTACCG ACTCTATCAT CGGTTCGGGG AGATTCTCCA CGAGGAACAG CCCTACACTT TTCTGTTCAC GACGGAGACC CTTGCAGCCG TGGCCCGGCG CTTCGAAAAC GTGAAGGTCT ATGCAATGGG GCCGGAACGC AAGGAATGGT GGGTGCCCAA GGCGCTTCAG AAGTACCCGA TCGTCAGGAA ACCGGAATAG
|
Protein sequence | MKKSKRREIH ASRDGRTIAG RMAIFLSLLL LFAWQAVSGD VAAAADPVPV DGDWVIGNLG SEPPTLNPIT STDLSASTIQ EYIYETLIRR NPETMKLEPL LAESWQVAED HLTYTFHLRK NIAWADGQPF TARDIRYSFD RIRDPAVDAA HLRNYYQDIE RLEVLDDHTV RFHYRIPYFL ALQFCGGIPI VPVHLFKPGE DFNKHPIARS PVGTGPYRLL HWRTGEEIVL VRNEAYWGVR PHLDRLVFKI IPDPTVALHV LKQGGLDVSG LLPIQWVKQT QSERFQEMFR KLKYYTPRYN YVGWNMKRPL FADRRVRVAM TMLIDRETIL RRVLFGFGTV VSGTFYVNSP EYNRNIKPWP YDPAAALALL EQAGWKRRDG NGPLEKDGTP FQFEFILPAG SKIGEQIATM FQENLKQVGI RMEIRKLEWA VFIQKIDSRN FDACTLGWSL GWESDPYQIW HSSMAEKGSN FVGFRNEEAD RIIEAARQEF DPEKRYRLYH RFGEILHEEQ PYTFLFTTET LAAVARRFEN VKVYAMGPER KEWWVPKALQ KYPIVRKPE
|
| |