Gene Information |
Locus tag | Sfum_2156 |
Symbol | |
ID | 4459545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Syntrophobacter fumaroxidans MPOB |
Kingdom | Bacteria |
Replicon accession | NC_008554 |
Strand | - |
Start bp | 2636525 |
End bp | 2637988 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639702922 |
Product | sulfatase |
Protein accession | YP_846273 |
Protein GI | 116749586 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
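
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the final codon is a stop codon, which is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2636525, 2637988)
print(length)                  # 1464
print(protein_length(length))  # 487
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.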
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCA TCTCGGAATC GATGAAAAGC CGGCGGCACT TTCTGAAGCT TCTTTCGGCT GGTGCCGCTT CGCTCAGTCT CTGGCCGGGC ATCGGCCCTC ATATCACTCA TGCCGACGCT CAGGCCGTGC CGCAGGGCAA GCCGAACGTG CTCATGTTCG TCCTCGACGA CATGAACGAC TGGATCGGAT GCCTCGGCGG CCACCCGGAC GTCAAGACGC CGAACATCGA CCGGCTGGCT CAACGAGGCG TGCTGTTCAG GAACGCCCAA TGCTCGTCCC CGATCTGCAG TCCCTCCCGC GCCAGCTTCT TTACCGGAAT CCGACCCTCC ACGTCCGGCA TTTACGGGAA CTCCCAGGCT TTTCGCAAGA TCATGCCGAA TGCGGTGACC CTGCCTCAGC ATTTCATCGC GCATGGATAC CGCTCAATGG GATGCGGGAA GCTTTTCCAT TTCATCAAAA CCGATTCGCG GTCCTGGCAC GAGTTCTTTC CGTCCAGGAG CATGGAGCGA CCGTTCGATC CCGTTCCGCC GAACGCTCCC CTGAGCGGTC TGCCGGATGT CAACCAATTC GATTGGGGTC CCATCGACAT TGTCGACGAG GAGTTGGGCG ACGGAAAGTT GGCGCGCTGG GCGGCCGATG CCCTCAGGAG ACGATATGAC CGGCCCTTCT TTCTGGGCGT CGGCCTCCTC AGACCGCATG TCCCCCTGTA CGTTCCGCGG AAGTACTTCG ACATGTATCC GCCGGAATCG ATCACCCTGC CGACGGTGAA AGCAAACGAC CTCGACGATG TGCCGCCCAC CGGCGTGTCC TGGGCGAAGC CCGAACGGCA TCAGTTGATC GTGGAGCACG ATCAATGGCG AAAAGCCGTG GCCGGGTACC TGGCCAGCGT CAGCTTCGTG GACGCGCAGG TTGGGTGGGT GCTCGACGCC CTGGACGAAA GCCCCTACGT GAACAACACG GTCGTGGTGC TCTGGGGAGA CAACGGGTGG CATCTGGGGG AAAAATTGCA CTGGACAAAG CTCACCCTGT GGGAGGAATC GTGCCGGGTG CCCCTCATCA TCGCATTGCC GGGCCTCACC CCTCCGGGAA GAAAGTGTGC AAAACCCGTG AGCACCATGG ACGTTTACCC CACCCTCAAC GAGCTTTGCG ACCTGACACC CAAACCGGAA CTGGAGTGCC GCAGCATCCT TGAATTGCTG CGGAACCCGC AGTCGGACAC ATGGGACGGA CCCCCTGCTC TCAGCACATA TATGCCGGGC AATCACTCTC TTCGCGATGA GCGGTATCGC TACATCCGTT ACAACGACGG GACGGAAGAG CTTTACGATC TCAAGGCCGA CCCGATGGAA TGGAACAACC TTCTGGCGGG CGGGGGGACA GGCCCTGCCG GTGTGAGAGA TCGCCTGTCC GCCTTCCTGC CGAAGTTCAA CGCGCCCCAG GCGCCCCTGG TGCATCGTTT CTGA
Protein sequence | METISESMKS RRHFLKLLSA GAASLSLWPG IGPHITHADA QAVPQGKPNV LMFVLDDMND WIGCLGGHPD VKTPNIDRLA QRGVLFRNAQ CSSPICSPSR ASFFTGIRPS TSGIYGNSQA FRKIMPNAVT LPQHFIAHGY RSMGCGKLFH FIKTDSRSWH EFFPSRSMER PFDPVPPNAP LSGLPDVNQF DWGPIDIVDE ELGDGKLARW AADALRRRYD RPFFLGVGLL RPHVPLYVPR KYFDMYPPES ITLPTVKAND LDDVPPTGVS WAKPERHQLI VEHDQWRKAV AGYLASVSFV DAQVGWVLDA LDESPYVNNT VVVLWGDNGW HLGEKLHWTK LTLWEESCRV PLIIALPGLT PPGRKCAKPV STMDVYPTLN ELCDLTPKPE LECRSILELL RNPQSDTWDG PPALSTYMPG NHSLRDERYR YIRYNDGTEE LYDLKADPME WNNLLAGGGT GPAGVRDRLS AFLPKFNAPQ APLVHRF
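
The relationship between the two sequences above, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the codon table is the standard one, whose codon-to-residue assignments translation table 11 shares. Table 11 also allows alternative start codons, which are read as Met at the first position; that case is not handled here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases of the gene sequence listed above.
print(translate("ATGGAAACCA TCTCGGAATC G"))  # METISES
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields of this record.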