Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bamb_3271 |
Symbol | |
ID | 4312130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria AMMD |
Kingdom | Bacteria |
Replicon accession | NC_008391 |
Strand | + |
Start bp | 57216 |
End bp | 58838 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638151113 |
Product | sulfatase |
Protein accession | YP_775160 |
Protein GI | 115358022 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.755541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.42944 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAA TTCTTGTGAT GTACGACAGC CTCAATCGGC ACATGCTCCC GCCTTATGGC AACGAGTGGG TCAAGGCGCC GAATTTCCAG CGGCTTGCCG AGCGGAGTGC GACGTTCGAC AACTGCTATA TCGGCAGCAT GGCGTGCATC CCCGCCCGCA GGGAATTGCA TACGGGGCGC TACAACTTCC TGCATCGAAG CTGGGGGCCG CTCGAGCCGT TCGACGATTC GATGCCGAAC ATGCTTCGAC GCGAGGGCAT CCACACGCAT CTGGCCAGCG ATCATCCGCA CTATTGGGAA GATGGCGGTG CGACTTATCA CACGCGGTAC AGCAGTTGGG AGTTCTTTCG CGGGCAGGAG GGCGACCCGT GGAAGGGGCA AATCCGGCAG CCGCCGATTC CGGCACGGCT CAAGGGGGCG CTGAACGAGA AGGCCGCGCG ACAAGACTTC GTCAACCGCC AGTACTATCC GACCGAAGAT CTCCATCCGC AAACGCTGAC GTTCGACGCA GGCGAGGCGT TCATCCGGGA AAATCACGAT GCGGATTCGT GGTTGCTTCA GATCGAGACT TTCGATCCAC ATGAACCATT CACCAGTTAT CCGAAGTATC GCGATCTCTA TCCTCGAACG AATACGGACG CGCATTTCGA CTGGCCACCA TACGGTCCGG TATCCGGCAA CGAAACCGAC GACGACATTC GGCACGCGCG ATCCGAATAT GCCGCGCTTG TATCCATGTG CGATCACTCG TTGGGGCGGA TTCTCGACCT GATGGATGAA CTGAATCTGT GGCAGGACAC GATGCTCATC GTATGCACGG ATCATGGCTA TATGCTCGGA GAACACGGTT GGTGGGCCAA GACGGCGCAA CCCTGGTTCG ACGAGCTCGC GCATACGCCG CTTTTCATCT GGGACCCGCG TTCCCGCCGT GCGGGCGTTC GGCGCCAGTC GCTCGTGCAG ATGATCGACA TCGCGCCGAC GCTGCTCGAT TTCTTCGGCG TGCAAGCGAC GCCGCGCATG CAGGGCGTGG CGCTCGCGAA AACCGTCGAC GAGGATCTGC CGGTTCGGGA GGCGGCGCTG TTCGGCATGC ATGGCGAACA CGTCAACGTG ACGGACGGGC GTTACGTATA TATGCGGTCA GGGCCCGACC CGGACCGCAA CGTGCCGGTG TATGAATACA CGTTGATGCC CACGCACATG CGCGAGATGT TCAGCGTCGA CGAACTGCAG GATATCGAGT TGGCACCGCC GTTTCCGTTT TCACAGGGCG TGAGCATGAT GAAGATCGCG AGCCGGCCCA TTGGATATTC GTATCGTGCC GGAACGCTGC TGTTCGACTT GCAGACGGAT CCCGGACAGG CGCAACCGTT GTACGACGCC GCCGTGGAGC AGCGCATGGC GACACTCGCC CGGGATCTGA TGCGTGCCAG CGACGCACCG GCCGAGCAGT ATGAACGTTT GGGTTTGCCG GCCGCGGGCC CGATCGACCA CGATCAACTG CTTGCCCGTG CCCACGCGGA TCGATCGGAG GCGCTACGTC TGCATCTGGC CACGCGTGCG CAAGCGATGC GCGCTGAAGT GGAGGGCCGG CGCGTTCGCG CCGAAGCATC GCCGGACCAT TGA
|
Protein sequence | MKAILVMYDS LNRHMLPPYG NEWVKAPNFQ RLAERSATFD NCYIGSMACI PARRELHTGR YNFLHRSWGP LEPFDDSMPN MLRREGIHTH LASDHPHYWE DGGATYHTRY SSWEFFRGQE GDPWKGQIRQ PPIPARLKGA LNEKAARQDF VNRQYYPTED LHPQTLTFDA GEAFIRENHD ADSWLLQIET FDPHEPFTSY PKYRDLYPRT NTDAHFDWPP YGPVSGNETD DDIRHARSEY AALVSMCDHS LGRILDLMDE LNLWQDTMLI VCTDHGYMLG EHGWWAKTAQ PWFDELAHTP LFIWDPRSRR AGVRRQSLVQ MIDIAPTLLD FFGVQATPRM QGVALAKTVD EDLPVREAAL FGMHGEHVNV TDGRYVYMRS GPDPDRNVPV YEYTLMPTHM REMFSVDELQ DIELAPPFPF SQGVSMMKIA SRPIGYSYRA GTLLFDLQTD PGQAQPLYDA AVEQRMATLA RDLMRASDAP AEQYERLGLP AAGPIDHDQL LARAHADRSE ALRLHLATRA QAMRAEVEGR RVRAEASPDH
|
| |