Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B0483 |
Symbol | |
ID | 3752247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 538018 |
End bp | 539853 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637765331 |
Product | arylsulfatase A like protein |
Protein accession | YP_371241 |
Protein GI | 78061333 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGC TGATCAATCC CGCAGGTTCA CCGGAAGCGA ATCGGGCGTT TCCCGACAGC CGGCGTCGCG TGTTTCTCAA GACGTCCGGC GCCGTCGCGA TGACGGCGGC CCTCGCGCCG TCCGTGGCGA TGGCCGGCGG ATCGGCGGCG AAGCCCGTGG CGAACGAGTC GAACCCTGCG CTGTCCGCGC GATTCGATCT GCCGGCCGGC TACAACATCC TGTTCGTGCT CGTCGACCAG GAGCGCTATT TCGACGCATG GCCGATGAGC GTGCCGGGGC GGGAGCGTCT CGCCAGGAGC GGCATCAGTT TCATCAACCA CCAGATTGCC GCCTGCGTGT GCTCGCCGTC GCGCTCGACC ATCTATACGG GCCAGCACAT GCAGCGCACG GGTGTGTTCG ACAACGCGGG GCTACCGTGG CAACCGGACA TGCCGACGTC GATCCGGACC GTCGGTCACA TGATGAAGGA CGCCGGCTAT CAGGCCGTGT ACGTCGGCAA GTGGCATCTG AGCGCGACGA TGCACGAATC GAATTCGCCG TACAACGCGC CGGTGGCCGA TTACAACAAA GCGATGCGCT CGTACGGCTT CGACGATTAC TTTGGCGTCG GCGACCTCGT CGGCTCCGCG CATGGCGGCT ACAACTTCGA CGGCGTGACC ACCCAGGCCG CCATCAGCTG GATGCGCGAG CAGCGGCGCA ATGCGGCCGG CGCGAAGCCG TGGATGCTCG CGGTCAACCT CGTGAATCCT CATGACGTGA TGTGGCTCAA CACCGACCCG TCGGGGCGGC CGAACGGGTC CGGGTTGATT CCGACGCGTC CGGCGCCGGA TACGCAGCTA TACGGTGCGC ACTGGGACAA GGTGCCGCTG CCGGTATCGC GCCGCCAGCC GCTCGCCGCG CCGGATCGGC CGAAGGCGCA CGCGATGTAT TCCGCCGCCC ACGAGGCGCT GATCGGCAAG ATCGAATTCG ACGATGCGAC GGTCAAGCGT TATCAGGATT ACTACCTGAA TTGCATACGC GACTGCGATC GGCACGTCGA GCGCCTGCTC GACGAACTGG ACGATCTCGG CATCGCCGAC AAGACGATCG TCGTGCTGAC GTCCGATCAC GGCGATCTGG CGGGCCATCA CCAGATGATC GACAAGGGTG CGAACGCGTA TCGGCAACAG AATCACGTGC CGATGATCGT CCGGCATCCG GCATTTCGCG GCGGGAAGTC GTGCCGCGCG CTCACTTCGC ATCTCGATGT CGCGCCGACG CTCGTCGCGT TGACCGGTGC GCCGGCCGAC AAGGTTGCGA GTGTCGTCGG CCCCGATGCG AAAGGGTCCA GCTTCGCCCA TCTGCTCGCG CAGCCGGAAC GGGCGAGCGT GCATGCGATC CGCGACGCGG CGCTGTTCAA TTACGCGATG CTGCTTTACT ACGACAGCGA ATGGATGCTC GCCGAGTTCA GGACGATGCG GGACCGGGGC GTGCCGCCCG ACGAGATGCA CCGGCGGGCG GCCGCGCTGC AACCGGATCT GGCGCAGCGC GGCGCGATCC GCAGCGTGTT CGACGGCCGG TATCGGTTCA GCCGCTACTT TGCGCTATCG AATTTCAACG AGCCGGATAC GCTCGCCGAT CTGACGGCCG CCAATGACCT CGAGCTGTTC GATCTTCATA CCGATCCGGA CGAGATGCAC AACCTGGCGA TGCGTCCGGA CCTGCACGGC GCGTTGATGG TGGAGATGAA TGCCAAACTC AACCGGCTGA TCCGGGAGGA AGTCGGTCAG GACGACCTGT CGAGCCTGCC GTTCAAGGAC GGCAGGCTGC AGTTTCAATT CAGGGCGCAC GCCTGA
|
Protein sequence | MPELINPAGS PEANRAFPDS RRRVFLKTSG AVAMTAALAP SVAMAGGSAA KPVANESNPA LSARFDLPAG YNILFVLVDQ ERYFDAWPMS VPGRERLARS GISFINHQIA ACVCSPSRST IYTGQHMQRT GVFDNAGLPW QPDMPTSIRT VGHMMKDAGY QAVYVGKWHL SATMHESNSP YNAPVADYNK AMRSYGFDDY FGVGDLVGSA HGGYNFDGVT TQAAISWMRE QRRNAAGAKP WMLAVNLVNP HDVMWLNTDP SGRPNGSGLI PTRPAPDTQL YGAHWDKVPL PVSRRQPLAA PDRPKAHAMY SAAHEALIGK IEFDDATVKR YQDYYLNCIR DCDRHVERLL DELDDLGIAD KTIVVLTSDH GDLAGHHQMI DKGANAYRQQ NHVPMIVRHP AFRGGKSCRA LTSHLDVAPT LVALTGAPAD KVASVVGPDA KGSSFAHLLA QPERASVHAI RDAALFNYAM LLYYDSEWML AEFRTMRDRG VPPDEMHRRA AALQPDLAQR GAIRSVFDGR YRFSRYFALS NFNEPDTLAD LTAANDLELF DLHTDPDEMH NLAMRPDLHG ALMVEMNAKL NRLIREEVGQ DDLSSLPFKD GRLQFQFRAH A
|
| |