Gene Information |
Locus tag | BamMC406_3454 |
Symbol | |
ID | 6179541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010552 |
Strand | + |
Start bp | 373053 |
End bp | 375002 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641683223 |
Product | sulfatase |
Protein accession | YP_001810137 |
Protein GI | 172062486 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT CCGCTTCGCC CTTGCTTGCC TTTCGTTTCC GTGTGGTCTG CGCCGCGATC GCCGGTGCGA TGTCGCTTGC GTCGTGCGGC GGCGTCGACA GCGATCCGCC GCCGCAGGCC AATGCGACGC CCGTGGCCAA GCGCCCGAAC ATCCTGTACA TCATGGCCGA CGATCTCGGC TATTCCGACA TCCATGCATT CGGCGGCGAG ATCAACACGC CGAACCTCGA CGCGCTCGTC GCGTCGGGCC GCATCCTGTC GAACCATCAC ACCGGCACCG TGTGCGCGAT CACGCGCGCG ATGCTGATTT CCGGCACCGA TCACCACCTC GTCGGCGAAG GCACGATGGG CGTGCCGACC GACGAGCGGC GCGGCCTGCC GGGCTACGAG GGCTACCTGA ACGATCGCGC GCTGTCGTTC GCGCAACTAC TGAAGGACGC CGGCTATCAC ACGTACATCG CGGGCAAGTG GCACATCGGC TCGGGGATCG TCGGCAGCGC GACGGGCAGC GGACAGACGC CGGACCAATG GGGCTTCGAA CGCAGCTACG TGCTGCTCGG CGGCGCCGCG ACGAACCACT TCGCGCACGA GCCGGCCGGC TCGTCGAACT ACACGGAGGA CGGCCGCTAC GTGCAGCCCG GCCAGCCCGG ACAACCGGGC GGCGCGGGCG GCAGTCCGGC CGTGTTCTAC TCGACGGATT TCTATACGCA GAAGCTGATC TCGTACATCG ATTCGAACAA GCGGGACGGC AAGCCGTTCT TTGCATACGC GGCCTACACG TCGCCGCACT GGCCGCTGCA GGTGCCGGAG CCGTGGCTGC ACAAGTACGC GGGCGTGTAC GACGCGGGCT ACGACGCGAT CCGTAACGCG CGCATCGCAC GGCAGAAGGC GCTCGGTCTG ATCCCGGCCG ACTTCAGGCC GTTCGACGGC CTGCCAGAAA CGACGTCGGC GTCGCCGGCA ACGGCGAACA ACGGCACCGC GAACGCGAAG TACATCAGCG CCGTGCATTC GGCCGCCGAC GGCTATACCG ACTACGGCAC CGGCAAGGTC GACAAGCTGT GGGCGAGCCT GAGCCCGGCC GAGCGCCGTG CACAGGCGCG CTACATGGAG ATCTACGCGG GGATGGTCGA GAACCTCGAC TACAACATCG GCCTGCTGAT CCAGCACCTG AAGGACATCG GCGAATACGA CAACACGTTC ATCATGTTCC AGTCGGACAA CGGCGCGGAA GGCTGGCCGA TCGACTCGGG CGCTGACCCG ACGGCGACCG ACACCGCGAA CGGGCAGGAG CCGATCTACT CGACCCTCGG CACCGACAAC GGCAAGCAGA ACGCGCAGCG GCTGCAATAC GGGCTGCGCT GGGCCGAAGT GAGCGCGACG CCGTTCCGGC TCACGAAGGG CTATTCGGGC GAAGGCGGCG TGTCGACGCC GACGATCGTG CGCCTGGCGG GGCAGACGCA GCAGTTGCCG ACGCTGCGCG CGTTCACGCA CGTGACCGAC AACACGGCGA CGTTCCTCGC GCTCGCGGGC GTCACGCCGC CGTCGCAGCC GGCGCCGCCG CTCGTGAACA CGCTGACCGG CATCGACCAG AACAAGGGCA AGGTCGTCTA CAACAACCGC TACGTGTATC CGGTGACGGG CCAGTCGCTG CTGCCGGTGC TGACCGGCTC GGCGACGGGT GAAGTGCATA CGGCGCCGTT CGGCGACGAA GCGTACGGCC GCGCGTACCT GCGCAGCGCC GACGGTCGCT GGAAGGCGTT GTGGACCGAG CCGCCGCTCG GCCCGCTCGA TGGTCACTGG 
CAGCTGTACG ACCTCGCTGC GGATCGCGGC GAGACGACCG ACGTGTCCGC GCAGAACCCG TCGGTGATCG GCACGCTGGT CGACCAGTGG AAGACCTACA TGAGCAACGT CGGCGGCGTC GAGCCGTTGC GTCCGCGCGG CTACTACTGA
|
Protein sequence | MKKSASPLLA FRFRVVCAAI AGAMSLASCG GVDSDPPPQA NATPVAKRPN ILYIMADDLG YSDIHAFGGE INTPNLDALV ASGRILSNHH TGTVCAITRA MLISGTDHHL VGEGTMGVPT DERRGLPGYE GYLNDRALSF AQLLKDAGYH TYIAGKWHIG SGIVGSATGS GQTPDQWGFE RSYVLLGGAA TNHFAHEPAG SSNYTEDGRY VQPGQPGQPG GAGGSPAVFY STDFYTQKLI SYIDSNKRDG KPFFAYAAYT SPHWPLQVPE PWLHKYAGVY DAGYDAIRNA RIARQKALGL IPADFRPFDG LPETTSASPA TANNGTANAK YISAVHSAAD GYTDYGTGKV DKLWASLSPA ERRAQARYME IYAGMVENLD YNIGLLIQHL KDIGEYDNTF IMFQSDNGAE GWPIDSGADP TATDTANGQE PIYSTLGTDN GKQNAQRLQY GLRWAEVSAT PFRLTKGYSG EGGVSTPTIV RLAGQTQQLP TLRAFTHVTD NTATFLALAG VTPPSQPAPP LVNTLTGIDQ NKGKVVYNNR YVYPVTGQSL LPVLTGSATG EVHTAPFGDE AYGRAYLRSA DGRWKALWTE PPLGPLDGHW QLYDLAADRG ETTDVSAQNP SVIGTLVDQW KTYMSNVGGV EPLRPRGYY
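
The length fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names (`gene_length`, `protein_length`, `clean_sequence`, `gc_fraction`) are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks used to print 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(blocked)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length = gene_length(373053, 375002)
    print(length, protein_length(length))
    # Only the first two blocks of the gene sequence are used here;
    # paste the full Gene sequence field to compare with the GC content row.
    print(gc_fraction("ATGAAAAAGT CCGCTTCGCC"))
```

With the coordinates listed above, this gives 1950 bp and 649 aa, which agrees with the Gene Length and Protein Length rows.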