Gene Information |
Locus tag | BTH_II1417 |
Symbol | |
ID | 3844789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1670588 |
End bp | 1672270 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637838719 |
Product | serine metalloprotease MrpA |
Protein accession | YP_439613 |
Protein GI | 83717882 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
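The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both agree with the reported values. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 1670588
end_bp = 1672270

# With inclusive coordinates both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1683 0 560
```

The result matches the Gene Length (1683 bp) and Protein Length (560 aa) fields.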
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.620558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AACATAGCAA TCCTGAACAA ACGCGGCGCG GCCAGGTCCG CGCCGTCGCC GGCATTCTCT CCATGTCCGT GCTCGTTCCG CTCGCCGGTT GCGGCGGCGG CGGCGACGGA GGCGCAAGCG GCACGCCGTC CGCAGCCGCC CAGCCGACGG CGCCCACGCC GGCTCCGGCG CCCGCGCCAA GTTCGGGCTC GTCGCAATCC GCGAATTCGT CCACGTCGAC GGCGGCCTGC CCCGTCACGC AGGCCGCATC GACCGCCGCA AGCGAATCGC TCGCCCCCAG CACCGTCGCG TACGACGCGC CCGTCGACCA TCTGATCGTC AAGCTGCAAA GCGCACCGGC GACGAGCGCA TCGGGCGCGC GCATCATGGC CGCGACGAAC GACGCGGCCC GACTCGATTC GCTGATTCAG CGGGTGATGT CGCAATGGAA CGCGAAGAGC GGCAACGTTC GCTCGTATGC GCAGAACGTC GCGCCGATGA ACGCGGTGCA GGTCGAGCGG ACGATGTCGG ACGGCGCCGC GCTGCTCGCG CTCGGACAAA AGATGAGCGC GGATAACGCC GGCGCTCTCG CGCAAACGTT CGCGGCCGAT CCGGACGTCG CGTACGCGGA GCCCGACCGG CGCGTGTTCG CCCGCACGGT GGCGACCGAT CCGAGCTACT CGCAGCAATG GAATTACTTC GATCCGACGG CCGGCATCGA CCTGCCGAAC GCGTGGGACG TGACGACCGG CCTGCCGAGC GTCGTCACCG CGGTGCTCGA CACCGGCTAC CGTCCGCATC CGGACATCAT CGCGAACCTG CTGCCCGGCT ACGACTTCAT CTCCGACATC AACACCGGCA ACAACGGCCA TACCCGCAGC CCGGACGCGA CCGACCCGGG CGACTGGGTC ACGCAGCAGG AACTGACCGA TCCGTCGAGC CCGTTCTACC AATGCGCGAG CGCGCCGTCG AACAGCAGCT GGCACGGCAC GCAGGTCGCC GGCATCATCG GCGCCGCCGC GAACAACGGC ATCGGCATCG CGGGCGTCAA CTGGTACGGC AAGATCCTGC CCGTGCGCGT GCTCGGCAAG TGCGGCGGCA CGACGAGCGA TATCGCCGAC GCGATGCGCT GGGCGGCGGG CATCCCTGTC GCGGGCGCCC CGACGAACCT CACGCCCGCG AAGGTGATCA ACCTGAGCCT CGGCGGCACC GGCCCGTGCG GCGATACGTT CCAGCAAGCG ATCAACGACG TGATCGCGCG CGGCGCCACC GTCGTCGTCT CGGCCGGCAA CGACGGCCAG GCGACGACGC TGGACCGCCC GGCGAACTGC AAGGGCGTGA TCTCGGTGGG CGCGACCGAC AGCACGGGCC AGCGGGCGTG GTATAGCAAC TTCGGCTCGG ACATCACGTT GAGCGCGCCG GGCTCGAACA TCCTGTCGAC GACCAATGCG GGCACGACGG TGCCGACCAC CGACGCGTAC GGCACCCACA GCGGCACGAG CCTCGCCGCG CCGCAGGTGG CGGGCGTCGC CGCGCTGATG CTCTCGGCCA ACCCGAACCT GACGCCCGCG CAGATCGCGC AGAAGCTCGC GAGCACCGCG CGGCCATCGC CGGCGACGAC GTCCTGCCTC GCCCGCGCAC CGGGTGCGGG CATCCTCGAT GCCGGCACGG TGGTTGCGTC CGCAACGAAA TAG
Protein sequence | MNKKHSNPEQ TRRGQVRAVA GILSMSVLVP LAGCGGGGDG GASGTPSAAA QPTAPTPAPA PAPSSGSSQS ANSSTSTAAC PVTQAASTAA SESLAPSTVA YDAPVDHLIV KLQSAPATSA SGARIMAATN DAARLDSLIQ RVMSQWNAKS GNVRSYAQNV APMNAVQVER TMSDGAALLA LGQKMSADNA GALAQTFAAD PDVAYAEPDR RVFARTVATD PSYSQQWNYF DPTAGIDLPN AWDVTTGLPS VVTAVLDTGY RPHPDIIANL LPGYDFISDI NTGNNGHTRS PDATDPGDWV TQQELTDPSS PFYQCASAPS NSSWHGTQVA GIIGAAANNG IGIAGVNWYG KILPVRVLGK CGGTTSDIAD AMRWAAGIPV AGAPTNLTPA KVINLSLGGT GPCGDTFQQA INDVIARGAT VVVSAGNDGQ ATTLDRPANC KGVISVGATD STGQRAWYSN FGSDITLSAP GSNILSTTNA GTTVPTTDAY GTHSGTSLAA PQVAGVAALM LSANPNLTPA QIAQKLASTA RPSPATTSCL ARAPGAGILD AGTVVASATK
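The relationship between the two sequence fields can be illustrated with a short sketch that strips the 10-character block spacing, computes the GC fraction, and translates the reading frame. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon lookup uses the standard amino acid assignments, which translation table 11 shares for elongation; table 11 differs only in the start codons it permits, which does not matter here because the gene begins with ATG. Only the first three blocks of the gene sequence are used as input; the full field can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (sequence must be non-empty)."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
prefix = "ATGAACAAGA AACATAGCAA TCCTGAACAA"
print(translate(prefix))  # MNKKHSNPEQ
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the Protein sequence field. Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field.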