Gene Information |
Locus tag | BTH_II1774 |
Symbol | |
ID | 3844843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2141757 |
End bp | 2146283 |
Gene Length | 4527 bp |
Protein Length | 1508 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637839075 |
Product | serine protease |
Protein accession | YP_439968 |
Protein GI | 83717954 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain; [TIGR02601] autotransporter-associated beta strand repeat |
|
|
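The coordinate and length fields in the Gene Information table are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2141757, 2146283)
print(length)                     # 4527
print(protein_length_aa(length))  # 1508
```

Both values match the Gene Length (4527 bp) and Protein Length (1508 aa) fields of the record.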
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.903556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTGCA CGCGTTACAG AAAAAACAAC GATCATCCGA AACTCGCGCA GTCGATGGGC GTGCTCGTCG CGCTCGTCGG CGCCGGCATC GTCCCGGCGC ACGCGACCTG CACGGCGGCG GGCACGACCG TCACCTGCTC GGGCGCCGCC GACCCACTCG CGCCGAGCTA CGCGAACAGC GGCAACAATC TCGGCGTGAC GGTCAACAGC GGCGCAAGCC TCGGCGTGCT GCTCGGCGTC GGCGGCACCG CCCTGTCGCT GACGGGCAGC GGCGTCACGC TCACCAACAA CGGCACGATC GATCCGACGG TGCTGGGCTT CGGGCTCGGC GTCCTGTCGA GCGGCGCCGT CGTCGGCAAC GCGTCGCCGA GCACGACCAC CGTGACGAAC AACGGCACGA TGAACGGTTC GACCGGCGTG TCGATCAGCG GGCTGACCGG CATGGCGCTC GCGGTCGAGA ACGGCACGGG CGGCGTATCG AACATCACGA ACACGGGAAC GATCGGCTCG ACGCCGCTCG CCGGCGCGAC GCTGCTCGGG CCGGATTCTC CCGTCGTCGC CGCATACGGC GGCGGGCAGG TCAACTTCAG CAACAGCGGC ACGATCACCG GCCGGGTCGC GTTCCAGTCG AACGGCACGG CCGGGCAGGG CAACACGTTC GTCAACTCGG GGACGATCGA CGGCAGCGTG TCGATGGGCA CGAACAGCAC GAACACGTTC ACCGCGATGA CTGGCTCGAC CGTCAGCGCG GCAGGCGGCA CCGGGCTGTC GCTGAACATC GGCGTCGGCT CGCTGACGCT CGGTTTCGCG GCGACGGGCA TCGTCGACGG CGGCGCTGGC GGCAACAATA CGCTCGTGCT GCAGCAGGCG ACAGGCGGTC CGGCGACGGG CGCGATCGCC GTCGACAATT ACATCAACTT CAACCACCTC GACGTGACGA GCGGCGCGTG GACGATCAGC GGTGCGTCGA GCGCGCAGGA CGCGACGCTG TCGGGCGGCG TTGCGATCAT CGGCAACAAC GCGTCGCTCG GCACGGGCGC GATCACCGGC AATGGCGGCG CATTGCAGGC AGGCGCGGCA GGGCTCGACG TGAGCAACAA CGTCGCGCTC GGCGCGGGCG GGCTGACGGT GCAGGGCGCG ACGGGGCTCA CGCTGTCGGG CGCGATCTCG GGTAGCGGCG CGTTGACGAA GAATGACACC GGCACGCTGA CGCTGACGGG CGCGAACACA TACACGGGCG GCACGACGAT CAACGCGGGC ACGCTCGCGA TCGGCGCGGG CGGCAGCCTC GCGGCGACGG GCGCGGTGAA TCTCGCCGGC GCGGGTGCGG CGCTCGATAT CAGCGCGGCC GGCGCGAATC AGACGATCGG CGCGTTGTCG GGCGTGGCGG GAACGAACGT GAACCTCGGC GCGAACGGGC TGACGTTCGG CGACGGCACG AACCAGACGT TCGCCGGCGC GATCGGCGGC ACGGGCGGCG TGACGAAGCA AGGTGCGGGC GTCGAGACGC TGACGGGCGC GAACACGTAC ACGGGCGGCA CGACGATCAA CGCAGGCACG CTCGCGATCG GCGCGGGCGG CAGCCTCGCG GCGACGGGCG CGGTGAATCT CGCCGGCGCG GGTGCGGCGC TCGATATCAG CGCGGCCGGC GCGAATCAGA CGATCGGCGC GCTATCGGGC GTGGCGGGCA CGACGATATC GCTCGGCGCG AACACGCTCG GCTTCGGCAG TGCGGCGAAC CAGACGTTCG GCGGCAGCAT CGCGGGCACG GGCGGGATCG TGAAGAACGG TACGGGCACC 
GAGACGCTGA CGGGCGCGAA CACGTACACG GGCGGCACGA CGGTCAACGC CGGTACGCTC GCGCTCGGCG CGGGCGGCGG CCTGTCCGGC TCGACGACGG TGAATCTCGC GGCCGCGGGC GCGGGCTTCG ACATCAGCGG CGCGACGGGC AACCAGACGA TCGGCGGGCT GTCCGGCGCG GCGGGCACGA CGGTTGCGCT GGGCGGCAAT TCGCTGACGC TCGCCGGCAG CGGCAGCGCG ACGTTCGGCG GCACGATCGG CGGCACGGGT GGGTTGACGT TCGCTGGCAC GGGCACGCAG GCGCTCACTG GCAACAACAC TTACTCGGGC GGCACGACGC TCGCGGGCGG CACCGTCGCG CTCGGCAGCG GCGGCGCGCT GGGCACGGGC GCGGTGACGG TTGCCGCGCC GACGACGATC GACACGACTT CCGCGGTGAA CCTGTCGAAT GCGGTTGCGT TGAACGCGAC CGCGACGGTT GGCGGCACGC AGAGCCTGAC GCTGTCGGGC GCCGTGTCCG GCCCGGGCGG CGTCGTGATG AACGGCTCGT CGACGCTGAC GCTCGGCGGC GCGAACACCT ATGCGGGCGG CACGACGGTC AACGCGGGCA TGGTCGTCGT CGGCAACGGC AGCGCGCTCG GCACGGGCGG CCTCACGGTG AACGGCGGCG GCGTGTCGCT GGGCGGCTCG AGTGTGACGC TGCCGACGCT GAACGGCGCG GCGGGCGGCA CGATCGACAC CGGCGCCGGC AGTCTCGCCG TGACGGGCGG CGGCAGCTTC GGCGGCGCGC TGACGGGGGG CGGCTCGCTC GCCGTGTCGG GCGGCGCGCC GCTCACGTTG ACGGGCGCGA ACACGTTTAC GGGCGGCACG ACGATCGCGA GCGGCGGCGC GCTGCAGATC GGCAACGGCG GGACGACGGG CAGCCTCGCC GGCAACGTCG CCGACAACGG CGCGCTCGTG TTCAACGAGG CGGCCAATCT CGCGTATGGC GGCGCGATCT CGGGCTCGGG CTTGCTCACG CAGGCGGGCA GCGGCGTGCT GACGCTCACG GGCGCGAGCA CGCTCGCCGG GCCGACGACG GTGGCGGCGG GCACGCTCGC TGTCGACGGC TCGCTCGCGA ATTCGACGGT GACCGTGCAA AACGGCGCGA CGGTCACCGG CACGGGCACG CTCGGCGGGC TCATCGTCGC GAGCGGCGGC ACCGCGTCGC TGCCGCAGCC CGGGCAGGCG CTCAACGTCG CGGGCAACGT GACGTTCGAG GCGGGCTCGA CGCTGCAGGT CGCCGCGAAT CCGCAGCAAA GCGGCAGCCT CGCGGCAACC GGCTCGGCGA CGCTGAACGG CGGCACCGTG CAGGTGCTCG CGAGCCAGGC GAGCTATCAG GCGAACACGA CGTACACGAT CCTGAGCGCG AACGCGGGCG TCGCGGGCCA GTTCGCCGGC GTGAATTCGA CGTACGCGTT CGTCACGCCG ACGCTCGGCT ACGACGCGAA CCACGTGTTC CTGCGGCTCG CGCCAAACGG CAACGCGTTC ACGTCGGTCG CGACGACGCA GAACCAGACG GCCGTGGCGG GCGCGCTCGG CACGCTCGGC GCGGGCAATC CGCTGTTCGA CACCGTGCTC GTCTCCGATG CGCCCACCGC GCGCGGCGCG TTCTCGCAGC TCGACGGCGA ACTGAACGCG AGCCTGCAGA GCATGCTCTT GAGCGACAGC CGCTATGTGC GCGACGCGGT GACGGACCGC GTGCGCCAGG GGCTCGCGCC GGGTTCGGGG CCGCTCGCGG CGCTGTCCGC GGGCGGAGCC GCGCTGTGCG 
ACGACGCGGG CGGCGGCGCG GCGCGTCACG ATGCGATGCC GCCCGAGCGG CGGCTCGGCT CGCGCGACAG TTGCGTCGGC CGCACGCCGT ATCGGCCCGT CGTCTGGGGG CAGGCGTACG GCGGCCGCAG CCGGCTCGCG AGCGACGGCA ACGCGTCGAC GCTCAATCGC AGCATGACGG GCTTCATCGC CGGTGCCGAC GTCGCGCTGA ACGACCGCTG GCGCGCGGGC GCCGCGGCGG GCGTCACGCA CAGCTCGCTC GACAACGACC TGAATGCGTC GGCATCGCTG AACAGCTACT ACGTCGCGCT CTACGGCGGC GCGCAGTACG GCGCGTGGGG CGTGCGCGGC GGCGCGGTGT ACACGTGGTA CCGGATCAAC GCCGACCGCT CGCCCGCGTT CGCGAACTTC CGCGATCACG ATTCGGCCGG CTACGACGCG AACTCGGGCC AGGTGTTCGG CGAAGTCGGC TATGCGATTC CGGTCGGACG CTTCGCGCTC GAGCCGTTCG CCGGGCTCGC GTACGTGAGC CTGCACACCG ACGGCTATCA GGAAAGCGGC GGCGCGGCCG CGCTGAAGAG CGGCGCGCAG ACGAGCAACG TCGCGTTCTC GACGCTCGGC GTGCGCGCGG CGACGGCGCT CGACGTGCTC GCGAAGGGCA CGCTGAGCGC GCACGCGATG GCCGGCTGGC GGCATGCGTT CGGCAGCGCG CGGCCGACGT CGACGCTCGC GTTCGCGCGC GGCGGCGCGT CGTTCCAGGT CGCGGGCGTG CCGATCGCGC GCGACAGCGC GGTGCTCGAG CTCGGCATCG ACGCGAGCGT CACGAAGAAC CTGACGCTCG GCGTGTCGTA CAGCGGGCAG TACGGCAGCG GCGTGCGCGA CAACGCGGTG CTCGGCAACG CGCTGTGGCG GTTCTGA
|
Protein sequence | MSCTRYRKNN DHPKLAQSMG VLVALVGAGI VPAHATCTAA GTTVTCSGAA DPLAPSYANS GNNLGVTVNS GASLGVLLGV GGTALSLTGS GVTLTNNGTI DPTVLGFGLG VLSSGAVVGN ASPSTTTVTN NGTMNGSTGV SISGLTGMAL AVENGTGGVS NITNTGTIGS TPLAGATLLG PDSPVVAAYG GGQVNFSNSG TITGRVAFQS NGTAGQGNTF VNSGTIDGSV SMGTNSTNTF TAMTGSTVSA AGGTGLSLNI GVGSLTLGFA ATGIVDGGAG GNNTLVLQQA TGGPATGAIA VDNYINFNHL DVTSGAWTIS GASSAQDATL SGGVAIIGNN ASLGTGAITG NGGALQAGAA GLDVSNNVAL GAGGLTVQGA TGLTLSGAIS GSGALTKNDT GTLTLTGANT YTGGTTINAG TLAIGAGGSL AATGAVNLAG AGAALDISAA GANQTIGALS GVAGTNVNLG ANGLTFGDGT NQTFAGAIGG TGGVTKQGAG VETLTGANTY TGGTTINAGT LAIGAGGSLA ATGAVNLAGA GAALDISAAG ANQTIGALSG VAGTTISLGA NTLGFGSAAN QTFGGSIAGT GGIVKNGTGT ETLTGANTYT GGTTVNAGTL ALGAGGGLSG STTVNLAAAG AGFDISGATG NQTIGGLSGA AGTTVALGGN SLTLAGSGSA TFGGTIGGTG GLTFAGTGTQ ALTGNNTYSG GTTLAGGTVA LGSGGALGTG AVTVAAPTTI DTTSAVNLSN AVALNATATV GGTQSLTLSG AVSGPGGVVM NGSSTLTLGG ANTYAGGTTV NAGMVVVGNG SALGTGGLTV NGGGVSLGGS SVTLPTLNGA AGGTIDTGAG SLAVTGGGSF GGALTGGGSL AVSGGAPLTL TGANTFTGGT TIASGGALQI GNGGTTGSLA GNVADNGALV FNEAANLAYG GAISGSGLLT QAGSGVLTLT GASTLAGPTT VAAGTLAVDG SLANSTVTVQ NGATVTGTGT LGGLIVASGG TASLPQPGQA LNVAGNVTFE AGSTLQVAAN PQQSGSLAAT GSATLNGGTV QVLASQASYQ ANTTYTILSA NAGVAGQFAG VNSTYAFVTP TLGYDANHVF LRLAPNGNAF TSVATTQNQT AVAGALGTLG AGNPLFDTVL VSDAPTARGA FSQLDGELNA SLQSMLLSDS RYVRDAVTDR VRQGLAPGSG PLAALSAGGA ALCDDAGGGA ARHDAMPPER RLGSRDSCVG RTPYRPVVWG QAYGGRSRLA SDGNASTLNR SMTGFIAGAD VALNDRWRAG AAAGVTHSSL DNDLNASASL NSYYVALYGG AQYGAWGVRG GAVYTWYRIN ADRSPAFANF RDHDSAGYDA NSGQVFGEVG YAIPVGRFAL EPFAGLAYVS LHTDGYQESG GAAALKSGAQ TSNVAFSTLG VRAATALDVL AKGTLSAHAM AGWRHAFGSA RPTSTLAFAR GGASFQVAGV PIARDSAVLE LGIDASVTKN LTLGVSYSGQ YGSGVRDNAV LGNALWRF
|
| |
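The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so the standard codon table is used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases. The gene sequence is listed 5' to 3' in coding orientation (it begins with ATG), so no reverse complement is applied even though the gene lies on the minus strand.

```python
# Standard codon table in NCBI order (first, second, third base each cycle T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAGCTGCA CGCGTTACAG AAAAAACAAC"
print(translate(first_blocks))  # MSCTRYRKNN
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` should yield the protein followed by `*` for the terminal TGA codon, and `gc_fraction` on the full sequence can be compared against the GC content field.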