Gene Information |
Locus tag | BTH_II1434 |
Symbol | |
ID | 3844533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1694011 |
End bp | 1698672 |
Gene Length | 4662 bp |
Protein Length | 1553 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637838736 |
Product | YD repeat-containing protein |
Protein accession | YP_439630 |
Protein GI | 83717122 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [S] Function unknown |
COG ID | [COG3209] Rhs family protein; [COG4104] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
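The coordinate and length fields above are consistent with one another, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table above.
start_bp = 1694011
end_bp = 1698672

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed Gene Length (4662 bp) and Protein Length (1553 aa).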
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.16383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAG TGCAGAGTAG CGCGCCGCCG GACAGCCGGC AGAAGCTCGC GGCGCTAGCG CAACGAGGCA GCAATGCGGA CGCCGTGCAG ACGGTGTCCA ACATGGGGCT CGCGATCAAC GCGGCGCAAG TCACGGCGGC CGGGACGAGC GCGTGGGCGG CCGGCACGTT CCAGTGCTTC GCCGGGCGCG TGATCGCGCC GCTTGGCGGC GCGATGCTCG GCGGCGCGCT CGCCGAGGCG GTCGGCGCGG ACCGCCCGGT CACATGGGTG CTGGGCAAGA TGGGGCTTCC GGCGGTCGCG AAGCCCGGCA AGGCGCCGGC TCGCGTCGGA CACAAGATCG TCCACGAGAA CGCATTCATC GGCGCGCTGA CCGGCCTGCT GGCGGGGATC GCGGTCGGCG TGGCCATTGC CGCCGCGGCG GCCGCGATCG TGGCGACGGG CGGCGCCGCC GCGGTCGCCA TCGCGGCGGC CGGCCCGTTC GTCGTCGGGT TCGTCTCGGG CGCGGTGGGC GGATTCGTCG GCGCGGCGGT GGCCAAGGGC ATCGGCCATA CCGGCTCGGT GACGGGCGCG ATCGCGGACG GCTCGCCCAA CGTCTCGTTC GAAGGGGCGC GGGTTGCCCG CGTGACCGAC CCGGTGACAT GCAGCAAGGA CCCCGGCACG CCTCCGCCGC AGATCGCGCA GGGCAGCCTG ACCGTATCCG TCAACGGCCT GCCGCTCGCG CGGATCGGCC ACAAGATCAC GTGCTCGGCG GTCATTCAGG AAGGCTGCAC GACGATCAGC GCGGACGAGA CGACCGGCAC GTTCGGCAAG ATCGACGCGA ACGTGTCGCT TCTCGAGCAG TTGGTGCTGA CGGCCACCGA CGTCATCATG ATGCGTTCGG CGACCAAGGA AGGCGGCTTG CTGGACGGCG TGCTGCGCGA GCTGCTCGGC GAGCCGATCG ACATGGCGAC GGGCGATTAC GCCGACTACC GGACGGATTT CACGTGGCCG CACGTGCTGC CGCTCACGAT TTCCCGTGCG TACGCGGGGC GGCAGCCGGT CGAGGGGCTG CTCGGCGACA GGTGGATCAG CAACTGGTCG CAGCGCCTGC GATATCGGCG GCCGGCGGAC GGCCCGGCCA CCGTCACGTT CTTCGACGCG GATGGCCAGC AACTCGTGTA TCCGGTTCCG CACGAGCCGT TCAACGCGAT CAACTTCTGG GCGCCGCACT ACGCGTTGCA CGGCAATCGC GCGCGGGCCG TCGTTTTCGA CGAGCGCTCG CAGCAATCGC TGATTTTCGA GCCGGCGCAT GCCGAGGACG ACGTCGCCCG CCTGACCCGC ATCGAGGACC GCAACGGCAA CACGATCGAC TTCGAGTACA ACGCGCTCGG CCGGCTATGC ACGGTGCGGC ACAGCGGCGG GATGACGCTG TGGGTCACCT GCGATTCCCG CGGGCTGTTG CAGAGCGTGT CGGAGCGGCC GGGCGGGGAG GGCGAACTGG TCCGCTATCG TTTCGACGGC AAGCGCCTCA CCGACGTTCA TAGCCGCTTC CAGGGCGAAT TTCATTTCGG CTATACCGAC GAGGGCTGGC TCAATCACTG GCGCGACAGC GGCGCCACCG AGGTTGCGCT GCGCTATGAC GAGCGGGCAA GGGTGATCGC CACGCGCACG AATACCGCGC TGTACGACGA TCGCTTCGAG TATGACGACG AGCAACGGCG GACGACGTAC ATCGACGCGC TCGGGCACCG GCATCAACGC TGGTTCGATG CGCAGAACCG GCTGATTCGG TCGCAGGACC CGCTGGGCCG GGTGATGCAT 
GCCAGTTTCG ACGAGCGGGG CTGGCTGAGC GCGCGGACGG ACCCGATCGG CCGAGTCAGC CAGTACCGTT ACGACGCGCG CGGTCGCCCC GTGAAAGTCA TCGACGTCTA CGGACGGGAG AGTCGATATC GATGGAATGG GGCCGGGCAG TTGATCGAGC GAACCGATCC GTTCGGCGCG CTGCGCTGGC ATTTCAGCGC AGAGGGCAAT CTGGTGGCGT TCGAGGGGCC GTCAGGCGAG ACGCGTTTTC GATACGACCC GCGTGGCCTG CTCGTCAGCC GGACTGATCC CGATGGCGCG ACGTACGCAT GGCGCCATGA TGAAGCAGGG CGTCCCGACA GATGGACCGA TCCGCTCGGC CGGCACACCC ATCTCGACCG CGATCGATAC GGCCGACTGA GGAGCCGAAC CGATGCGGCC GGTCATCGGA CGGTCTATGG CTATGAGCCG GGCCCGTCCA ATCCGCGCGA AGCGCTTAGC AGCGTGACGT ACCCGGACGG CGCAATCGCG CGCTTTCACT ATGACTCGGA AGGCATGTTG CGGGAGGCGG TGAATCCGCT GGGACATGGC ATCCGCTATA CGTGGGGCGC ATTCGATTTG CTGGCGAGCG TCACGGACCC GTCGGGGGCG GTCACGCAAT ATCATCGCGA TGGCACTGCC CGCCTGACGG GTGTGACGAA CGCGCTGGGG CAGCGATGGA CGTTGGAGCG GGACGCCGCG GGACAAGTGA TCGCCGAAAC GGATTGGCGC GGGCGCTGCA CGCGATATGT GCGTAACCGG CTAGGGCAGG TCACCGAGAA GCATCTGCCG GACGGCGTCG TGCTGCGCTA TGAATATGAC GCATACGACC GGTTGATTTC CCTTGCCGGG CCGCTGCAGA AGCATACGTT CGCTTGGGGA TCGCGCGGGC AACTGACCCG AGCCCAGGTT TGGGAGCGTA GCGATGACGC GGATGCGTGG CGGGCCGACA ACGACGTATG CCTGGAATAC GACGACGCAT TCCGGTTGAT CGAGGAAAGC CAGAGCGGGC AGGCGATCCG TTACGAATAT GACGTGATGG GGCGGCCCGC GTCGCTCGGT ACGCCGAGCG GGCAGACGCG TTGGCAGTAC GATCTCGCCG GCCAGCTCGA CGTGATCGAG AGCAACGGCC ACCAATTCCG CTTCGGCTAC GACGTGCTCG GCCGGGAAAG CCATCGCCGC TATTTGCCGA CGGAGCAACG GCGGAACTGG CAACCGGAAT GGGCGGATCG CTATCCGGAC GGTTTTGCGC AACGGCAGGC GCATGACGCG CGAAGCCAAT TGACCTCGCA GGTATTCGGG GCGCTGCCTT GGCACGACGA GGTCGCGCCC GAAGCCTTTG GCCCGCAGCG CGCCCGGCAT TACGAATGGG ATGTCGCGGG CCACTGCGTC GGCATGCAGG AAGAACGCAA GGGCCTGCCT GTCGAAGCGG GCCGCTGGCG GTACGACGCT CGCGGCCAGA TGGTCGACGC TTACCACGAG CGCACCGAAA GTCGAAGCGC TCGGGAGCGT TACCGATACG ACGCGCTCGG CCATGTGGTC GAGCAGCAGA TCGATGGCGG CGAAATCCGC ACGCATGACT ACCTGGGCGA TCAACTGATC AGTGCGGGGC CGAACGTCTA CAAATACGAC GCGCGTGGGC GCATGGTCGC GCGCACCGAA CTGCGCGACG GCTTCCGTCC GCGCTCATGG CGGTATCGCT GGGACGATTT CGACCGCCTG AGAGAAGTGA CGACACCCGG CGGCGAACGC TGGGCGTATC GCTACGACGC GTTTGGCCGG CGCATCAGCA 
AGGTTTGCAC GCATGGGGGC CGCCGGAACC GTTTGAAGCG TGCCGCCTAT TTGTGGTGCG GAAGCCGAAT GATCGAGGCC TGGCGAACCT ATGACGAGCG CGACGGATCG CGGCACGATA TCCAGCGTTG GCATTATCGG CCGAGCACCC ACATACCGCT TGCTCAGGAA CGGCTGCGCT TCGACGATCA ACCTGATCCG CAAACGAGCG AGTGGTATCC GCTCGCCTGT GATCCCAATG GCGCGCCGCA TACGCTGTAC AGCAGCGACG GCAGGGCGCT GTGGCGCGCG CGGCGGACGG CATGGGGCGA CACGGCCGGC GACGACGGGC GTGATTCGCT TCGTAGCGCG GTGCGAGAGC AATTGCGATT GGGGCATCGG GATAGTGACG AATTCGATCC GCCAGACTGC GAATTGCGAT TCCCGGGGCA GTGGGCAGAT GAGGAAAGCG GGCTGCACTA CAATTTGCAC CGCTACTACG ACCCATCGAC CGGCCAGTAT CTCAGCGCCG ATCCGGTGGG GCTGGCGGGC GGGTTGCGAA CGCACGCTTA CGTGCATGAT CCGATGCAGT GGGGCGATCC GTTCGGTCTG CAGGGATACG ATACGGTGAG AAACCATCGG GCGGGCAACA AGCAGATCGA TTACGACGGT CAACGCTGGA ACGTCCCGAA AGGCAAGAAT CCGACGGAGG TCATTCCGGA ATCGGACCCG ATCGGAAAGA AGCTCCAGGA TGCGACGGAC AACGCTGCTG CCCGCTGGAA CACGTCGAAT CTCACGCCGG CGGAAAGCAA TGCGATCGAA AGCGCTCGAG CATCGGGCGA ATACTGGCGC GCGAACCTGC TTCAGCAGCA GGCCAAGGGG CGCTGGATCG AATCCCAGGT CAAGAGCGAT CCGGACATCG CGGGGATGGA TCTGGATTGG AATCGCGTGG GAGTCGATGC GGTCGATCCG AATACGGGGT TGAGTTATGA CATCATGTCG GGTAGCAAGA GCAACATGGA TGCGCACGCG ATGCGGATGT CCGATGTTGT TTTCAGAATG ATCACTTTCT AG
|
Protein sequence | MGEVQSSAPP DSRQKLAALA QRGSNADAVQ TVSNMGLAIN AAQVTAAGTS AWAAGTFQCF AGRVIAPLGG AMLGGALAEA VGADRPVTWV LGKMGLPAVA KPGKAPARVG HKIVHENAFI GALTGLLAGI AVGVAIAAAA AAIVATGGAA AVAIAAAGPF VVGFVSGAVG GFVGAAVAKG IGHTGSVTGA IADGSPNVSF EGARVARVTD PVTCSKDPGT PPPQIAQGSL TVSVNGLPLA RIGHKITCSA VIQEGCTTIS ADETTGTFGK IDANVSLLEQ LVLTATDVIM MRSATKEGGL LDGVLRELLG EPIDMATGDY ADYRTDFTWP HVLPLTISRA YAGRQPVEGL LGDRWISNWS QRLRYRRPAD GPATVTFFDA DGQQLVYPVP HEPFNAINFW APHYALHGNR ARAVVFDERS QQSLIFEPAH AEDDVARLTR IEDRNGNTID FEYNALGRLC TVRHSGGMTL WVTCDSRGLL QSVSERPGGE GELVRYRFDG KRLTDVHSRF QGEFHFGYTD EGWLNHWRDS GATEVALRYD ERARVIATRT NTALYDDRFE YDDEQRRTTY IDALGHRHQR WFDAQNRLIR SQDPLGRVMH ASFDERGWLS ARTDPIGRVS QYRYDARGRP VKVIDVYGRE SRYRWNGAGQ LIERTDPFGA LRWHFSAEGN LVAFEGPSGE TRFRYDPRGL LVSRTDPDGA TYAWRHDEAG RPDRWTDPLG RHTHLDRDRY GRLRSRTDAA GHRTVYGYEP GPSNPREALS SVTYPDGAIA RFHYDSEGML REAVNPLGHG IRYTWGAFDL LASVTDPSGA VTQYHRDGTA RLTGVTNALG QRWTLERDAA GQVIAETDWR GRCTRYVRNR LGQVTEKHLP DGVVLRYEYD AYDRLISLAG PLQKHTFAWG SRGQLTRAQV WERSDDADAW RADNDVCLEY DDAFRLIEES QSGQAIRYEY DVMGRPASLG TPSGQTRWQY DLAGQLDVIE SNGHQFRFGY DVLGRESHRR YLPTEQRRNW QPEWADRYPD GFAQRQAHDA RSQLTSQVFG ALPWHDEVAP EAFGPQRARH YEWDVAGHCV GMQEERKGLP VEAGRWRYDA RGQMVDAYHE RTESRSARER YRYDALGHVV EQQIDGGEIR THDYLGDQLI SAGPNVYKYD ARGRMVARTE LRDGFRPRSW RYRWDDFDRL REVTTPGGER WAYRYDAFGR RISKVCTHGG RRNRLKRAAY LWCGSRMIEA WRTYDERDGS RHDIQRWHYR PSTHIPLAQE RLRFDDQPDP QTSEWYPLAC DPNGAPHTLY SSDGRALWRA RRTAWGDTAG DDGRDSLRSA VREQLRLGHR DSDEFDPPDC ELRFPGQWAD EESGLHYNLH RYYDPSTGQY LSADPVGLAG GLRTHAYVHD PMQWGDPFGL QGYDTVRNHR AGNKQIDYDG QRWNVPKGKN PTEVIPESDP IGKKLQDATD NAAARWNTSN LTPAESNAIE SARASGEYWR ANLLQQQAKG RWIESQVKSD PDIAGMDLDW NRVGVDAVDP NTGLSYDIMS GSKSNMDAHA MRMSDVVFRM ITF
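The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the method used to produce this record: the helper name `translate` is hypothetical, and it assumes the gene sequence is listed in coding orientation (it begins with ATG) even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may serve as start codons, so the standard assignment string is used here.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; spaces are ignored,
    and a trailing partial codon is dropped."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 21 nucleotides of the gene sequence as listed (blocks kept as printed).
first_residues = translate("ATGGGCGAAG TGCAGAGTAG C")

# Last 15 nucleotides of the gene sequence as listed, including the stop codon.
last_residues = translate("ATGATCACTTTCTAG")

print(first_residues)  # compare with the start of the protein sequence
print(last_residues)   # '*' marks the stop codon
```

The first fragment translates to the opening residues of the protein sequence (MGEVQSS), and the last fragment gives its final four residues (MITF) followed by the stop codon.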