Gene Information |
Locus tag | Bcep18194_B1578 |
Symbol | |
ID | 3753343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | - |
Start bp | 1787577 |
End bp | 1792247 |
Gene Length | 4671 bp |
Protein Length | 1556 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637766427 |
Product | Rhs family protein |
Protein accession | YP_372336 |
Protein GI | 78062428 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
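
The length fields in this record can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1787577, 1792247)
print(length)                     # 4671
print(protein_length_aa(length))  # 1556
```

Both values agree with the Gene Length and Protein Length fields above, which is consistent with the inclusive-coordinate assumption.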
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.883586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.184607 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAT TTGAAAGCCG GTTGACGCGC GCGTCGGCGC CAGCCGAGAG TCATTCAACG CCGTCGGAGT CGAAGGCCGA CACCGCATGT GATTCGCTGC TCGATACGGT CAAGTCGACG TTCGATCCGT TCAAGGAGAC GTTCTCGTCG GAAGGCGGCA CGCTTCACCA TGTCAGCGAG GCCGTGAATT CCCTGGCGTC ACTGCAGGGC ATGCCGTCGC AGTTGCTCAA CACTGGCATC GCGCAGATCC CGCTGCTCGA CAAGATGCCG GGGATGCCGG CCGCCACCAT CGGTGTTCCG CATCTCGGTA CACCGCACGC GCATAGTCAT CCGCCGAGCA GCGGTTTTCC GTTGCCGAGT GTCGGGGCAA CCATAGGCAG TGGGTGTTTG AGCGTACTGA TTGGGGGTAT TCCTGCAGCA CGTGTACTGG ATATCGGTAT TGCGCCGACG TGCGGCGGGA TAACGCCGTT CTTCGATATT CAGACGGGCT CGAGCAATAC GTTCATCGGC GGGATGCGTG CTGCGCGCAT GGGTATCGAC ATGACGCGGC ATTGCAATCC GATGGGGCAT GTCGGGCACT CGGGTGGCGA AGCAGCGAGC GCTGCCGAGA AAGGCGAGGA GGTGGCGAGC GAGGCTGCTC AGGTGTCCGG CCGGGCGAAG TTGCTGGGGC GGGCAGGGAA AGCGTGGTCG GTTGGTAATG CGGCGGTCGG GCCAGCGTCA GGGGTGGCGA CTGCTGCCGA TGACGCATCG CAAGGAGAAG TCGCCGCCGC TGCGATGATG GCCGCGCAAA CAGCGGCCGA TCTCGCGTTC ATGGCGCTCA GCAACCTGAT GGGCAAGGAT CCCGGCATCG AACCGAGTAT GGGGACGCTG CTCGCCGGGG ATCCGACGGT GCTGATCGGC GGATTTCCGC TGCCGGACTC GCAGATGATG TGGCACGGGG CAAAGCACGG GATCGGGAAG AAGGTCAGGC CGAAGTTGCC GAAGTGGGCG CAAGAGTTGG CGTGCGAATT CAAGGGCGAG CCGATCAGTG CCGTCACTGG AGAGGTGAAG AATGATTTCA CCGACTACGA AACCGACGAA ACGCTGCCTT TCATATGGGG ACGCCACTAC TGCAGTGGGT GGAATGAGCG TAACGGAGCG CAGGGATACG GATTCCGACA TGCGTGGCAG CGCGAACTGC AGTTGCTGCG CACGCGCGCA ATCTATACCG ATCCGCGGGG CACGAAATAC ACGTTCAGCA GAAACGCAGA CGGTACATAT GACGGTTGCT GCCAAGGCTA CTCGCTCGAG CAGATCGACA GTCGAAGGTT CTTCATCCGG CACGAGGCCG CTGGCGACGC AGAGTTCGAG CGGGACGCCG AAACTGCACT TTCAGTCCGC TGCATCAGCC ATCAGCGTGA TCGCACACAA AGCCTTCTTC GCTGGCGTCC AGACGGACAT ATCGGGAAGA TCGTGCAGAC CGACCACGAC GGCAACGTTC GCAGAACCGT CTCGTTACGG TACGACCAGT CCGGGCGGAT TGTTGAAGTC GTGCTGACAG ACGTCGATGG TCGGGACACT CGGATCGCAC GGTATGCATA CGACACCGAG AATTGCCTGA CTTCGTATCG CAATGCGCTC GATGCGATTT CGACCTATGA ATACGATGCA CAGCGGCGTA TGGCCCGACT CGCTGGTGCC AACGGGTATT CGTTCCTTTA CTGCTACGAC AGCGAAGGAC GTTGCGTGGA AAGCGCTGGG CAGGACCGGA TGTGGCACGT CAGATTCCAG TACCGACCCG GGCGGACGAT TGTCACCGAG 
GGAGATGGTG GACGATGGAC TATCCTGTAC AACGAGATGG GAACCATCAC CCATGTCGTC GATCCTTACG GTGGCATCGC CGAGTATGTG CTCGATCCAT CCGGGCGAAT CGTGGAGGAA ATTGACTCGG GTGGGCGGAC GTTGCGATGG CTTTACGACG CTAGAGCAGG GACCATCGGC CGCCTCGATC GATTCGGCAA CCTCTGGCCG AACAAGGACG AACTGCCGAA CTTGCCCAAC CCGCTGGCGC ACCATGTGCC GAACGCCATG CGCGGCCTGA TCTGGGGAGA ATTCAACGTC GACAATCTCA CTGACACGGT GTTGTTGCCG CAAGAAATTG ATCACTGGGC TTGGGGCTCT GCCACGGATG CAGTTGCGCG ATGGCCCGAT GAACGGCGAC AGCAACGCGA TGTTGCCGGC AGGGTGATTG CACGCACCAA TATCAGCGGG CATACCGAGC GCTTCGAATA CGATGCTGAC GATAACCTGC TGCTGCGATT CGATCAGGAT GGCGCCGCGA CCCGGTATAC CAGATGCTCA TGGAATCTGC TGGCCAGCGA AGAGAATGCT CTTGGCGAGA CGATCCGCTA TCGCTATACC TCCCGTCGGG ACGTTGCGTC TGTCATCGAC GCGAATGGCA ACGAATCCGC TTACAGCTAC GACTTCAAGA AGCGAATGAC GAGCGTGACA CGACATGGCC GGTTGCGGGA GAGCTATGTC TACGACGGCG GCGACCGCTT GATCGAGAAG CGTGACGGCG CGGGCAAGTG GCTATTAAGG TTCGAAATCG GCGAGAACGG CCTGCCGCGC GAGCGAACCC TGGCGTCAGG TAGTAAGCAT GTCTACGAAT ATGATCGCTT CGGCCATCTC ACGAAGGCCT CTACGGACGC GCACGAGATC ACGCGAACGT TTGACAATCT AGGAAGGCGC ACATCGGACA AATGCGACGG CTTGGGCGTC CAGCACGAAT ACGATGGCCA CCGGTTGGGG CGAACGACCT ACCTCGAGCG GTTTGAGGTG GTCTACACAT ATACTGCAAG CGGCCAATCC ATCCGAACGC CAGGCGGCCG CACACACTGC GTGCAACAGG GCAAGGACGG CCGAGTGTTG CTGCGGTTGG CCAACGGTAC GCAGGCATGC TACATGTTCG ACGACGTCGG GCGTTGCACT GGACGCATCA CCTGGCGGAC CGATTCTCCC GCGGCCTTTC ATCGTGTCAC GTATGAATAC AGCGGCGCCG GAGAGCTTCG CCGTATCAGC GACAACCAAA AGGGGGATAC GCAATACCAA TACGACCGCG CACATCGTCT GATTGGCGAG ATGAGCAGCG GCTGGCAAGC TTGCCGCTAC GAGTACGATC CCGCCGGTAA TCTGTTGGCT TTGCCGACCT GCCCGATGAT GCAATATACA GGCGGCAACC GGCTATCAAC CACGTCGCGC GGTCAATACA CGTACGACGA ACGCAACCAT CTGTCCGAAG AGATCGCACA CGATGGCCGG CGTACCACGT ATCGCTACGA CAGCATGGAC TTGCTTGTCC GTGTCGAGTG GTCTGACCGG CACGATGCCT GGACAGCGGA ATACGATGGT TTATGCCGTC AAGTAATCGC AGGGACAGAC AATGACTCCA CGGTGTATTA CTGGGACGAA GATCGCTTGG CCGCACAGGT TGATCCCGGT GGCAAGGTAC GTGTCTTCGT TTACGTCGAC GAGAAGTCGC TCGTGCCGTT CATGTTCATC GACTATCCAA GCATTGATTC GCCTGCAACC TCGGGAAGCG AATATTTCGT GCTATTCAAT CAGGTTGGCA 
TGCCTGAACG CATCGAAGAC GTCAGCGGCA GAGCAGTGTG GCTCGCACGG AAAATCCATT CGTACGGGGT TATCGAAGTT GCTGAGGGAA ATTCACTTGA ATACGATGTG CGTTGGCCCG GACATTTGTT TCAGCGTGAA ACAGGCTTGC ACGTCAATCG ATTCCGATCA TACAACCCGA TGCTCGGACG ATATCTCCAA TCGGATCCGG TTGGGCATGC AGGGGGAGGG AATCTCTATA CATATTCTGC CAATCCGGTC GTGGAAGTAG ACGTACTCGG CCTGAATGCG CACGTTCATG GGGAGGATGC ATCCGAGCCT TCGCCCGAAG GCCCTGCAGT CGCCGGCCGA ACGCAGGAGC TGACCAAGGC GGAACTGAAT CACCTCGCAA AAATCGAAAG ACGGACAGCG GAAAGAGATG CAGCTATCGA GAGAATGAAA AGAGAGATTG AGAATCTCGA ACAGGAAAAA AAGAAATTAT CATTAGATGA AAAGAACTGG TACAGCTATC CCGAAACACC CGAAACAAAA AGACTTGACG CAATTAGGGA AAGGGTATTG GAGCTCAATG ATAAAATTAA AAATTACCCG TTGGTTCGTG ATAAAACGAC GGAATTTCCA AAAGGATACT TGCAGAGCAC GCATATCGAG ATGATCAAGC GATATACTCG TGAAGGCAGG GAAGGCAAAC TTCCGATCGA TCCTGACAAG AACCCTGCAG GGCTGGTCGC AATGCGTAGC AAATTGACGT GGGTTGATGC GGATAACCAT CCGATCCCTT ACTACACAAC CAATGACAAA GGCGAACAGG TTGTGAATGT AACATACGAC CATCAGGTAT CGGAGGCCGA AATGTACAAC AGGGGCGCGA CCATCGGAGG CAAGCACTAT CCACCCGGCT GTGATTCCAA TCCAGAGAAT CGCAAGAAAT TTCATAACGA CACTGATAAT CTCGCTGTGA TGTCCCAGTC GGAAAATGCT TCGAAAGGAA GTAGTTCGAG CGAGACAAAT GGCAATGAGC GGTTCAGTTA CGACATACCC ACTGGAGATC ACTACTCATG A
|
Protein sequence | MSEFESRLTR ASAPAESHST PSESKADTAC DSLLDTVKST FDPFKETFSS EGGTLHHVSE AVNSLASLQG MPSQLLNTGI AQIPLLDKMP GMPAATIGVP HLGTPHAHSH PPSSGFPLPS VGATIGSGCL SVLIGGIPAA RVLDIGIAPT CGGITPFFDI QTGSSNTFIG GMRAARMGID MTRHCNPMGH VGHSGGEAAS AAEKGEEVAS EAAQVSGRAK LLGRAGKAWS VGNAAVGPAS GVATAADDAS QGEVAAAAMM AAQTAADLAF MALSNLMGKD PGIEPSMGTL LAGDPTVLIG GFPLPDSQMM WHGAKHGIGK KVRPKLPKWA QELACEFKGE PISAVTGEVK NDFTDYETDE TLPFIWGRHY CSGWNERNGA QGYGFRHAWQ RELQLLRTRA IYTDPRGTKY TFSRNADGTY DGCCQGYSLE QIDSRRFFIR HEAAGDAEFE RDAETALSVR CISHQRDRTQ SLLRWRPDGH IGKIVQTDHD GNVRRTVSLR YDQSGRIVEV VLTDVDGRDT RIARYAYDTE NCLTSYRNAL DAISTYEYDA QRRMARLAGA NGYSFLYCYD SEGRCVESAG QDRMWHVRFQ YRPGRTIVTE GDGGRWTILY NEMGTITHVV DPYGGIAEYV LDPSGRIVEE IDSGGRTLRW LYDARAGTIG RLDRFGNLWP NKDELPNLPN PLAHHVPNAM RGLIWGEFNV DNLTDTVLLP QEIDHWAWGS ATDAVARWPD ERRQQRDVAG RVIARTNISG HTERFEYDAD DNLLLRFDQD GAATRYTRCS WNLLASEENA LGETIRYRYT SRRDVASVID ANGNESAYSY DFKKRMTSVT RHGRLRESYV YDGGDRLIEK RDGAGKWLLR FEIGENGLPR ERTLASGSKH VYEYDRFGHL TKASTDAHEI TRTFDNLGRR TSDKCDGLGV QHEYDGHRLG RTTYLERFEV VYTYTASGQS IRTPGGRTHC VQQGKDGRVL LRLANGTQAC YMFDDVGRCT GRITWRTDSP AAFHRVTYEY SGAGELRRIS DNQKGDTQYQ YDRAHRLIGE MSSGWQACRY EYDPAGNLLA LPTCPMMQYT GGNRLSTTSR GQYTYDERNH LSEEIAHDGR RTTYRYDSMD LLVRVEWSDR HDAWTAEYDG LCRQVIAGTD NDSTVYYWDE DRLAAQVDPG GKVRVFVYVD EKSLVPFMFI DYPSIDSPAT SGSEYFVLFN QVGMPERIED VSGRAVWLAR KIHSYGVIEV AEGNSLEYDV RWPGHLFQRE TGLHVNRFRS YNPMLGRYLQ SDPVGHAGGG NLYTYSANPV VEVDVLGLNA HVHGEDASEP SPEGPAVAGR TQELTKAELN HLAKIERRTA ERDAAIERMK REIENLEQEK KKLSLDEKNW YSYPETPETK RLDAIRERVL ELNDKIKNYP LVRDKTTEFP KGYLQSTHIE MIKRYTREGR EGKLPIDPDK NPAGLVAMRS KLTWVDADNH PIPYYTTNDK GEQVVNVTYD HQVSEAEMYN RGATIGGKHY PPGCDSNPEN RKKFHNDTDN LAVMSQSENA SKGSSSSETN GNERFSYDIP TGDHYS
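
The gene sequence above is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly to check it against the protein sequence. The sketch below is illustrative only: it strips the 10-base grouping, translates codon by codon, and stops at the first stop codon. It uses the standard amino acid assignments, which translation table 11 shares. Table 11 differs in its permitted start codons, which this sketch does not handle. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence in this record.
print(translate("ATGAGTGAAT TTGAAAGCCG GTTGACGCGC"))  # MSEFESRLTR
print(translate("ACTGGAGATC ACTACTCATG A"))           # TGDHYS
```

The two fragments reproduce the first ten and last six residues of the protein sequence. Passing the full gene sequence to `translate` is a way to compare the complete translation.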