Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_C6598 |
Symbol | |
ID | 3733915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | + |
Start bp | 93189 |
End bp | 96365 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637760300 |
Product | hypothetical protein |
Protein accession | YP_366292 |
Protein GI | 78059717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000923882 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTTAC GGTCGGCGCT CCATAGACAA ATGGACGTGA GAATCCTGAG TTGTACGCAA ATAGATCACA GAGGCCAAAT GCAGTTTCGC GCAAAACAGA TTGCTCCTCC TAAGGAGTGG GGAACATTTG AGGATTTGTG CCACGCCTTA TTCAAGCGAG TATGGCGGGA TCCGTTCGCG CAAAAAAATG GGCGGAGGGG GCAGGCTCAG CACGGTGTCG ATGTCTTCGG CTCTCCCGGC GGCGACCGCT CATCGTACTG GGGTGTTCAG TGCAAGGGAA AAGATTGCAA CTACGGCAGT AAGGCTGAAT GGTCCGAAGT ATTGTTGGAA GTTGCCAAAG CGGAAAAATT CTCCCCTCGG CTGGAAAAGT GGATATTTGC TACCACAGCT CCGACCGATG CGCTTTTGCA AAAGGCTGCT CGGGAACTGT CAGTCGCGCG TAGGGCAGAA GGCCTGTTCA GTGTCGACGT ACTGGGTTGG GAGGAAATCC AGGCGTTGAT GGCTGACGTT CCTGGAGTGA TAACTGAGTT TTATCCCGAG CACGCCGATC ACCTACCGCA GGTGATTGAG GCACTGCGTG CCGTGCCTTT GCTCGAAGCA AAGGTCGTGG ACCTAGTCGA AAGGATCGAG GCGACGCTAC TTAAGCCTCC GAATCTCCAT GGCAGCGCAG TTTGGGAGGC AGTGACATTT GATGGCGACC GCGGATTGGG GCCAGCGCTA ATGGGCTATG CGCTGGGGCC GTCTGACGCC GTGGCCTGTC CGTGTTTGAT CGAGGTGGGC ACTGTTCAGG CGCAACTGAG GGTCGCCTAT TCAGCACGTC TCATTGGCGA GCCTGGCGCA GGAAAATCGA TCTGCTCGTA TCAAGCTGCG AGGGAGCTTG CGAGTGGCGG CTTCGAAGTG CTGCGCTTGC TTGATCCCCA AGCCGATAGT ATCGCGTTGG AGGCGGTGTT GCCCGACAAA CCCCGTCTGT ATCTTATTGA CGATGCTCAC CTCCTCAAAC CTCACATTCT CAGTCGAATC GAGAATCAGG CCTGTCCGTC TCGTCTCGTC CTTTCAACGC ACAACGTTGT CGGGCGCCTC GGGCATCGAG GAGCGATTAC ACTCGATGCA AAGCGTGCGG TGAAGACAAT CGCAGCTGCG CTACGAGCTG ATCTGCCAAA GACGCTAGAA GCTGTACGCC TCGCCGACGA CGACGTGGGC GAGCGCATGT TGGACGCTGA TCTCGGTGAG CGCCTGGAAC ATGCGGAAGC CGTTGCGGAC CGTCCGTGGC AATTCTGTTT CGTGCTTGGG GGTGGCTGGC GTCGGTCCAA GCAAGCGGCA GATTCCGCAC GTCTAGCGGC GGCCGATCTC GTCTTGGCAG CGGTCGCGAT GCGCCAGATG GTGTCTCGCG ATGCCCGTGC GATGCCGGCA GAGATCATGG AGGTTTGTGA GCGCGTGGGC ATCAACTCGA GCGTAGTTGA GCAAGGATTG GAGTGGTTGG AGAGGGAAAG ACTAATAGTC GGCGCCACGG ATTGTCGGAC ACCGCACCAG CGCTTCGCGT CGGTTGTGCT CAAACGTATT TTGGAGGGGC AGGATACGAG CGGGCGCGAC AAGATTGCCA GAATGATCGA AAGTGTATTG TGCGATTCCC ACTATCCATT TGCCGGTTTG CGGGTCCTGA TACATGAGCT TAGCTTCGGC GATCGCTATA GTTGGACGCA TCTTCTCGGG CAACCAGCGG TCGAGGCTGC GGTGGCACGC TGCTGGATAG CCGCAGGTTC GGATCGCAAC TTCGCGGCGC TCGCCCTTTC AGACTTGTGG GATTTCATGG GGGGCGGGGC AGCCGCCGTC GTGGGCCCGC ACGTGAGCAC ACTGGCAAAA TGGATTTCCA GCCCAAGCGA CAGCGCTTAC GGACTCGGTC ACCTCCTGAA CGCTCTCACT CGGATTAATC AAGAGGTTGC CGAGAAAGTC ATCGCAAAGG TCGATTCAGT AGCAATTGCA GCGGCTTATT CCAACGCCAA TCCCGAGACG GCATATGGCT TGGCCGATCT TCTGTGTGCT GTTGCCAACG TTAAGGTCGA TGATGTCAAT GCCAAGATCC GGGCCGCTAT TGATCGAAAC AGACTGCGTG AGCTTGCTAA GCACGAGGGC TTTTTGGAAG ATGCTTTCGT CTTCTCGAAA TTCTGCGCTT CGGTGGTTTG GTGGGAGGAG GATCTGGCGC TCGAGATGGC CGAACTTTTC GTCCCAACGG CACAGCAAGT CCTGTCTAAG GATCCCGTTG AAGGCTTTGA ACAGCTCAGT CAGGATTTCG CTTCAACGGT GCTTCGTGTA TTCGATGTGC TGGGAGTATA TGTTGGAAAG TGGAGACCGA CTCGCCGCCA GTGGGCGATC GCGCGGCGAA TCTGCGAGAA AATTGATCCC AAGCAGGCTG CGGAACATAT CTCGACAGTG CGTCCTCGGA ATTTTCAGTC AGCGGGTTTC TTTCTTCATT TTCTCTTCCA GTCGGCACCG CGGAAATACG AGACGGTCTT ACAACAGATC GACTGGGACA AGCTTGATCT AGCTATCGGT GACGATTGGA GGGATATGCC CCATGATACC GAGGTGCTTT TGAGTACGCT CTATTCGGGT ACGCTTGCGC AGCAACTGGT ACAGAATTTC ATCTCCACAA GGTCAAACCG GATCGTACAT TTTCCGCCAC GCTTACTGCT GATGGCGCCC GAGGTCGGCT TCACGCATTT AGCCAACGGT GGCCTACTAC GTCTTGCACA ACTTGACCAC GTGAATTGGA CCGACGGTGG CATTGCCTTG GTCCTCATCG CAGAGACGCA CCCCGAATTG ACCGAGAGGG CGGTTACGCC ATTCATTGAT GTGATAGCGC ACGGGCTGAT GAAATATAAC CGCGATTTTA CTGGACCAGC GGAGGGTTTT GTTCGCGTTG TGATTGAACA TGCACCTGCC ACTTGGCGCG CAGTCTTGGG CAAACTTGAT CCTGTAACGA TAGAGAAGAA CTTCGCCGAA TGTCTCAAAG GGAATGCAGA TCATCGGCGC ACAGTTGCTG CTGTCATCGA GTCTGCGATA GTGCTGGATG ACCCTGTTGG TCATGCGGCC CGACGCATAC GGGCGCGATT TCCCAAGGCT TCAACTGCGC CGACTGATAC GCCGCATTTC GGCAGGTCGC GACGTCGGTC AGGCTAG
|
Protein sequence | MYLRSALHRQ MDVRILSCTQ IDHRGQMQFR AKQIAPPKEW GTFEDLCHAL FKRVWRDPFA QKNGRRGQAQ HGVDVFGSPG GDRSSYWGVQ CKGKDCNYGS KAEWSEVLLE VAKAEKFSPR LEKWIFATTA PTDALLQKAA RELSVARRAE GLFSVDVLGW EEIQALMADV PGVITEFYPE HADHLPQVIE ALRAVPLLEA KVVDLVERIE ATLLKPPNLH GSAVWEAVTF DGDRGLGPAL MGYALGPSDA VACPCLIEVG TVQAQLRVAY SARLIGEPGA GKSICSYQAA RELASGGFEV LRLLDPQADS IALEAVLPDK PRLYLIDDAH LLKPHILSRI ENQACPSRLV LSTHNVVGRL GHRGAITLDA KRAVKTIAAA LRADLPKTLE AVRLADDDVG ERMLDADLGE RLEHAEAVAD RPWQFCFVLG GGWRRSKQAA DSARLAAADL VLAAVAMRQM VSRDARAMPA EIMEVCERVG INSSVVEQGL EWLERERLIV GATDCRTPHQ RFASVVLKRI LEGQDTSGRD KIARMIESVL CDSHYPFAGL RVLIHELSFG DRYSWTHLLG QPAVEAAVAR CWIAAGSDRN FAALALSDLW DFMGGGAAAV VGPHVSTLAK WISSPSDSAY GLGHLLNALT RINQEVAEKV IAKVDSVAIA AAYSNANPET AYGLADLLCA VANVKVDDVN AKIRAAIDRN RLRELAKHEG FLEDAFVFSK FCASVVWWEE DLALEMAELF VPTAQQVLSK DPVEGFEQLS QDFASTVLRV FDVLGVYVGK WRPTRRQWAI ARRICEKIDP KQAAEHISTV RPRNFQSAGF FLHFLFQSAP RKYETVLQQI DWDKLDLAIG DDWRDMPHDT EVLLSTLYSG TLAQQLVQNF ISTRSNRIVH FPPRLLLMAP EVGFTHLANG GLLRLAQLDH VNWTDGGIAL VLIAETHPEL TERAVTPFID VIAHGLMKYN RDFTGPAEGF VRVVIEHAPA TWRAVLGKLD PVTIEKNFAE CLKGNADHRR TVAAVIESAI VLDDPVGHAA RRIRARFPKA STAPTDTPHF GRSRRRSG
|
| |