Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_4539 |
Symbol | |
ID | 4094412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | - |
Start bp | 1791340 |
End bp | 1793565 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638017826 |
Product | protein-tyrosine kinase |
Protein accession | YP_624393 |
Protein GI | 107026882 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.267128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAACA CGCAAGCGAA ACACTCCTAC GCGGACCTGA CCGCGAAGAC CGAGGAAGAG GACTTCGTTC TCGGCCAGTT GCTCCAGGTG ATCATGGACG ACATCTGGCT GCTGATCGGC ATCGCGGTGA CGGTCGTCGC GCTCGCCGGC CTCTACTGCT TCATCGCGAA GCCGGTCTAC CAGGCGGACG TGCACGTGCG GGTCGAGGGC AACGACAACA CGTCGCAGGC GCTCACGCAG ACGCAGACGG GCGCGTCGAT CAACAGCGGC CCGCAGCAGG CGCCGACCGA CGCGGAAATC GAGATCATCA AGAGCCGCGG TGTCGTCACG CCGGTCGTCG AGCAGTTCAA GCTGAACTTC TCGGTCGCGC CGAAGACGCT GCCGGTGCTC GGCAGCCTCG CCGCGCGCCT CGCGACGCCC GGCGAGCCGT CGCGGCCGTG GCTCGGCCTG AAGTCGTATG CGTGGGGCGG CGAAGTCGCC GACGTCGACT CGATCAGCGT CGTGCCGGCG CTCGAAGGCA AGAAGCTGAC GCTGACGGCC GGCCCGAACG GCACCTATTC GCTGGTCGAC GAGAACGGCA CGCGGCTGCT GGCGGGCCGT GTCGACGAAG CCGCGCAGGG CGGCGGCGTG ACGCTGCTCG TGCGGAAGCT GGTCGCGCGC CCCGGCACGC AGTTCACGGT GGTGCGCTAC AACGATCTCG ACGCGATCAG CGGCTTCCAG GCCGGTATCC AGGTGAGCGA GCAGGGCAAG CAGACGGGCG TCGTGCAGAT CTCGCTCGAA GGCAAGGACC CGGACCAGAC CGCCGCGATC GCGAACGCGC TCGCGCAGTC GTACCTGAAC CAGCACGTGG TCGCGAAGCA GGCGGAAGCG ACCAAGATGC TCGACTTCCT GAAGGGCGAG GAGCCGCGCC TGAAGGCCGA CCTCGAACGC GCGGAAGCCG CGCTCACGCA ATACCAGCGC ACGTCGGGCT CGATCAACGC GAGCGACGAG GCGAAGGTCT ACCTCGAAGG CAGCGTGCAG TACGAACAGC AGATCGCCGC GCAGCGGCTG CAGCTCGCGT CGCTCGCGCA GCGCTTCACC GATTCGCATC CGATGGTGAT CGCCGCGAAG CAGCAGCTCG CGGAGCTGCA GGGCGAGAAG GACAAGTTCA GCAACCGCTT CCGCAGCCTG CCGGCGACCG AGGTGAAGGC GGTCCAGCTC CAGCGCGACG CGAAGGTCGC CGAGGACATC TACGTGCTGC TGCTGAACCG CGTGCAGGAA CTGTCGGTGC AGAAGGCCGG CACGGGCGGC AACATCCACC TGATCGATTC GGCGCTGCGT CCGGGCGCGC CGGTCAAGCC GAAGAAGGTG CTGATCCTGT CGGCCGCCGT GTTCCTCGGG CTGATCCTCG GCACGGGCGT CGTGTTCCTG CGCCGCAACC TGTTCCAGGG CATCGAGGAC CCGGACCGCA TCGAGCGCAC GTTCAACCTG CCGCTGTACG GGCTGGTGCC GCAAAGCGCC GAGCAGGTGA AGCTCGACGC GGCGGCGGAG AAGGGCGGCA GCCGTGCGCG GCCGATCCTC GCGAGCCTGC GTCCGAAGGA CCTGAGCGTC GAGAGCATGC GCAGCCTGCG CACCGCGATG CAGTTCGCGA TGATGGACGC GAAGAACCGC GTGATCGTGC TGACCGGCCC GACGCCGGGC ATCGGCAAGA GCTTCCTGAC GGTCAACCTC GCGGTGCTGC TCGCGCATTC GGGCAAGCGC GTGCTGCTGA TCGACGCCGA CATGCGCCGC GGGATGCTCG ACCGCTACTT CGGTCTCACC GTGCAGCCGG GCCTGTCCGA GCTGCTGAGC GATCAGTCGC CGCTCGAGGA AGCGATCCGC GAGACGCCGG TGCAGGGCCT GTCGTTCATC GCGGCCGGCA CGCGCCCGCC GAATCCGTCG GAACTGCTGA TGTCGACGCG CCTGCCGCAA TACCTCGAAG GGCTCGGCAA GCGCTACGAC GTCGTGCTGA TCGATTCGCC GCCGGTGCTG GCGGTGACCG ACGCGACCAT CATCGGCCGC ATGGCCGGCT CGACGTTCCT CGTGTTGCGC TCGGGCATGC ATACCGAAGG CGAGATCGCC GACGCGATCA AGCGCCTGCG CACCGCGGGC GTCGATCTGG AGGGCGGGAT CTTCAATGGC GTGCCGCCGA AGGCGCGCGG CTACGGCCGC GGCTATGCGG CCGTACACGA ATACCTGAGC GCTTGA
|
Protein sequence | MVNTQAKHSY ADLTAKTEEE DFVLGQLLQV IMDDIWLLIG IAVTVVALAG LYCFIAKPVY QADVHVRVEG NDNTSQALTQ TQTGASINSG PQQAPTDAEI EIIKSRGVVT PVVEQFKLNF SVAPKTLPVL GSLAARLATP GEPSRPWLGL KSYAWGGEVA DVDSISVVPA LEGKKLTLTA GPNGTYSLVD ENGTRLLAGR VDEAAQGGGV TLLVRKLVAR PGTQFTVVRY NDLDAISGFQ AGIQVSEQGK QTGVVQISLE GKDPDQTAAI ANALAQSYLN QHVVAKQAEA TKMLDFLKGE EPRLKADLER AEAALTQYQR TSGSINASDE AKVYLEGSVQ YEQQIAAQRL QLASLAQRFT DSHPMVIAAK QQLAELQGEK DKFSNRFRSL PATEVKAVQL QRDAKVAEDI YVLLLNRVQE LSVQKAGTGG NIHLIDSALR PGAPVKPKKV LILSAAVFLG LILGTGVVFL RRNLFQGIED PDRIERTFNL PLYGLVPQSA EQVKLDAAAE KGGSRARPIL ASLRPKDLSV ESMRSLRTAM QFAMMDAKNR VIVLTGPTPG IGKSFLTVNL AVLLAHSGKR VLLIDADMRR GMLDRYFGLT VQPGLSELLS DQSPLEEAIR ETPVQGLSFI AAGTRPPNPS ELLMSTRLPQ YLEGLGKRYD VVLIDSPPVL AVTDATIIGR MAGSTFLVLR SGMHTEGEIA DAIKRLRTAG VDLEGGIFNG VPPKARGYGR GYAAVHEYLS A
|
| |