Gene Information |
Locus tag | Bcen2424_5172 |
Symbol | |
ID | 4452941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia HI2424 |
Kingdom | Bacteria |
Replicon accession | NC_008543 |
Strand | - |
Start bp | 2218922 |
End bp | 2220091 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639697228 |
Product | glycine betaine/L-proline ABC transporter, ATPase subunit |
Protein accession | YP_838798 |
Protein GI | 116693265 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
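The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2218922
END_BP = 2220091


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1170 389
```

Under these assumptions the computed values agree with the reported Gene Length (1170 bp) and Protein Length (389 aa).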
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00993584 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.408818 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCCC CCAAGGTTGT GGTCGAAGGT CTGTGCAAGG TGTTTGGAAG CAATCCGCAG CAGGCGCTCG ACATGCTCGC CGCCGGCGCG ACGAAGGACG ATGTGCTCAA GCGCACCGGC CAGGTCGTCG GCGTGCACAA CGTCTCCTTC GATGTGCAGG AAGGCGAAAT TTTCGTGCTG ATGGGCCTGT CCGGCTCCGG CAAGTCCACG CTGATCCGCC TTGTGAACCG GCTGGTCGAT CCGAGCGCCG GCAAGGTGCT GATCGACGGG CTCGACGTCG CGTCGGCGCG CCGCTCGGCG CTGACCGCGC TGCGCCGCAA GGACATGAGC ATGGTGTTCC AGTCGTTCGC GCTGATGCCG CATCGCACCG TCGTGTCGAA TGCCGCGTTC GGGCTCGAAG TCGGCGGCAT GGGCAAGAAG GAGCGCGAGC GCCGTGCGAT GGAAGTGCTC GAGCAAGTCG GCCTCGCGCC GTTCTCGCAC AAGCTGCCGT CCGAGCTGTC GGGCGGCATG CAGCAGCGCG TCGGCCTCGC TCGCGCGCTG GCCGTGAATC CGTCGCTGAT GATCATGGAC GAGGCGTTCT CCGCGCTCGA TCCGCTCAAG CGCCGCGAAA TGCAGGACGT GCTGCTGCAA CTGCAGAAGG AGCAGCGCCG CACGATCATG TTCGTGTCGC ATGATCTCGA GGAAGCGCTG CGCATCGGCA ACCGCATCGC GATCATGGAG GGCGGCCGCC TCGTGCAGGT CGGCACGCCG CAGGACATCA TCGCGAACCC GGCCGACGAT TACGTGCGCG CGTTCTTCGA CGGCATCGAC ACCAGCCGCT ACCTCACCGC CGGCGACCTG ATGCAGACGG GCGCCGTGCC GCTCGTGTCG AAGTGCGATG CCGCGAACGT CGCGGCTTCG CTGAACGGCA GCGCCGAATA CGCGTTCGTG CTCGACGCCG CACGCAAGAT TCGCGGCTTC GTCACGCGCG AGGCGCTCGG CCAGGACACG CCGTCCGTGC GGCCGATCGA AAGCATCCGG CGCGATGCGT CGCTCGAACA TGTCGTCGCG CGCGTGGTCG CGAGCCCGAA TGCACTGCCC GTCGTCGACG ACGACGGCTG CTACTGCGGC TCGGTCGATC GTGCGCTCAT CCTGAAGGCC ATCACGCGTT CGCGAGGCTC CCATGTCTGA
|
Protein sequence | MDAPKVVVEG LCKVFGSNPQ QALDMLAAGA TKDDVLKRTG QVVGVHNVSF DVQEGEIFVL MGLSGSGKST LIRLVNRLVD PSAGKVLIDG LDVASARRSA LTALRRKDMS MVFQSFALMP HRTVVSNAAF GLEVGGMGKK ERERRAMEVL EQVGLAPFSH KLPSELSGGM QQRVGLARAL AVNPSLMIMD EAFSALDPLK RREMQDVLLQ LQKEQRRTIM FVSHDLEEAL RIGNRIAIME GGRLVQVGTP QDIIANPADD YVRAFFDGID TSRYLTAGDL MQTGAVPLVS KCDAANVAAS LNGSAEYAFV LDAARKIRGF VTREALGQDT PSVRPIESIR RDASLEHVVA RVVASPNALP VVDDDGCYCG SVDRALILKA ITRSRGSHV
|
| |
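The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence; the helper names are hypothetical, and the record does not say how its 67% figure was rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
first_blocks = "ATGGATGCCC CCAAGGTTGT"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 55.0
```

Passing the full gene sequence string to `gc_percent` gives a value that can be compared with the reported GC content, and `len(clean_sequence(...))` can be compared with the reported gene and protein lengths.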