Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1933 |
Symbol | proV |
ID | 4904727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1893070 |
End bp | 1894791 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640145039 |
Product | glycine betaine/L-proline ABC transporter, ATP-binding subunit |
Protein accession | YP_001075967 |
Protein GI | 126456616 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAATGT CGGCGGGGCT TTTTTTCGTG CGCGCGCGGC ACGAGCGGCA GCGGGTGGCG GCTGCGGGCC GGACGCGCAT CGCCGCGATC GCCGCGATCG CCGGCGAATG CGTCTGCGGA AATCCGACAT GCGCGAACCG CTGCCTCGGC ACCGCATCCG GCGAAAAACG GCAAGTGTCC CGCGACCAAA ATTCGGCAGT CGAAAGGGCG GACCATGCGC GCGGCCGCGC GCACGCGGTC TGCGCAACGG CGAGCCGCGC GGCGAGCGAT GCGGCATCGC GCAAACGCGG GATGAACGCG TCCGCGCGCA GCGCCGGCGC GCATCGCGGC ACGCCGCGCC CGCCACGGCC GCGTGCGGCG CGCTGCGGCG CATCACGCCG GCCGCCGGTG GGTTTCGTCG GCGCGCCGAA ACGCCCGTTC GCCGCACTTG TACGGACGTG TCGTATTTGC GCAAGCGTTT GTCGCAATTC GTCGGCTGCA CGGGGTTTTT TCTGGCAACC ATGTAGGCAC GCTATGACGG CGTATTTCCC CGCTCAACGA GGAGACGTTG CAATGGATGC CCCGAAGGTC GTAGTCGAAG GTCTGTGCAA GGTGTTTGGA AGCAATCCGC GGCAAGCGCT GGACATGCTC GCCGCAGGCG CGACGAAGGA TGAAGTGTTC GCGCGCACGG GCCAGGTGGT CGGCGTGCAC AACGTGTCGT TCGATGTGCG GGAAGGCGAG ATTTTCGTGC TGATGGGGCT CTCCGGCTCC GGCAAGTCGA CGCTGATCCG GCTCGTCAAC CGGCTCGTCG AGCCGAGCGC CGGCAAGGTG ATGATCGACG GGCGCGACGT CGCCGCGGTG CGCCGCGCCG AGCTGACCGC GCTGCGCCGC ACCGACATGA GCATGGTGTT CCAGTCGTTC GCGCTGATGC CGCAGCGCAC GGTGCTGTCG AACGCCGCGT TCGGCCTCGA AGTGGCCGGC ATGGGCCGCA AGGATCGCGA GCGGCGCGCG ATGGACGTGC TCGAGCAGGT GGGCCTCGCG CAGTTCGCGC ACAAGCTGCC CGCCGAACTC TCGGGCGGCA TGCAGCAGCG CGTCGGCCTC GCGCGCGCGC TCGCGGTGAA CCCGTCGCTG ATGATCATGG ACGAGGCGTT CTCCGCGCTC GATCCGTTAA AGCGCAAGGA AATGCAGAAC GTGCTGCTGC AGCTTCAGAA AGAGCAGCGC CGCACGATCA TGTTCGTGTC GCACGATCTC GAGGAGGCGC TGCGTATCGG CAGCCGGATC GCGATCATGG AGGGCGGCCG GCTCGTGCAG GTCGGCACGC CGCAGGAAAT CATCGCGAAC CCCGCCGACG ACTACGTGCG CGCGTTCTTC GAAGGCATCG ACACGAGCCG CTACCTGACC GCGGGCGACC TGATGCTCAC GGGCGCCGTG CCGACCCTGT CGAAGCTCGA TGCGAAGCAC GTCGCCGCTT CGCTGAACGG CAGCGCCGAA TACGCGTTCG TGCTCGACGA GGCGCGCAAG ATCCGCGGCT TCGTCACGCG CGACGCGCTG AACGGCGCGA CGCCGAACGT GCGCCAGGTC GAAAGCATTC CGCGCGACGC ATCGCTCGAT CACGTCGTCG AGCGATGCGT CGCGCATCCG CACGCGCTGC CCGTCGTCGA CGACGACGGC TGTTACTGCG GCTCGGTCGA CCGGGCCGTG CTTCTGAAAG CCATTACGCG TTCACGAGGT TCCCATGTCT GA
|
Protein sequence | MEMSAGLFFV RARHERQRVA AAGRTRIAAI AAIAGECVCG NPTCANRCLG TASGEKRQVS RDQNSAVERA DHARGRAHAV CATASRAASD AASRKRGMNA SARSAGAHRG TPRPPRPRAA RCGASRRPPV GFVGAPKRPF AALVRTCRIC ASVCRNSSAA RGFFWQPCRH AMTAYFPAQR GDVAMDAPKV VVEGLCKVFG SNPRQALDML AAGATKDEVF ARTGQVVGVH NVSFDVREGE IFVLMGLSGS GKSTLIRLVN RLVEPSAGKV MIDGRDVAAV RRAELTALRR TDMSMVFQSF ALMPQRTVLS NAAFGLEVAG MGRKDRERRA MDVLEQVGLA QFAHKLPAEL SGGMQQRVGL ARALAVNPSL MIMDEAFSAL DPLKRKEMQN VLLQLQKEQR RTIMFVSHDL EEALRIGSRI AIMEGGRLVQ VGTPQEIIAN PADDYVRAFF EGIDTSRYLT AGDLMLTGAV PTLSKLDAKH VAASLNGSAE YAFVLDEARK IRGFVTRDAL NGATPNVRQV ESIPRDASLD HVVERCVAHP HALPVVDDDG CYCGSVDRAV LLKAITRSRG SHV
|
| |