Gene Information |
Locus tag | BURPS668_A2027 |
Symbol | proV |
ID | 4886004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1959008 |
End bp | 1960741 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640131965 |
Product | glycine betaine/L-proline ABC transporter, ATP-binding subunit |
Protein accession | YP_001063022 |
Protein GI | 126444903 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
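The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The record does not state either convention, but its figures are consistent with both. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1959008
END_BP = 1960741
GENE_LENGTH_BP = 1734
PROTEIN_LENGTH_AA = 577


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinates vs. Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # Gene Length vs. Protein Length
```

Under these assumptions, 1960741 - 1959008 + 1 gives 1734 bp, and 1734 / 3 - 1 gives 577 aa, matching the listed values.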
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCATGTGG AACCACTCGG CACACGCTCG ACAGCCGAAA CGGCCCCGAC GCTGCGATCG CGTCGCCGGT TCGATCGGGC CGGACGCGCA TCGCCGGCGA TCGTCGCGAT CGCCGGCGAA TACGTCTGCG GAAATCCGAC ATGCGCGAAC CGCTGCCTCG ACACCGCATC CGGCGAAAAA CGGCAAGTGT CCCGCGACCA AGATTCGATA GTCGAAAGGG CGGACCATGC GCGCGGCCGC GCGCACGCGG TCCGCGCAAC GGCGAGCCGC GCGGCGAGCG ATGCGGCGTC GCGCAAACGC GGGATGAACG CGTCCGCGCG CAGCGCCGGC GCGCATCGCG GCACGCCGCG CCCGCCACGG CCGCGCGCGG CGCGCTGCGG CGCATCACGC CGGCCGCCGG TGGGTTTCGT CGGCGCGCCG AAACGCCCGT TCGCCGCACT TGTACGGACG TGTCGTATTT GCGCAAGCGT TTGTCGCAAT TCGTCGGCTG CACGGGGTTT TTTCTGGCAA CCATGTAGGC ACGCTATGAC GGCGTATTTC CCCGCTCAAC GAGGAGACGT TGCAATGGAT GCCCCGAAGG TCGTAGTCGA AGGTCTGTGC AAGGTGTTTG GAAGCAATCC GCGGCAAGCG CTGGACATGC TCGCCGCAGG CGCGACGAAG GATGAAGTGT TCGCGCGCAC GGGCCAGGTG GTCGGCGTGC ACAACGTGTC GTTCGATGTG CGGGAAGGCG AGATTTTCGT GCTGATGGGG CTCTCCGGCT CCGGCAAGTC GACGCTGATC CGGCTCGTCA ACCGGCTCGT CGAGCCGAGC GCCGGCAAGG TGATGATCGA CGGGCGCGAC GTCGCCGCGG TGCGCCGCGC CGAGCTGACC GCGCTGCGCC GCACCGACAT GAGCATGGTG TTCCAGTCGT TCGCGCTGAT GCCGCAGCGC ACGGTGCTGT CGAACGCCGC GTTCGGCCTC GAAGTGGCCG GCATGGGCCG CAAGGATCGC GAGCGGCGCG CGATGGACGT GCTCGAGCAG GTGGGCCTCG CGCAGTTCGC GCACAAACTG CCCGCCGAAC TCTCGGGCGG CATGCAGCAG CGCGTCGGCC TCGCGCGCGC GCTCGCGGTG AACCCGTCGC TGATGATCAT GGACGAGGCG TTCTCCGCGC TCGATCCGTT AAAGCGCAAG GAAATGCAGA ACGTGCTGCT GCAGCTTCAG AAAGAGCAGC GCCGCACGAT CATGTTCGTG TCGCACGATC TCGAGGAGGC GCTGCGCATC GGCAGCCGGA TCGCGATCAT GGAGGGCGGC CGGCTCGTGC AGGTCGGCAC GCCGCAGGAA ATCATCGCGA ACCCCGCCGA CGACTACGTG CGCGCGTTCT TCGAAGGCAT CGACACGAGC CGCTACCTGA CCGCGGGCGA CCTGATGCTC ACGGGCGCCG TGCCGACCCT GTCGAAGCTC GATGCGAAGC ACGTCGCCGC TTCGCTGAAC GGCAGCGCCG AATACGCGTT CGTGCTCGAC GAGGCGCGCA AGATCCGCGG CTTCGTCACG CGCGACGCGC TGAACGGCGC GACGCCGAAC GTGCGCCAGG TCGAAAGCAT TCCGCGCGAC GCATCGCTCG ATCACGTCGT CGAGCGATGC GTCGCGCATC CGCACGCGCT GCCCGTCGTC GACGACGACG GCTGTTACTG CGGCTCGGTC GACCGGGCCG TGCTTCTGAA AGCCATTACG CGTTCACGAG GTTCCCATGT CTGA
Protein sequence | MHVEPLGTRS TAETAPTLRS RRRFDRAGRA SPAIVAIAGE YVCGNPTCAN RCLDTASGEK RQVSRDQDSI VERADHARGR AHAVRATASR AASDAASRKR GMNASARSAG AHRGTPRPPR PRAARCGASR RPPVGFVGAP KRPFAALVRT CRICASVCRN SSAARGFFWQ PCRHAMTAYF PAQRGDVAMD APKVVVEGLC KVFGSNPRQA LDMLAAGATK DEVFARTGQV VGVHNVSFDV REGEIFVLMG LSGSGKSTLI RLVNRLVEPS AGKVMIDGRD VAAVRRAELT ALRRTDMSMV FQSFALMPQR TVLSNAAFGL EVAGMGRKDR ERRAMDVLEQ VGLAQFAHKL PAELSGGMQQ RVGLARALAV NPSLMIMDEA FSALDPLKRK EMQNVLLQLQ KEQRRTIMFV SHDLEEALRI GSRIAIMEGG RLVQVGTPQE IIANPADDYV RAFFEGIDTS RYLTAGDLML TGAVPTLSKL DAKHVAASLN GSAEYAFVLD EARKIRGFVT RDALNGATPN VRQVESIPRD ASLDHVVERC VAHPHALPVV DDDGCYCGSV DRAVLLKAIT RSRGSHV
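Both sequences are displayed in space-separated groups of ten characters, so the grouping has to be stripped before the text is used elsewhere. The sketch below does that for the first three groups of the gene sequence and translates them with a deliberately partial codon table that covers only the codons present in that prefix. These assignments are the same in the standard code and in translation table 11. The helper names are hypothetical, and a full translation would need a complete table, for example from a sequence-analysis library.

```python
# First three 10-base groups of the gene sequence, as displayed in this record.
DISPLAYED_PREFIX = "ATGCATGTGG AACCACTCGG CACACGCTCG"

# Partial codon table: only the ten codons that occur in the prefix above.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "CAT": "H", "GTG": "V", "GAA": "E", "CCA": "P",
    "CTC": "L", "GGC": "G", "ACA": "T", "CGC": "R", "TCG": "S",
}


def strip_grouping(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate_prefix(dna: str) -> str:
    """Translate complete codons only; raises KeyError for codons not in the table."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    return "".join(PARTIAL_CODON_TABLE[codon] for codon in codons)


prefix_dna = strip_grouping(DISPLAYED_PREFIX)
prefix_protein = translate_prefix(prefix_dna)

print(len(prefix_dna))   # 30
print(prefix_protein)    # MHVEPLGTRS
```

The translated prefix agrees with the first group of the protein sequence listed above (MHVEPLGTRS).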