Gene BURPS668_A2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2027 
SymbolproV 
ID4886004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1959008 
End bp1960741 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content68% 
IMG OID640131965 
Productglycine betaine/L-proline ABC transporter, ATP-binding subunit 
Protein accessionYP_001063022 
Protein GI126444903 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGTGG AACCACTCGG CACACGCTCG ACAGCCGAAA CGGCCCCGAC GCTGCGATCG 
CGTCGCCGGT TCGATCGGGC CGGACGCGCA TCGCCGGCGA TCGTCGCGAT CGCCGGCGAA
TACGTCTGCG GAAATCCGAC ATGCGCGAAC CGCTGCCTCG ACACCGCATC CGGCGAAAAA
CGGCAAGTGT CCCGCGACCA AGATTCGATA GTCGAAAGGG CGGACCATGC GCGCGGCCGC
GCGCACGCGG TCCGCGCAAC GGCGAGCCGC GCGGCGAGCG ATGCGGCGTC GCGCAAACGC
GGGATGAACG CGTCCGCGCG CAGCGCCGGC GCGCATCGCG GCACGCCGCG CCCGCCACGG
CCGCGCGCGG CGCGCTGCGG CGCATCACGC CGGCCGCCGG TGGGTTTCGT CGGCGCGCCG
AAACGCCCGT TCGCCGCACT TGTACGGACG TGTCGTATTT GCGCAAGCGT TTGTCGCAAT
TCGTCGGCTG CACGGGGTTT TTTCTGGCAA CCATGTAGGC ACGCTATGAC GGCGTATTTC
CCCGCTCAAC GAGGAGACGT TGCAATGGAT GCCCCGAAGG TCGTAGTCGA AGGTCTGTGC
AAGGTGTTTG GAAGCAATCC GCGGCAAGCG CTGGACATGC TCGCCGCAGG CGCGACGAAG
GATGAAGTGT TCGCGCGCAC GGGCCAGGTG GTCGGCGTGC ACAACGTGTC GTTCGATGTG
CGGGAAGGCG AGATTTTCGT GCTGATGGGG CTCTCCGGCT CCGGCAAGTC GACGCTGATC
CGGCTCGTCA ACCGGCTCGT CGAGCCGAGC GCCGGCAAGG TGATGATCGA CGGGCGCGAC
GTCGCCGCGG TGCGCCGCGC CGAGCTGACC GCGCTGCGCC GCACCGACAT GAGCATGGTG
TTCCAGTCGT TCGCGCTGAT GCCGCAGCGC ACGGTGCTGT CGAACGCCGC GTTCGGCCTC
GAAGTGGCCG GCATGGGCCG CAAGGATCGC GAGCGGCGCG CGATGGACGT GCTCGAGCAG
GTGGGCCTCG CGCAGTTCGC GCACAAACTG CCCGCCGAAC TCTCGGGCGG CATGCAGCAG
CGCGTCGGCC TCGCGCGCGC GCTCGCGGTG AACCCGTCGC TGATGATCAT GGACGAGGCG
TTCTCCGCGC TCGATCCGTT AAAGCGCAAG GAAATGCAGA ACGTGCTGCT GCAGCTTCAG
AAAGAGCAGC GCCGCACGAT CATGTTCGTG TCGCACGATC TCGAGGAGGC GCTGCGCATC
GGCAGCCGGA TCGCGATCAT GGAGGGCGGC CGGCTCGTGC AGGTCGGCAC GCCGCAGGAA
ATCATCGCGA ACCCCGCCGA CGACTACGTG CGCGCGTTCT TCGAAGGCAT CGACACGAGC
CGCTACCTGA CCGCGGGCGA CCTGATGCTC ACGGGCGCCG TGCCGACCCT GTCGAAGCTC
GATGCGAAGC ACGTCGCCGC TTCGCTGAAC GGCAGCGCCG AATACGCGTT CGTGCTCGAC
GAGGCGCGCA AGATCCGCGG CTTCGTCACG CGCGACGCGC TGAACGGCGC GACGCCGAAC
GTGCGCCAGG TCGAAAGCAT TCCGCGCGAC GCATCGCTCG ATCACGTCGT CGAGCGATGC
GTCGCGCATC CGCACGCGCT GCCCGTCGTC GACGACGACG GCTGTTACTG CGGCTCGGTC
GACCGGGCCG TGCTTCTGAA AGCCATTACG CGTTCACGAG GTTCCCATGT CTGA
 
Protein sequence
MHVEPLGTRS TAETAPTLRS RRRFDRAGRA SPAIVAIAGE YVCGNPTCAN RCLDTASGEK 
RQVSRDQDSI VERADHARGR AHAVRATASR AASDAASRKR GMNASARSAG AHRGTPRPPR
PRAARCGASR RPPVGFVGAP KRPFAALVRT CRICASVCRN SSAARGFFWQ PCRHAMTAYF
PAQRGDVAMD APKVVVEGLC KVFGSNPRQA LDMLAAGATK DEVFARTGQV VGVHNVSFDV
REGEIFVLMG LSGSGKSTLI RLVNRLVEPS AGKVMIDGRD VAAVRRAELT ALRRTDMSMV
FQSFALMPQR TVLSNAAFGL EVAGMGRKDR ERRAMDVLEQ VGLAQFAHKL PAELSGGMQQ
RVGLARALAV NPSLMIMDEA FSALDPLKRK EMQNVLLQLQ KEQRRTIMFV SHDLEEALRI
GSRIAIMEGG RLVQVGTPQE IIANPADDYV RAFFEGIDTS RYLTAGDLML TGAVPTLSKL
DAKHVAASLN GSAEYAFVLD EARKIRGFVT RDALNGATPN VRQVESIPRD ASLDHVVERC
VAHPHALPVV DDDGCYCGSV DRAVLLKAIT RSRGSHV