Gene BURPS1106A_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0005 
Symbol 
ID4899968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp5869 
End bp7209 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID640133235 
ProductCobW/P47K family protein 
Protein accessionYP_001064290 
Protein GI126452541 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAGC CGTTGCCCGT CACCGTGCTG TCCGGCTTCC TCGGCGCCGG CAAGACCACG 
CTGCTCAATC ACATCCTCGC GAATCGCGCG GGCCTGAAGG TCGCCGTGAT CGTCAACGAT
CTCGCCGCGG CCAACGTCGA CGCGACATTC GTGCGCGGCG CGACCGAGCT GTCGCATGTC
GAAGCGCACC TCGTCGAGAT GTCGAACGGC TGCATCTGCT GCACGCTGCG CGACGATCTG
CTCGTCGAGA TCCGCCGCCT CGCGGCCGAA AACCGCTTCG ACGCGATCGT GGTCGAATCG
ACGGGCATCG CCGAGCCGAT GCCGATCGCC GAGACCTTCA CGTTCGTCGA CGACGACGGC
TCCACGCTCG AGGACGTTGC GCGTCTCGAT ACGATGGTCA CCGTCATCGA CGCGTTCAAC
TTCCTGCACG ACTATGCGCG CGACGACGCG CTCGCGGAGC ACGGCCTCGC GGCGACCGAC
GAAGACGACC GCACGCTCGT CGAGCTGCTG ATCGAGCAGA TCGAGTTTTG CGACGTGCTC
GTGATCAACA AGGCGGATCT CGTCGACGCC GACGCGCTCG CGCGCCTGCA GCGGATCCTC
GCGAACCTGA ATCCGCGCGC GCGGCAGATC GTGAGCCGCT TCGGCGACGT GCCGCTCGCC
GAAGTGATCA ATACCGGCCG CTTCGATTTC GACGCGGCCG CGAACGCGCC GGGCTGGCTC
GCGTCGCTCG AGCATCGGCG CGACGCCGAT GAAGCCGAAT GCGGCCAAGG CCAAAGCCAA
GGCGACGGCC GCGTGCACAG CGAGGCCGAC GAATACGGCA TCGGCCACTT CGTCTATCGC
GCGCGCCGGC CGTTCCATCC GCAACGGCTC TGGGCGCTCC TGCACGAAGA GTGGAAGGGC
GTGCTGCGCA GCAAGGGGTT TTTCTGGCTC GCGACGCGCA ACGACATCGC GGGTTCGCTG
TCGCAGGCGG GCGGCGTGTG CCGGCACGGT CCGGCCGGCC ACTGGTGGGC GGCGCAGGAT
CGCACCGAAT GGCCGGAGGC GGGCGACGAG CTGTACGACG AGATCGTCGC CGACTGGCAC
GGCGAGCTCG CCGACACGTC GATCGGCGAT CGGCGCCAGG AGCTCGTACT GATCGGCATC
GGGCTCGATG CGGCGGCGTG GCGCGCGAAG TTCGACGCGT GCCTGCTCAC CGGCGCGGAG
TACGCGCAAG GCAAGCAGGC GTGGGCGGGC TACGCGGATC CGTTCCCGGC ATGGGACGTC
GATGATCACG ATCACGACCA TGCGCATGAC CACCATGATC ACGACCACGG CGACGACTCG
GAAATCGTCC ACCGCCACTG A
 
Protein sequence
MNQPLPVTVL SGFLGAGKTT LLNHILANRA GLKVAVIVND LAAANVDATF VRGATELSHV 
EAHLVEMSNG CICCTLRDDL LVEIRRLAAE NRFDAIVVES TGIAEPMPIA ETFTFVDDDG
STLEDVARLD TMVTVIDAFN FLHDYARDDA LAEHGLAATD EDDRTLVELL IEQIEFCDVL
VINKADLVDA DALARLQRIL ANLNPRARQI VSRFGDVPLA EVINTGRFDF DAAANAPGWL
ASLEHRRDAD EAECGQGQSQ GDGRVHSEAD EYGIGHFVYR ARRPFHPQRL WALLHEEWKG
VLRSKGFFWL ATRNDIAGSL SQAGGVCRHG PAGHWWAAQD RTEWPEAGDE LYDEIVADWH
GELADTSIGD RRQELVLIGI GLDAAAWRAK FDACLLTGAE YAQGKQAWAG YADPFPAWDV
DDHDHDHAHD HHDHDHGDDS EIVHRH