Gene BURPS668_0098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0098 
Symbol 
ID4882413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp94346 
End bp97198 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content68% 
IMG OID640126026 
Producthypothetical protein 
Protein accessionYP_001057153 
Protein GI126440554 
COG category[L] Replication, recombination and repair
[S] Function unknown 
COG ID[COG4643] Uncharacterized protein conserved in bacteria
[COG5519] Superfamily II helicase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAAT TCGAGCGGGC AAGTGTCGCG CTCGGCTATG TTCCGGCCGA CGACCGAGAC 
ACGTGGCGTC AGGCCGGCAT GGCGCTCAAG GCCGAGTTCG GCGAAGAGGG TTTCACCCTC
TGGAACGAAT GGAGCCAAGG CGCGCAAAAC TACAACGCAA GAGACGCCCG CGATGTATGG
AAGTCGTTCA AGGGCGGCAA GATCACCATC AACACGCTGT TTCATCTCGC CAAACAAGGC
GGCTTCGATC CGCGCGCGCA TCGCGCCAAG TCGATCGACC CAGCACAGCG CGAGCGACAG
CACGCCGAGC GCGCCGCCCG CGAAGCGGCC GAGTTGGCGA CACTCACCGA GAAGCAGCAA
GCCGCATCGG CGCTCGCCGA ATCGATCTGG TCGGCGGCCG AGCCCGCGCC GGCCGATCAC
GCGTACCTCG TTCGCAAGCG CATCCCGGTC GACGCGCTGC GCGTCTATCG CGGCGGCTTG
TGCATCGGCA CGGCCGCATG CGACGGCGCA CTGGTCATCG CAGCGCGTGA CGCGGACGGC
AAGCTGTGGA CGCTGGAGTT CGTCCTCACG GACGGCCAGA AACGCTATCT GCCGAACGGC
CGCAAGGCGG GCTGCTTCTC GCTGATCGGC GGGGCGGTAT CGTCCACGCT GCTGATTGGC
GAGGGTTACG CCACGTGCGC GACGCTCGTG GCCGCGACCG GCTATCCGGC CGCTGTCGCG
TTTGACGCAG GCAACCTGCA CGCCGTGGCG ACGGCACTGC GCGGCCAGTA TCCGGACGCC
CGCATCGTCG TGTGCGCCGA CGACGACCAC ACGACGAAGG GCAATCCGGG CGTGACGAAG
GCTCGCGCGG CGGCCGAGGC CGTCGCCGGC ATCGTGGCCG TACCCGACTT CGGTCCGAAC
CGCCCGGCGG CCGGGACCGA CTTCAACGAC CTGGCTGCGC ACGTCGGCCC GGATGCGGTG
GCCGCCGCCG TGCGGGCTGC GCTCGCGCCG GTCGGCTCAC CGGATGCCGG CAAGGCCAAG
ACAGCACTGC CCGCCGCGAA GCCCGCCAAG CGCCCGAAAA CGGCTTGCGC GCAGGACGGC
AAGTCGCGGT TCGTCGTCGA CGACAAGGGC GTGTGGTTTC ACGGCTTCAA CAATCAGGGC
GATCCGCTGC CGCCGCATTG GGTCAGCACG CGGATCGACG TGATTGCGGA GACGCGCAAC
GAGATGAACA GCGAGTGGGG CTACCTGCTC GAATTCACGG ACCGCGACGG CATCCTCAAA
CGGTGGGCGG TGCCGGCGGG GCTCTTTGCC GGCGACGGCA CGGAGCTGCG CCGCATGCTG
CTCGATATGG GCGTGAAGCT CGGCGTGACG CAGATCGCCC GCACGCAGAT CGCGAACTAT
GTGCAGATGG CGCAGCCGGA CGAGCGCGTG CGCTGCGTGC CGCGCGTCGG CTGGCATCAC
GGCGCGTTCG TGCTGCCCGA TCGCGTGATC GGCACCGGCA AAGAGGCGCT GATCTATCAG
GCCGACACGC CGATCCAGAG CCAGTTCAAG GAGCGCGGCA CGCTGGAGGA CTGGCAACGC
GAGGTCGCGG CCTACTGCGT CGGCAATAGC CGGCTGCTGT TCTGCGTCGC TACCGCCTTC
GCTGGTCCGC TGCTGCACTT CTCCGGGCTT CAGTCGGGCG GCTTTCACTT GCTCGGCACG
ACGTCGAAAG GCAAGTCGAC GGGCGGTGTC ATCGCCGCGT CCGTGTTCGG CTCACCGGAC
TACGTGCGGA GCTGGAAGGC GACCGACAAC GCGCTCGAAG CCGTCGCCAC GCAGCATAGC
GACGCGCTGC TGATTCTCGA CGAAATCGGG CAGGTCGAGC CGCGCTTGGT TGGAGACGTG
ATCTACATGC TCGCGAACGA GTCGGGCAAG GCCCGCGCGT CGCGTAGCGG CTCGGCAAAG
CCGGTTCTCA CGTGGCGACT GCTGTTCCTG TCGAACGGCG AAAAGAGCGT GTCCGCGTTG
ATGGCCGAAG GCAACAAGCC GATGAAAGGC GGTATCGAGG TGCGCTTGCC CGCGATCCCG
GCCGAGGTCG GCGAAATGGG CGTCGTGGAG AAGCTGCACG GGTTCCCGAC GCCGGCCGCG
CTGATCGAGC ATCTAGAGCG GCACGCCGGC AGGCACTACG GCACGGCGGG GCCGGCCTTC
ATCGAATGGG CATCGTCGCA GGCCGATGAG CTGGCTGAGC ATCTGCGCGT GCGCGTCGAC
GAGCTGGTCG GGCAATGGGT GCCGGACGGC TCGCATTCGC AGGTCGCGCG CGTCGCCAAG
CGGTTCTGCC TCGTTGCGGT GGCCGGCGAG CTGGCGACAG CGCACGGGCT GACCGGCTGG
CCGCAGGGCG AAGCGGTCGA GGCCGCGCGT CGCTGCTTCG AAGGCTGGCT CGAACTGCGC
GGCGGCACCG GCAACTCGGA CGAGGCCGAA GCCGTGCGGC AGGTACAGCA TTTCCTCGCC
GCGCACGGCG ACAACCGTTT CGTGTGGATG AACCGTGCGC AGGACGACCA TCGGCCGAAC
GTGCCGCATC GAGCGGGCTT CAAGCAGCAC GTGAAGCGCG ACGAGCGCCG CACGCCCATC
GCGTCCGATC GCGAGTATTA CGCCGAGTTC GGCGGCAAGA TGAGCGCCGA CGATGCCGAA
AGCGTCGAGA CGGAATACCT GATCGAAGCG GCCGTGTTCC GCAAGGACGT GTGCGCCGGC
TTCGATCACA AGATCGTCGC CAAGGCACTA ATGAAGCGGG GCGTGCTGAT GCCGCGCAGC
GACGGCTATC CGTACCGGCA GGAATACATC CCCGGTCACG GCAAGTTCAT GGTCTATCGC
GTGCTGCCGT CGATCTTCAC GCTTGAGCTG TGA
 
Protein sequence
MSEFERASVA LGYVPADDRD TWRQAGMALK AEFGEEGFTL WNEWSQGAQN YNARDARDVW 
KSFKGGKITI NTLFHLAKQG GFDPRAHRAK SIDPAQRERQ HAERAAREAA ELATLTEKQQ
AASALAESIW SAAEPAPADH AYLVRKRIPV DALRVYRGGL CIGTAACDGA LVIAARDADG
KLWTLEFVLT DGQKRYLPNG RKAGCFSLIG GAVSSTLLIG EGYATCATLV AATGYPAAVA
FDAGNLHAVA TALRGQYPDA RIVVCADDDH TTKGNPGVTK ARAAAEAVAG IVAVPDFGPN
RPAAGTDFND LAAHVGPDAV AAAVRAALAP VGSPDAGKAK TALPAAKPAK RPKTACAQDG
KSRFVVDDKG VWFHGFNNQG DPLPPHWVST RIDVIAETRN EMNSEWGYLL EFTDRDGILK
RWAVPAGLFA GDGTELRRML LDMGVKLGVT QIARTQIANY VQMAQPDERV RCVPRVGWHH
GAFVLPDRVI GTGKEALIYQ ADTPIQSQFK ERGTLEDWQR EVAAYCVGNS RLLFCVATAF
AGPLLHFSGL QSGGFHLLGT TSKGKSTGGV IAASVFGSPD YVRSWKATDN ALEAVATQHS
DALLILDEIG QVEPRLVGDV IYMLANESGK ARASRSGSAK PVLTWRLLFL SNGEKSVSAL
MAEGNKPMKG GIEVRLPAIP AEVGEMGVVE KLHGFPTPAA LIEHLERHAG RHYGTAGPAF
IEWASSQADE LAEHLRVRVD ELVGQWVPDG SHSQVARVAK RFCLVAVAGE LATAHGLTGW
PQGEAVEAAR RCFEGWLELR GGTGNSDEAE AVRQVQHFLA AHGDNRFVWM NRAQDDHRPN
VPHRAGFKQH VKRDERRTPI ASDREYYAEF GGKMSADDAE SVETEYLIEA AVFRKDVCAG
FDHKIVAKAL MKRGVLMPRS DGYPYRQEYI PGHGKFMVYR VLPSIFTLEL