Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0098 |
Symbol | |
ID | 4882413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 94346 |
End bp | 97198 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640126026 |
Product | hypothetical protein |
Protein accession | YP_001057153 |
Protein GI | 126440554 |
COG category | [L] Replication, recombination and repair [S] Function unknown |
COG ID | [COG4643] Uncharacterized protein conserved in bacteria [COG5519] Superfamily II helicase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAAT TCGAGCGGGC AAGTGTCGCG CTCGGCTATG TTCCGGCCGA CGACCGAGAC ACGTGGCGTC AGGCCGGCAT GGCGCTCAAG GCCGAGTTCG GCGAAGAGGG TTTCACCCTC TGGAACGAAT GGAGCCAAGG CGCGCAAAAC TACAACGCAA GAGACGCCCG CGATGTATGG AAGTCGTTCA AGGGCGGCAA GATCACCATC AACACGCTGT TTCATCTCGC CAAACAAGGC GGCTTCGATC CGCGCGCGCA TCGCGCCAAG TCGATCGACC CAGCACAGCG CGAGCGACAG CACGCCGAGC GCGCCGCCCG CGAAGCGGCC GAGTTGGCGA CACTCACCGA GAAGCAGCAA GCCGCATCGG CGCTCGCCGA ATCGATCTGG TCGGCGGCCG AGCCCGCGCC GGCCGATCAC GCGTACCTCG TTCGCAAGCG CATCCCGGTC GACGCGCTGC GCGTCTATCG CGGCGGCTTG TGCATCGGCA CGGCCGCATG CGACGGCGCA CTGGTCATCG CAGCGCGTGA CGCGGACGGC AAGCTGTGGA CGCTGGAGTT CGTCCTCACG GACGGCCAGA AACGCTATCT GCCGAACGGC CGCAAGGCGG GCTGCTTCTC GCTGATCGGC GGGGCGGTAT CGTCCACGCT GCTGATTGGC GAGGGTTACG CCACGTGCGC GACGCTCGTG GCCGCGACCG GCTATCCGGC CGCTGTCGCG TTTGACGCAG GCAACCTGCA CGCCGTGGCG ACGGCACTGC GCGGCCAGTA TCCGGACGCC CGCATCGTCG TGTGCGCCGA CGACGACCAC ACGACGAAGG GCAATCCGGG CGTGACGAAG GCTCGCGCGG CGGCCGAGGC CGTCGCCGGC ATCGTGGCCG TACCCGACTT CGGTCCGAAC CGCCCGGCGG CCGGGACCGA CTTCAACGAC CTGGCTGCGC ACGTCGGCCC GGATGCGGTG GCCGCCGCCG TGCGGGCTGC GCTCGCGCCG GTCGGCTCAC CGGATGCCGG CAAGGCCAAG ACAGCACTGC CCGCCGCGAA GCCCGCCAAG CGCCCGAAAA CGGCTTGCGC GCAGGACGGC AAGTCGCGGT TCGTCGTCGA CGACAAGGGC GTGTGGTTTC ACGGCTTCAA CAATCAGGGC GATCCGCTGC CGCCGCATTG GGTCAGCACG CGGATCGACG TGATTGCGGA GACGCGCAAC GAGATGAACA GCGAGTGGGG CTACCTGCTC GAATTCACGG ACCGCGACGG CATCCTCAAA CGGTGGGCGG TGCCGGCGGG GCTCTTTGCC GGCGACGGCA CGGAGCTGCG CCGCATGCTG CTCGATATGG GCGTGAAGCT CGGCGTGACG CAGATCGCCC GCACGCAGAT CGCGAACTAT GTGCAGATGG CGCAGCCGGA CGAGCGCGTG CGCTGCGTGC CGCGCGTCGG CTGGCATCAC GGCGCGTTCG TGCTGCCCGA TCGCGTGATC GGCACCGGCA AAGAGGCGCT GATCTATCAG GCCGACACGC CGATCCAGAG CCAGTTCAAG GAGCGCGGCA CGCTGGAGGA CTGGCAACGC GAGGTCGCGG CCTACTGCGT CGGCAATAGC CGGCTGCTGT TCTGCGTCGC TACCGCCTTC GCTGGTCCGC TGCTGCACTT CTCCGGGCTT CAGTCGGGCG GCTTTCACTT GCTCGGCACG ACGTCGAAAG GCAAGTCGAC GGGCGGTGTC ATCGCCGCGT CCGTGTTCGG CTCACCGGAC TACGTGCGGA GCTGGAAGGC GACCGACAAC GCGCTCGAAG CCGTCGCCAC GCAGCATAGC GACGCGCTGC TGATTCTCGA CGAAATCGGG CAGGTCGAGC CGCGCTTGGT TGGAGACGTG ATCTACATGC TCGCGAACGA GTCGGGCAAG GCCCGCGCGT CGCGTAGCGG CTCGGCAAAG CCGGTTCTCA CGTGGCGACT GCTGTTCCTG TCGAACGGCG AAAAGAGCGT GTCCGCGTTG ATGGCCGAAG GCAACAAGCC GATGAAAGGC GGTATCGAGG TGCGCTTGCC CGCGATCCCG GCCGAGGTCG GCGAAATGGG CGTCGTGGAG AAGCTGCACG GGTTCCCGAC GCCGGCCGCG CTGATCGAGC ATCTAGAGCG GCACGCCGGC AGGCACTACG GCACGGCGGG GCCGGCCTTC ATCGAATGGG CATCGTCGCA GGCCGATGAG CTGGCTGAGC ATCTGCGCGT GCGCGTCGAC GAGCTGGTCG GGCAATGGGT GCCGGACGGC TCGCATTCGC AGGTCGCGCG CGTCGCCAAG CGGTTCTGCC TCGTTGCGGT GGCCGGCGAG CTGGCGACAG CGCACGGGCT GACCGGCTGG CCGCAGGGCG AAGCGGTCGA GGCCGCGCGT CGCTGCTTCG AAGGCTGGCT CGAACTGCGC GGCGGCACCG GCAACTCGGA CGAGGCCGAA GCCGTGCGGC AGGTACAGCA TTTCCTCGCC GCGCACGGCG ACAACCGTTT CGTGTGGATG AACCGTGCGC AGGACGACCA TCGGCCGAAC GTGCCGCATC GAGCGGGCTT CAAGCAGCAC GTGAAGCGCG ACGAGCGCCG CACGCCCATC GCGTCCGATC GCGAGTATTA CGCCGAGTTC GGCGGCAAGA TGAGCGCCGA CGATGCCGAA AGCGTCGAGA CGGAATACCT GATCGAAGCG GCCGTGTTCC GCAAGGACGT GTGCGCCGGC TTCGATCACA AGATCGTCGC CAAGGCACTA ATGAAGCGGG GCGTGCTGAT GCCGCGCAGC GACGGCTATC CGTACCGGCA GGAATACATC CCCGGTCACG GCAAGTTCAT GGTCTATCGC GTGCTGCCGT CGATCTTCAC GCTTGAGCTG TGA
|
Protein sequence | MSEFERASVA LGYVPADDRD TWRQAGMALK AEFGEEGFTL WNEWSQGAQN YNARDARDVW KSFKGGKITI NTLFHLAKQG GFDPRAHRAK SIDPAQRERQ HAERAAREAA ELATLTEKQQ AASALAESIW SAAEPAPADH AYLVRKRIPV DALRVYRGGL CIGTAACDGA LVIAARDADG KLWTLEFVLT DGQKRYLPNG RKAGCFSLIG GAVSSTLLIG EGYATCATLV AATGYPAAVA FDAGNLHAVA TALRGQYPDA RIVVCADDDH TTKGNPGVTK ARAAAEAVAG IVAVPDFGPN RPAAGTDFND LAAHVGPDAV AAAVRAALAP VGSPDAGKAK TALPAAKPAK RPKTACAQDG KSRFVVDDKG VWFHGFNNQG DPLPPHWVST RIDVIAETRN EMNSEWGYLL EFTDRDGILK RWAVPAGLFA GDGTELRRML LDMGVKLGVT QIARTQIANY VQMAQPDERV RCVPRVGWHH GAFVLPDRVI GTGKEALIYQ ADTPIQSQFK ERGTLEDWQR EVAAYCVGNS RLLFCVATAF AGPLLHFSGL QSGGFHLLGT TSKGKSTGGV IAASVFGSPD YVRSWKATDN ALEAVATQHS DALLILDEIG QVEPRLVGDV IYMLANESGK ARASRSGSAK PVLTWRLLFL SNGEKSVSAL MAEGNKPMKG GIEVRLPAIP AEVGEMGVVE KLHGFPTPAA LIEHLERHAG RHYGTAGPAF IEWASSQADE LAEHLRVRVD ELVGQWVPDG SHSQVARVAK RFCLVAVAGE LATAHGLTGW PQGEAVEAAR RCFEGWLELR GGTGNSDEAE AVRQVQHFLA AHGDNRFVWM NRAQDDHRPN VPHRAGFKQH VKRDERRTPI ASDREYYAEF GGKMSADDAE SVETEYLIEA AVFRKDVCAG FDHKIVAKAL MKRGVLMPRS DGYPYRQEYI PGHGKFMVYR VLPSIFTLEL
|
| |