Gene Information |
Locus tag | BURPS668_0194 |
Symbol | |
ID | 4882635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 188167 |
End bp | 189435 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640126122 |
Product | hypothetical protein |
Protein accession | YP_001057247 |
Protein GI | 126439215 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATACCG TTACGCTCAA GCCGTCGAAA GACAAATCCC TGCTGCGCCG CCACCCGTGG GTCTACGCGA ACGCGATCGA CCGCGTCGAC GGCAAGCCCG CGCCCGGCGC GACCGTCATC GTGCGCGCGC ACGACGGGCG CTTCCTCGCG CGCGCCGCGT ACAGCCCGCA TTCGCAGATC CGGCTGCGCG TGTGGAGCTT CGACGAGAAC GAGCCGATCG ACCACGCGTT CTTCAAGCGG CGCGTGCAAC GCGCGCTCGC GCATCGCCGC GCGATGATCT CGGGCACGGA CGCGGTGCGG CTCGTGTTCG GCGAGGCGGA CGGGTTGCCG GGGCTCATCG TCGATTACTA CGTCGCGGCG CGAGGCGCCG CCCACACCGG CGACGCCGCC GCGCGCGCGG CCGAAGGCGG CGCGGCCGCG GCGGGCGACG GCGAGGGTCG CGGCCAGCTC GTCTGCCAGT TCATGGCGGC GGGCGTCGAG CACTGGAAGG GCGCGATCGT CGCGGCGCTC GTCGCGGCCA CCGGCTGCCC GAACGTCTAC GAGCGCTCGG ACGTGTCGAT CCGCGAAAAG GAAGGGCTCG AACAGACGAC CGGCGTGCTC GCGGGCGACG CGCCGCCCGA CACGCTGATC GCGAACGAGA ACGGCGTGCT GTATCACGTC GACGTGCGCA ACGGCCACAA GACGGGCTTC TACGTCGACC AGCGCGAGAA CCGCGCGCTC GTCGCGCAGT ACGCGCGCGA TCGCGACGTG CTGAACTGCT TCTGCTACAC GGGCGGCTTC TCGCTCGCGG CGCTCAAGGG CGGCGCGAAG CGGGTCGTGT CGATCGATTC GTCGGGCGAC GCGCTCGCGC TCGCGCAGCG CAACGTCGCC GCGAACGGCT TCGACGCCGC GCGCGCGCAA TGGCTCGACG CCGACGCGTT CAAGACGCTG CGCCGCCTCG TCGACGAAGG CGAGCGCTTC GACCTGATCG TGCTCGATCC GCCGAAGTTC GCGCCGACGC GCGACAGCGT CGATCGCGCG GCGCGCGCGT ACAAGGACAT CAACCTGAGC GGTTTGAAGC TGCTGCGCCC GGGCGGCCTG CTGTTCACGT ACTCGTGCTC CGGCGCGATC GACATGGACC TGTTCCAGAA GATCGTCGCG GGCGCGGCGG CCGACGCGAA GGTCGACGCG CGCATCCTCA AGCGGCTCGG CGCGGGCGTC GATCATCCGC TGCTGACCGC GTTCCCCGAA GGGGAATATC TGAAGGGGCT GCTGTTGCAA ATCGCGTGA
|
Protein sequence | MHTVTLKPSK DKSLLRRHPW VYANAIDRVD GKPAPGATVI VRAHDGRFLA RAAYSPHSQI RLRVWSFDEN EPIDHAFFKR RVQRALAHRR AMISGTDAVR LVFGEADGLP GLIVDYYVAA RGAAHTGDAA ARAAEGGAAA AGDGEGRGQL VCQFMAAGVE HWKGAIVAAL VAATGCPNVY ERSDVSIREK EGLEQTTGVL AGDAPPDTLI ANENGVLYHV DVRNGHKTGF YVDQRENRAL VAQYARDRDV LNCFCYTGGF SLAALKGGAK RVVSIDSSGD ALALAQRNVA ANGFDAARAQ WLDADAFKTL RRLVDEGERF DLIVLDPPKF APTRDSVDRA ARAYKDINLS GLKLLRPGGL LFTYSCSGAI DMDLFQKIVA GAAADAKVDA RILKRLGAGV DHPLLTAFPE GEYLKGLLLQ IA
|
| |
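The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: the helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the listed gene sequence ends in TGA). Under those assumptions the reported Gene Length (1269 bp) and Protein Length (422 aa) are consistent with each other.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC content in percent; spaces between 10-base blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above (Start bp, End bp).
length = gene_length(188167, 189435)
print(length)                  # 1269
print(protein_length(length))  # 422

# First two 10-base blocks of the gene sequence, as a small demonstration.
print(gc_percent("ATGCATACCG TTACGCTCAA"))  # 45.0
```

Passing the full Gene sequence field to `gc_percent` should give a value that can be compared with the reported GC content of 71%, which the record presumably rounds.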