Gene Information |
Locus tag | BURPS668_A1616 |
Symbol | |
ID | 4888752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1545210 |
End bp | 1546160 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640131555 |
Product | putative hydrogenase, membrane subunit |
Protein accession | YP_001062612 |
Protein GI | 126442527 |
COG category | [C] Energy production and conversion |
COG ID | [COG0650] Formate hydrogenlyase subunit 4 |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGCG CATCCGGCAT CCTGTCGCAG ACGCTCGAGA TACTCGTCGC GCTCGCAGCC GCGCCGCTGT TGACGGGCTG GGTCAACCAG TGCCGCGCGT GGCTGCAGAA CCGCCGCGCG CCGAGCATCT GGCAGCCGTA CCGGATGCTG CACAAGCTGT TCAACAAGGA ATCGGTGGTC GCGGAGCACG CGAGCCCGCT CTTTCGCGGC GCGCCGTACG TCGTCTGGGC GGCGATGGCG CTCGCGTGCG CGATCGTGCC GACGCTGTCG ACCGAGCTGC CGTTCTCGCC CGCGGCCGAC GCGATCGCGC TCGTCGGCCT GTTCGCGCTC GCGCGCGTCG CGCTCTCGCT CGCGGCGATG GACATCGGCA CCGCGTTCGG CACGCTCGGC GCGCGCCGCG AGATGCTGAT CGGCTTCCTC GCGGAGCCGG CGCTCCTGAT GGTGCTGTTC TCGGCGTCGC TGATCACGCG CTCGACGCTG CTGACGAGCA TCGTCGCCGC GCTCGGCCAC CGCGAGCTCG CGATCTATCC GAGCCTCGCG TTCGCGGGCA TCGCGTTCAC GCTCGTGTCG CTCGCGGAGA ACGCGCGGCT GCCGGTGGAC AACCCCGCGA CGCACCTCGA GCTGACGATG ATCCACGAGG CGCTGATCCT CGAATACTCG GGCCGCCATC TCGCGCTGAT GGAATGGGCG GCAAGCCTGA AGCTCTTCGC GTATTCGTGC ATCGGCCTCG CGCTCTTCAT GCCGTGGGGC ATCGCCGAGG CCGGCAGCCC GCTCGCGTTG CTGCTCGCGC TGCCGGCGCT CTTCGTGAAG TTGCTCGTCG GCGGCGCGGC GCTGGCGGTG GTCGAGACGA CGAACGCGAA AATGCGCCTC TTTCGCGTGC CCGAATTCCT CGCGAGCGCG TTCCTGCTCG CGGTGATCGG CATGCTCGTC CATTTCCTGC TGGGGGCGTA G
Protein sequence | MVSASGILSQ TLEILVALAA APLLTGWVNQ CRAWLQNRRA PSIWQPYRML HKLFNKESVV AEHASPLFRG APYVVWAAMA LACAIVPTLS TELPFSPAAD AIALVGLFAL ARVALSLAAM DIGTAFGTLG ARREMLIGFL AEPALLMVLF SASLITRSTL LTSIVAALGH RELAIYPSLA FAGIAFTLVS LAENARLPVD NPATHLELTM IHEALILEYS GRHLALMEWA ASLKLFAYSC IGLALFMPWG IAEAGSPLAL LLALPALFVK LLVGGAALAV VETTNAKMRL FRVPEFLASA FLLAVIGMLV HFLLGA
| |
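The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence includes the terminal stop codon (the listed sequence ends in TAG). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces and other non-ACGT characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
length = gene_length(1545210, 1546160)
protein = expected_protein_length(length)
print(length, protein)  # 951 316
```

Passing the full Gene sequence field to `gc_fraction` (the spaces between 10-base blocks are skipped) gives a value that can be compared with the rounded GC content field.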