Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A3037 |
Symbol | |
ID | 4888046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2885642 |
End bp | 2886676 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640132973 |
Product | hypothetical protein |
Protein accession | YP_001064028 |
Protein GI | 126444227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATGCG ATCGAGGAGA CGTATCGATG GAACGAAGAT GGAAGCGCCG CCCGGCGGGA GCGAACTGGG GCGAATTCGG AGATGACGAC CAGAAGGGGA GGCTGAACTG GCTGACGGAG CGCAAGGTGC TGGAGGGGAC GGCGGAGGTG AAGGCGGGGA AGGTGTTCTC GCTGAGCCTG CCGCTGGACG TGCCGCGCGG CGGAGGGCTG AACGCGCGGC GGCAGCCGCC GCGGGTGATG GCGGCGCGGG CGGGGGGCAG GCCGTACTTC GGCTACCGCG CGGACGAATC GGCGCGCAAC GCGACGGACG TGGTGTGCGA CGACGCGTTC TGCGTGCACT CGCAGTATTC GACGCAATGG GATGCGCTGT CGCACGTGGG CGGGGTGTTC GACGCGGACG ACGACGGCCG TGCGGAGGTG GTGTTCTACA ACGGCTACCG GCTGGGGGAG CACGTGTTGG TGCCGAAGGA AGGGGACAGC GTGGGCGGCG CGCACGCGTT GGGGATCGAG GTGATGGCGC AGACCGGGGT GCAGGGGCGG GGGGTGCTGA TCGACTTGAG GCACCACTAT GGAGATGCGC GGCGCAAGGT GGGGTACGAG GCGCTGATGC GGGTGCTGGA TGCGGATCGG GTGGAAGTGG AGCGTGGAGA CATGGTGTGC GTGCACACGG GCTTCGCGGA GCGGTTGCTG GGGGAGGAGG CGGGGGCGTT GACGGGGGGC TGCGTGCTGG ACGGGGAGGA CGAAAGGCTG CTGAGGTGGG TGGACGAGAG CGGGCTGTCG GTGCTGGCGG CGGACAATCA CGCGGTGGAG GAGAGGCCGG GGGTGTTGAA GGCGAGGGAG AGGCCTGGGG CGCTGATGCC GTTGCACGAG CTGTGCCTGT TCAAGCTGGG CATTCACCTG GGAGAGCTGT GGAGGCTGAC GCCGCTGGCG CAGTGGCTGC GGGAGAAGGG GCGCAGCCGG TTTCTGCTGA CGGCGCCGCC GCTGCACATC CGCGGGCTGG TGGGCTCGCC CGTCAATCCG GTCGCGACGG TTTGA
|
Protein sequence | MRCDRGDVSM ERRWKRRPAG ANWGEFGDDD QKGRLNWLTE RKVLEGTAEV KAGKVFSLSL PLDVPRGGGL NARRQPPRVM AARAGGRPYF GYRADESARN ATDVVCDDAF CVHSQYSTQW DALSHVGGVF DADDDGRAEV VFYNGYRLGE HVLVPKEGDS VGGAHALGIE VMAQTGVQGR GVLIDLRHHY GDARRKVGYE ALMRVLDADR VEVERGDMVC VHTGFAERLL GEEAGALTGG CVLDGEDERL LRWVDESGLS VLAADNHAVE ERPGVLKARE RPGALMPLHE LCLFKLGIHL GELWRLTPLA QWLREKGRSR FLLTAPPLHI RGLVGSPVNP VATV
|
| |