Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3906 |
Symbol | |
ID | 4882942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3803668 |
End bp | 3804816 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640129834 |
Product | hypothetical protein |
Protein accession | YP_001060900 |
Protein GI | 126438984 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2856] Predicted Zn peptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGGA TCGAATTTAT TAACCCGGAC AGAATTGCAT GGTGTTGTGC GGATCGACGC ATTACGCCAG ACGAACTCGC GTCCGAACTG AACATCGCCC CGGCGACGAT CGATGGAGTC GTTCAGGGCC GGGTTGGTAT GACGTTCAAC CAGTTGAGCA AGGTCGCAAC CTACTTTGGG CGAGGGGTGC TTTTCTTTCT GGAACCCGGG CCCGTGAATG AGGAAGCGGT CCACAGTTCG GCGTTTCGGA CGTTGGCCAA CCAAAAGCCG GAGCTCTCGG GAAAATTGAA AGGTCTCATC GAGAGAGTTG AGCGGCAGCG CGATGTCTAC GTTAGTTTGC GTGAGGACTT GGACGAAGCG GCGCTGCCAA TTTTCGCACC GCCTGCGCTA CCGGACGATA ATCCGTCCGA GGCTGCACGG ATTGTGCGCG ACTGGCTGAG GCTCGGTGAA ACAAACGACT TTGACTCTTA CAGAAGCGCT GTTGAGAGTC GCGGAATCTT GGTGTTTCGG AGCAACGGCT ACGATGGGAA GTGGCAGATT GCGAAGGAGA GTCCGATCCT AGGTTTCAGC ATTTATGATG CTGAATGTCC TGTCATTGTC GTGAAGAAAC AATTTTGGGA ATCGCGACAG TGCTTTACGC TCATGCATGA GCTTGGCCAT TTGCTCCTGC ATCGAGATAG CTCGATCGAC GATCAGCAGG ATATGTCTTC GTACCAAGGC CGCGAGCGCG AAGCCAATGC ATTTGCGGGG CATTTGCTTG TCCCAGACGA TTTGCTGGCT CTTGTTGACG ATGACGCGCG CCCGCAGAAT GTTGACGATT TCGATAGTTG GCTACAGCCT TGGCGGAGAG CTTGGGGCGT CAGTGGGGAA GTTATCTTAC GGCGGTTAAT GGACAGCGGG AGGCTTGCGC AGTACCAATA TCAAGCCTAT CGAGAATGGA GCGATAATCT TCCGATAGTG CAAGACGATG GTGGCACAAG AAAGTATCGT CATAGAGAGC CAAAGCATAT TTTTGGCGAC TATTTTGTAC GTGCCGTTTT GGACTCTCTC AATGCTCGGA ATATCTCGCT GGCACGTGCA AGTAGCTATT TGGACGGCCT GAAAGTCAAC GATCTGCATC AGTTGGAGCA ATACTATGCA GGCGTTTGA
|
Protein sequence | MERIEFINPD RIAWCCADRR ITPDELASEL NIAPATIDGV VQGRVGMTFN QLSKVATYFG RGVLFFLEPG PVNEEAVHSS AFRTLANQKP ELSGKLKGLI ERVERQRDVY VSLREDLDEA ALPIFAPPAL PDDNPSEAAR IVRDWLRLGE TNDFDSYRSA VESRGILVFR SNGYDGKWQI AKESPILGFS IYDAECPVIV VKKQFWESRQ CFTLMHELGH LLLHRDSSID DQQDMSSYQG REREANAFAG HLLVPDDLLA LVDDDARPQN VDDFDSWLQP WRRAWGVSGE VILRRLMDSG RLAQYQYQAY REWSDNLPIV QDDGGTRKYR HREPKHIFGD YFVRAVLDSL NARNISLARA SSYLDGLKVN DLHQLEQYYA GV
|
| |