Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A3211 |
Symbol | |
ID | 4888695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 3038168 |
End bp | 3039979 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640133147 |
Product | hypothetical protein |
Protein accession | YP_001064202 |
Protein GI | 126445213 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACGG GCGCCGTCGC GGCGCGGCGG CTTCCTCGCC GCGGCACGGC GCGCGGCCGG CGCGCCCCGT CGCGCGCGTT CACGATCCAC GGAGGTGACG AGATGCGCCG TTCGCGCCAG CCGAAGCCCG CGAAGCCCGC CGCCGTCCGG CGCCGCCCGC ACGGCGCGGC GCGCGCGCAC GCCATTCGCG GCAACGGTCC GCAGCGGCGC GCCGTGCGCG AACGCGCGGC CCGCCGCCTG TCGCGCGACA TCGACGCGGC GCTGCGCCGC GCGTCGACCT ACCCGCATCC GGCCGGCCCT ATCGTGCGCA TCGAAACGCA TCTCTCCGTC GTTTATCTCG TCGGGCGCTT CGCGTACAAG CGCCTGAAGC CGTTCGATTT CGGCTTCGCG AATTTCAGCG AACTCGCCGC GCGCCGCCGC GCGTGCGAAG CGGAGCTCGC GCTGAACCGC CCGCTCGCCG CGCCGATCTA TCTCGCCGCC GGCCCGCTCG TGCGCCGCGC GCGCGGCTTG CGCCTGTTCG GCGCGGGCGC GGCCGTCGAC CACGTCGTCC GGATGCGCCG CTTCGACGAG CGGATGCTGT TCTCCCGGCT GCTCGCGCGC GGCGCGCTCG ACGCGGCGGA CATCGATGCC GCCGCGACGC GCCTCGCCGC CTACCATCTG CACGCGCCGC GCGACATCCC GCGGCGCGCG TACGGCAGCG CGCGCGAGCT GCGCCGGCAG CTCGACGACA TGCTCGCGCC GCTCGAGCGC GCGCTCGGCG CGGCGCTGCC GGCGTCGCTG CGCGCGTGGT GCGTGCGGCG CTGCGACGAG CTCGCCGCGC ACCTGGACGC CCGGCGAGCC GACGGCTACG TCCGCGCATG CCACGGCGAT CTGCACCTGA ACAACGTCGT GAAGCGCGGC CGCGACGCGC TGATGTTCGA CTGCATCGAT TTCGACGACG CGCTGCGCTG GATCGACGTG ATCAACGATC TGTCGTTTCT GTTGATGGAT CTGCACGCGC ACGATCGCGC CGCCCTCGCG CACCGCCTGC TGAACCGTTG GCTCGACGAA ACGGGCGATT TCGCCGGCCT CGCCGCGCTG CCGCTGTATG TCGCGTATCG CGCGCTCGTG CGGGCGCTCG TCGCGACGAT GCGCGCGGGC GGCGACGCCG CGGCGTGCGC CGCGCGCATC GAGCGCGCGC GCCGGTACGT CGACGTCGCC GCGCACGCGG CCCGCGCGCG CCGCCCATGC CTGCTGCTGT GCCACGGCTA TTCGGGCTCG GGCAAATCGG TTGCGAGCCG CGCGCTCGCC GACGTGTCCG GTGCGATCCG GCTGTCGAGC GACAGCGAGC GCAAGCGCGC CCGACCGTTC GCGGCGGTCG ACGCGCGGCC GCTTCCCGCG AGCGCGTACA CGCCGCAGCA GATCGACGCG CAATACGAGC GCCTGCGCGC GCTCGCGCGC GACGTGCTGC GCGCCGGCTA CACGGCGCTC GTCGACGCGA CGTTTCTCTC GCATGCGCGC CGCGCACGCT TCTTCGCGCT CGCGCGCGAG CTGGGCGTGC CCGTGTACGT GCTCGATTTC CATGCGAGCC GCGCATGCCT CGAGCGGCGC GTCGATGCGC GCGCCGCCGC GCGCGACGAT CGTTCGGACG CGGGCGCGGC CGTGCTCGCG ACGCAACGCG CGAGCGCCGA TCCGCTCGAT GCCGACGAGC GCGCGCGCAC GATCGGCTTC GATACCGACG TGCCGCTCGC GACGCTCCGG TCGGCCGGCT ATTGGCGGCC GGTGCTCGAC GCGCTCGACG CCGCGCGGGT GGACGCGCAA GCGACGCGTT GA
|
Protein sequence | MTTGAVAARR LPRRGTARGR RAPSRAFTIH GGDEMRRSRQ PKPAKPAAVR RRPHGAARAH AIRGNGPQRR AVRERAARRL SRDIDAALRR ASTYPHPAGP IVRIETHLSV VYLVGRFAYK RLKPFDFGFA NFSELAARRR ACEAELALNR PLAAPIYLAA GPLVRRARGL RLFGAGAAVD HVVRMRRFDE RMLFSRLLAR GALDAADIDA AATRLAAYHL HAPRDIPRRA YGSARELRRQ LDDMLAPLER ALGAALPASL RAWCVRRCDE LAAHLDARRA DGYVRACHGD LHLNNVVKRG RDALMFDCID FDDALRWIDV INDLSFLLMD LHAHDRAALA HRLLNRWLDE TGDFAGLAAL PLYVAYRALV RALVATMRAG GDAAACAARI ERARRYVDVA AHAARARRPC LLLCHGYSGS GKSVASRALA DVSGAIRLSS DSERKRARPF AAVDARPLPA SAYTPQQIDA QYERLRALAR DVLRAGYTAL VDATFLSHAR RARFFALARE LGVPVYVLDF HASRACLERR VDARAAARDD RSDAGAAVLA TQRASADPLD ADERARTIGF DTDVPLATLR SAGYWRPVLD ALDAARVDAQ ATR
|
| |