Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3717 |
Symbol | |
ID | 4883490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3641410 |
End bp | 3643293 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640129645 |
Product | thiol:disulfide interchange protein, putative |
Protein accession | YP_001060721 |
Protein GI | 126439727 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAACC GTATTCCGCG TCACGCGCAG CCCCGCTTTC GCTTCCTGAT CGCTGTCGTC GCGATGCTCG GCGTGCTGTT CGGCACGTCG CTCGCGGCGC GCGCGGCGGA GGACTTCCTC GATCCCGCGG TTGCGTTCAA ATTCAGCGCG AGCGAGGCGC CGGGCCAGGT CGACGTTCAT TTCAAGATCG CCGACGGCTA TTACATGTAT CGCGAGCGCT TCGCGTTCGC GGTGAAAAGC GGCTCGGCCA CGCTTGGCGA GCCGCAACTG CCGGCCGGGC ACGTCAAGTT CGATCCGACG TTCCAGAAGA ACGTCGAAAC CTACCGAGGC GATTTGACGA TCCATCTGCC GATCAAGCAG GCATCGGGAC CGTTCGAGCT CGCCGTGACA TCGCAGGGGT GCGCGGACGA AGGGATCTGC TATCCGCCTG CCGAGCATGT CGCGCGCATC GAGGGCGCGG CGCTCGGCGC CGCCGGCACC GCGCCCGCCG CGGCGGGCGC CGGTGCGGAC ACGTCCGCGG CCGACGGCGG CAGTTGGTAC GAGCGCGTGA CGAGCGCCGA CTACGCGCGC TCGCTGCTCG AAGGCCACGG ATTCCTGACG ATCGTCGCCC TTTATTTCGT GGCCGGCATG GTGCTGAGCC TGTTGCCGTG TTCGTATCCG ATGATCCCGA TCCTGTCGGC CATCATCGTC GGCGAAGGCG CACGGGCGAC ACGTGCGCGC GCCTTCGCGC TGTCGCTCAC CTATGTGATC GGCATGGCAC TCGTCTATAC GGCGCTTGGC GTCGCAGCCG CGCTCGTCGG GCAGAGCCTC GGCGCGTGGT TGCAGAATCC GTGGGTGCTC GGCGCGTTTG CGTTGCTGCT CACCGTGTTC GCGCTGCTGC TGATCGGCGG CGTCGACATC ACGCTGCCGC AGCGCTGGCA GAACGGTGCC GCGCAGAAGA GCGGGCCGCG CAAGGGCGGC CGCTTCGCAG CCGTCGCAAC GATGGGGGCG CTGTCCGCGC TCGTCGTCGG CGCCTGCATG ACCGCTCCGC TTTTCGCCGT GCTCGCGTTC ATCGCGCATA CCGGCAGCGC ACTTCTCGGC GGCGCCGCGC TCTTCTCGAT GGGGATCGGC CTCGGCGTGC CGCTGCTCGT CATCGGAATC GGCGCCGGGA CGTTGCTGCC CCGCGCCGGC GCATGGATGG ACGGCGTGAA GGTGTTTTTC GGCGTCCTGT TGCTTGCCGC CGCGCTATGG ATCGTCTGGC CGGTGCTGAA CGCCGCGTCG CAGCTTGGCC TGGGGGCGTT GTGGCTACTG ATCGCCGCCG CCGCGCTCGG GCTCTTCACG CCGCATTCGG GCTCTTCGTC GGTCTGGCGC CGCCTTGGGC GCGGGCTCGG CGCCGCGCTC GCGATCTGGG CGGCGACGCT CCTCGTCGGT CTCGCGGCGG GGTCCACCGA TCCGTTGCGT CCGCTTGCCG TACTCGCGGC GCGCGCGGCG CCGAGCAACG GCACCGCCGG TGCAGGTGCC GGCGCGCACG AAGGGCCGGC GTTCGCGCCG GTGCGCTCGA TCGCGGAGCT CGACGAGATC GTGAAGACGT CGACCCGGCC GGTGATGCTC GATTTCTATG CGGACTGGTG CGTGAGCTGC AAGGAGATGG AGCATCTGAC GTTCACCGAT GCGCGCGTCG GCGCGCGGCT TTCGCAGATG CATCTCGTGC GTGCGGACGT CACGGCGAAT TCGCCGGACG ACCAGGCGCT GCTGAAGCGT TTCGGCCTGT TCGGGCCGCC CGGCATCATC GTGTTCGATG CAAACGGGCA GGAGCGGGGG CGCGTCGTCG GTTATCAGTC GGCCGATCGT TTCCTGCGCA GCCTCGACCG GATGTCGTTG CCGGCCGCAT GGTCGGCTTC GTGA
|
Protein sequence | MFNRIPRHAQ PRFRFLIAVV AMLGVLFGTS LAARAAEDFL DPAVAFKFSA SEAPGQVDVH FKIADGYYMY RERFAFAVKS GSATLGEPQL PAGHVKFDPT FQKNVETYRG DLTIHLPIKQ ASGPFELAVT SQGCADEGIC YPPAEHVARI EGAALGAAGT APAAAGAGAD TSAADGGSWY ERVTSADYAR SLLEGHGFLT IVALYFVAGM VLSLLPCSYP MIPILSAIIV GEGARATRAR AFALSLTYVI GMALVYTALG VAAALVGQSL GAWLQNPWVL GAFALLLTVF ALLLIGGVDI TLPQRWQNGA AQKSGPRKGG RFAAVATMGA LSALVVGACM TAPLFAVLAF IAHTGSALLG GAALFSMGIG LGVPLLVIGI GAGTLLPRAG AWMDGVKVFF GVLLLAAALW IVWPVLNAAS QLGLGALWLL IAAAALGLFT PHSGSSSVWR RLGRGLGAAL AIWAATLLVG LAAGSTDPLR PLAVLAARAA PSNGTAGAGA GAHEGPAFAP VRSIAELDEI VKTSTRPVML DFYADWCVSC KEMEHLTFTD ARVGARLSQM HLVRADVTAN SPDDQALLKR FGLFGPPGII VFDANGQERG RVVGYQSADR FLRSLDRMSL PAAWSAS
|
| |