Gene Information |
Locus tag | BURPS668_A0772 |
Symbol | |
ID | 4887832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 738997 |
End bp | 740019 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640130712 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001061771 |
Protein GI | 126442488 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.11713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATGA CGACGCGGCC GCAGGCGAAC GCGAAGGCGC GCGCGCGCAA GGCGGCCGCG CGCGGGGAGC CGGCGCGCGG CGCGCTGCTG CGCGTCACGC ACGATACGCG CTATCGATAC GCGGCGCGCG TCGAATCCGC GCAGCATCAG GCGCGTCTGC GCCCGCTCGA GACGCCGCGG CAGCGCGTGA TCGAGTTCTC GCTCGAGATC GAGCCGCGTG CGGAAGGGCT CGTCGTCGAC ATCGATTCGT TCGGCAACGA GCGCGCTTCG TTCGCGCTCA ACCAGCCGCA CGAGGAGCTG TTCGTGCGCA GCCGCAGCGT CGTGCGCGTC ACGCCGCCCG CGCTGGCGGC GGGCAAGCGC GGCGAGCCGC CGCCCGCCGT CGCTGCGCCG CGCGACGGCT GCGCGAGCGC GTGGGAAGCG GTGCGTGAGC GCCTGACGTT TCGCGCCGGT CGCCCGTTCG ATCCGGCGAG CGAATTCGTG TTCGCTTCGC CGCACGTCGC ATGCCACTCC GATCTCGCCG CCTATGCGGC GGCGAGCTTC ACGCCGGGCC GGCCGCTCGT GCAGGCCGCG TGGGAGCTGA TGCGCCGCAT CCACGCGGAT TTCGCGTATG CGCCGAACAG CACCGACGTC GGCACGACCG CGCTCGATGC GCTCGTGCTG CGCCAGGGCG TGTGCCAGGA TTTCGCGCAC GTGATGATCG GCGCGCTGCG CTCGCTCGGG CTTGCCGCAC GCTACGTGAG CGGCTATCTG CTGACGCAGC CGCCGCCCGG GCAGCCGCGA TTGATCGGCG CGGACGCATC GCATGCGTGG GTCGAGGTCT ACGATCCCGC GTGGCCCGAG GACGGTGGCT GGCTGCCGCT CGATCCGACC AACGATCGCG CGCCCGGCGA CGATTACGTG ATGCTGTCGA TCGGCCGCGA CTACGCGGAC GTGACGCCGT TGCGCGGCGT CATTCGCGGC GGCGGGGCCG ATCAGGTGCT GACGGTCGGC GTGACGGTGG AGCCGCTCGA TTCGGTATCG TGA
|
Protein sequence | MGMTTRPQAN AKARARKAAA RGEPARGALL RVTHDTRYRY AARVESAQHQ ARLRPLETPR QRVIEFSLEI EPRAEGLVVD IDSFGNERAS FALNQPHEEL FVRSRSVVRV TPPALAAGKR GEPPPAVAAP RDGCASAWEA VRERLTFRAG RPFDPASEFV FASPHVACHS DLAAYAAASF TPGRPLVQAA WELMRRIHAD FAYAPNSTDV GTTALDALVL RQGVCQDFAH VMIGALRSLG LAARYVSGYL LTQPPPGQPR LIGADASHAW VEVYDPAWPE DGGWLPLDPT NDRAPGDDYV MLSIGRDYAD VTPLRGVIRG GGADQVLTVG VTVEPLDSVS
|
| |
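The length fields in this record follow from the coordinates and the sequence: gene length is End bp - Start bp + 1, and protein length is the number of codons minus the terminal stop codon. The Python sketch below is one way to re-derive these values and to check the protein against the gene sequence. It is illustrative only, not the database's own pipeline. The function names (`clean`, `gene_length`, `gc_percent`, `translate`) are hypothetical helpers introduced here. The codon assignments are those of NCBI translation table 11, which shares its codon-to-amino-acid mapping with the standard code. The sketch assumes the CDS starts with ATG, as this one does, so alternative start codons are not handled.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
# Amino-acid string for translation table 11 (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-letter grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts with ATG; alternative start codons
    (e.g. GTG, TTG) are NOT converted to M here.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(738997, 740019))
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGGGATGA CGACGCGGCC GCAGGCGAAC"))
```

For this record, `gene_length(738997, 740019)` gives 1023, and a 1023 bp CDS with one stop codon corresponds to 1023 / 3 - 1 = 340 residues, matching the listed protein length. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the listed protein sequence and GC content.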