Gene Information |
Locus tag | BURPS1106A_A0681 |
Symbol | |
ID | 4903980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 664712 |
End bp | 665764 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640143787 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001074717 |
Protein GI | 126456820 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
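The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the record lists the gene as not spliced).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp as listed in the record.
print(cds_lengths(664712, 665764))  # (1053, 350)
```

Under those assumptions the result matches the listed Gene Length (1053 bp) and Protein Length (350 aa).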
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATGA CGACGCGGCC GCAGGCGAAC GCGAAGGCGC GCGCGCGCAA GGCGGCCGCG CGCGGGGAGC CGGCGCGCGG GGAGCCGGCG CGCGGGGAGC CGGCGCGCGG CGCGCTGCTG CGCGTCACGC ATGATACGCG CTATCGATAC GCGGCGCGCG TCGAATCCGC GCAGCATCAG GCGCGTCTGC GCCCGCTCGA GACGCCGCGG CAGCGCGTGA TCGAGTTCTC GCTCGAGATC GAGCCGCGCG CGGAAGGGCT CGTCGTCGAC ATCGATTCGT TCGGCAACGA GCGCGCTTCG TTCGCGCTCA ACCAGCCGCA CGAGGAGCTG TTCGTGCGCA GCCGCAGCGT CGTGCGCGTC ACGCCGCCCG CGCTGGCGGC GGGCAAGCGC GGCGAGCCGC CGCCCGCCGT CGCCGCGCCG CGCGACGGCT GCGCGAGCGC GTGGGAAGCG GTGCGTGAGC GCCTGACGTT TCGTGCCGGC CGCCCGTTCG ATCCGGCGAG CGAATTCGTG TTCGCTTCGC CGCACGTCGC ATGCCACTCC GATCTCGCCG CCTATGCGGC GGCGAGCTTC ACGCCGGGCC GGCCGCTCGT GCAGGCCGCG TGGGAGCTGA TGCGCCGCAT CCACGCGGAT TTCGCGTATG CGCCGAACAG CACCGACGTC GGCACGACCG CGCTCGATGC GCTCGCGCTG CGCCAGGGCG TGTGCCAGGA TTTCGCGCAC GTGATGATCG GCGCGCTGCG CTCGCTCGGG CTTGCCGCGC GCTACGTGAG CGGCTATCTG CTGACGCAGC CGCCGCCCGG GCAGCCGCGA TTGATCGGCG CGGACGCATC GCATGCGTGG GTCGAGGTCT ACGATCCCGC GTGGCCCGAG GACGGTGGCT GGCTGCCGCT CGATCCGACC AACGATCGCG CGCCCGGCGA CGATTACGTG ATGCTGTCGA TCGGCCGCGA CTACGCGGAC GTGACGCCGT TGCGCGGCGT CATTCGCGGC GGCGGGGCCG ATCAGGTGCT GACGGTCGGC GTGACGGTGG AGCCGCTCGA TTCGGCGTCC TGA
|
Protein sequence | MGMTTRPQAN AKARARKAAA RGEPARGEPA RGEPARGALL RVTHDTRYRY AARVESAQHQ ARLRPLETPR QRVIEFSLEI EPRAEGLVVD IDSFGNERAS FALNQPHEEL FVRSRSVVRV TPPALAAGKR GEPPPAVAAP RDGCASAWEA VRERLTFRAG RPFDPASEFV FASPHVACHS DLAAYAAASF TPGRPLVQAA WELMRRIHAD FAYAPNSTDV GTTALDALAL RQGVCQDFAH VMIGALRSLG LAARYVSGYL LTQPPPGQPR LIGADASHAW VEVYDPAWPE DGGWLPLDPT NDRAPGDDYV MLSIGRDYAD VTPLRGVIRG GGADQVLTVG VTVEPLDSAS
|
| |
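The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, not part of the original record: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence shown above.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used here as a short example.
example = "ATGGGGATGA CGACGCGGCC"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq) * 100))  # 20 70
```

Applied to the full gene sequence, the cleaned length and GC percentage could be compared against the Gene Length and GC content fields in the record.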