Gene Information |
Locus tag | BURPS1106A_1894 |
Symbol | |
ID | 4900952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1849178 |
End bp | 1850209 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640135123 |
Product | TadE family protein |
Protein accession | YP_001066158 |
Protein GI | 126453566 |
COG category | [S] Function unknown |
COG ID | [COG4655] Predicted membrane protein |
TIGRFAM ID | |
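The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1849178, 1850209)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Both results agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields listed above.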
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.321739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAGC GCCGGGCCGC CTTCGCGCGA CGCCGCGAGC GCGGCGCGGT CATGCTGTGG TTCGTGCTGT TCCTGCCGGT GCTGCTGCTC TTCGGCGCGT TCGCGATCGA CCTGCCGCGC GTGGCCGCCG CGCGCAACGA ACTGCAGAAC GCGGCGGATG CGGCCGCGCT CGCCGGGGCC GCGTCGCTCG AAGCGGGCGC CGGCGCGCCC GCGTGGGCGG CGGCCGCGAG CGCGGCGGCC GCGGCGCTTT CGCTGAACGC GTCCGACGGC GCGGCGCTAT CGAGCGGCGA CGTGCAGACG GGCTACTGGA ACGTGACGGG CGTGCCCGCC GGGCTCGAGC CGACGACGCT CGCGCCCGGC GAGTACGACG TGCCCGCCGT GCAGGCCACC GTCACGCGCG CGCCGAACCA GAACGGCGGG CCGCTCTCGC TGTTGATGGG CGGCTTGCTC GGTCTCGTCG GCACGCCCGC CGCGGCGACG GCGGTCGCGG TCGCCGGCGC GCCGGCGACG GTCGGCGCGG GCGGGCTCTT TCCGATGGTC ATCGATCAAT GCGTGCTCGA TCAGTACTGG GACGCGCGGG CGGGCGCGCC GCGCGTGGAT CCGACGACGG GCGCGCCGTA CGAGTTCCAG GTCGGCAACG GCCGGACGTA CGGCGGCACC TGCTATGCCG GCCAGTGGAC GACGTTCCTC GTCAACGCGA ACGACGTGCC GACCGTGCGA GGCCTGATGG CCCACGGCAA CCCGACGCCG CTTTCGATCG GCGACAGCAT CTGGATCGAG CCCGGCGTGA AGACCGCGCT CTATTACGAC GTGCCGGTCG GCGTGACGGT CGTCGTGCCG GTCGCCACGC AGATCAGCAG CAAGACGTAC GTGCCGGTCG TCGCGTTCGC CGCGTTCTAC GTCGACGCGT CGGACGGCGC GAACCTGAAG GCGATCACCG GCCACTTCGT CGGCGGCTAC AAGATTCCCG CGAGCGCGAG CGGCATCGGG CCCGCCTACG GCGCGTACGT CGCGCCTCGC CTCGCATACT GA
|
Protein sequence | MNERRAAFAR RRERGAVMLW FVLFLPVLLL FGAFAIDLPR VAAARNELQN AADAAALAGA ASLEAGAGAP AWAAAASAAA AALSLNASDG AALSSGDVQT GYWNVTGVPA GLEPTTLAPG EYDVPAVQAT VTRAPNQNGG PLSLLMGGLL GLVGTPAAAT AVAVAGAPAT VGAGGLFPMV IDQCVLDQYW DARAGAPRVD PTTGAPYEFQ VGNGRTYGGT CYAGQWTTFL VNANDVPTVR GLMAHGNPTP LSIGDSIWIE PGVKTALYYD VPVGVTVVVP VATQISSKTY VPVVAFAAFY VDASDGANLK AITGHFVGGY KIPASASGIG PAYGAYVAPR LAY
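The sequences above are printed in space-separated blocks of 10 characters, so the blocks need to be joined before any calculation. The following is a minimal sketch of how a GC percentage such as the GC content field could be recomputed. The helper names are hypothetical, and the input shown is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAATGAGC GCCGGGCCGC"
print(gc_percent(first_blocks))  # 70.0
```

Passing the complete Gene sequence value to `gc_percent` would give the figure to compare against the GC content field; that result is not computed here.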