Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2826 |
Symbol | |
ID | 4904996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2766340 |
End bp | 2767599 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640145929 |
Product | substrate-binding repeat-containing protein |
Protein accession | YP_001076855 |
Protein GI | 126457855 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATC CGGCGGGGCG GACGACGGCT TGGGAATATG ACGCGTATGG CAGTTTGCTT GTGAAGACGT TGCCGGATGG CAGCGCAGTC AGAACGGAAT TTGACCTCGA TCACCGACCG GTCTGCATGA CGTTGATAGG CGGCCGGCAG TGGGGCTACG AGTGGGATAC GTTCGGTAAT CTGCTCGCGC AGATCGATCC ATCGGGGGCG ATATCTCGCT ATACCTATGA CGAGTACGGC CAGCTTGTTG AGCATACTGG GCCGCGTGGT GCGAGCACAC TGTTCGATTA TCACCCGGAC GGCAATCTCG CGGCGCAGAT CGATGCGTTG GGGCATCGCA CGCAGTATCG GTACGATGCG CGCGGCTACC TCGTCGAAGC GATCGATGCG CTCGGACAGC AAAGCCAATA CGAGTACGAC CGCAACGGCC ATCTGACGCG CGCAATCGAG CCGGGCGGGC GTGAGATTCA CTGTGCGTAC GACGCCGATG GAAATCTGTC TCGCCATCGT GACCCCATGG GCCACGTGAC GCAGGTGGAG TACTCGGCGC TCGGACAGGT CAGCAGACGG CTCGCGCCCG ACGGCACCAC CGTTGAATAC CGCTACGACA GCCACATTAC CAGCGCGGGA TTCCGAACGC GGCCCATCGG TCGGCTGCCG ATGTTCGCGT GCCAGACTTG CCGGCGCTAC TTCAGGCGCA CGGCCGCCCC CCCACTCGGC GAGAAACATC TCAAGAAACT CGATCTATTC GTGTCCTTGC TGTCGCATCC GATCTCGTGC GTTGATGCGG GCGAACAGAT GGGCAGCCTA TCGACCGACA TCGGGAAACG CGTGACGGCC TGGCGCGCGT GGCTGTTGGA GCTCGACCCG AGCGGCAAGT GGGAGCGCCG CGTGAGGCTC AGCCATCGAC CTCCGCATTG CCCGAACTGC GGCAGTCACC AGACGCGTTT CGATGAATGC TCGAACGGCG CCTTCCCACG GTTCAAATGC GCGAATTGCG GGACCAAATT CACCCGACGC CGCGGCACGC CGTTCGTCAA TGCGAAGATG AGTTCGCCCG AGCGCATGCG CCTGGTCATT CGGCGCCTGT CGCTGCCGTT GTTGGTCATG CAGGTGGCGG ACCTTGTCGG CACGAGCCAT GGGATGGTCC GGAAATGGCA CAGCATGTTC ACCGATTTTG CGGATCGGCT CGAACCGAGT GGCAGTCTTT CAGCGCGGAT CAGGTTGCGC TCGAACTCTG CCAATGCGCC GAACAAATGA
|
Protein sequence | MIDPAGRTTA WEYDAYGSLL VKTLPDGSAV RTEFDLDHRP VCMTLIGGRQ WGYEWDTFGN LLAQIDPSGA ISRYTYDEYG QLVEHTGPRG ASTLFDYHPD GNLAAQIDAL GHRTQYRYDA RGYLVEAIDA LGQQSQYEYD RNGHLTRAIE PGGREIHCAY DADGNLSRHR DPMGHVTQVE YSALGQVSRR LAPDGTTVEY RYDSHITSAG FRTRPIGRLP MFACQTCRRY FRRTAAPPLG EKHLKKLDLF VSLLSHPISC VDAGEQMGSL STDIGKRVTA WRAWLLELDP SGKWERRVRL SHRPPHCPNC GSHQTRFDEC SNGAFPRFKC ANCGTKFTRR RGTPFVNAKM SSPERMRLVI RRLSLPLLVM QVADLVGTSH GMVRKWHSMF TDFADRLEPS GSLSARIRLR SNSANAPNK
|
| |