Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0861 |
Symbol | |
ID | 3846429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1005686 |
End bp | 1006750 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637838164 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_439058 |
Protein GI | 83717467 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0186818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACA CAGTTCACGC GGCGCTCGCC GCCCTCACCG ACACCCGCAC GCTATCCGGC GTCGATCTTT CGGATGCCGA TCTTTCCGGC CTCGACCTGT CCGGCTGCAC GCTCCACCGC GTGATCCTGC GCGGCGCGAA CCTGTCGGCC GCGCAGCTCG ACGCGACGCG CTGGCTGCAT TGCGACCTGA CGGGCGCGCG CGTCGACGGC GCGACGCTCG GCGAATCGAG CTGGCATGCG GTCGCGCTGC GCGGCGCGAG CCTGCGCGCG ACGACGGGCG ACGCGTTCGC GATGACGGAC GCCGACCTCG GCGGCGCGAC GCTCACCGAC GCGCTGTGGG CGCGCGCGAC GTTCGAGCGC GTCGATTTCT CCGCCGCGCA GTGCGCGCGC GCGAAGCTGC TGCGCTGCGA GGCCGCCGAT TGCCGCTTCG AGCGCACCGA TTTCTCGAGC GCGGAGCTCG AGCGTTTCTC GGCAATGCGC GCCGACCTGT CGAGCGCGCG CTTCGACGCC ACTCGCCTGA CGAATGCGCT CTTGTGCGAA GCGGATCTGC GCGGCCAACG CTTCGCGCGC TGCGATCTGA CGATGACGCA TTTGAACGGC GCGACGCTCG CCGGCAGCGA TTTCAGCGGC ACATCGCTCG TGCAGACGAT GTTCTTCGCA GCCGATCTCG AAGGCGCGAC GCTCGCCGGC GCGCGCGGTC GCCATGTGCG CTTCGCGGAC GCGACGCTTG TCGGCGCGCG CCTCGCCGAG GCCGTGTTCG ACGAATGCGA TTTCGCGCGC GCGCGATTGT CGTCGGCGAA CGCGCGCGGG CTGCGCGCGC GGATGTCGCT GTTCTCGCAC GCCGATTGCG CGGGCGCGAC GCTCGCGGGC GGCCACTTCG TCTACTGCGA CTTCTCGCAC GCGACGCTGT CGCGCGCCGA CTGCACCGAC GCCGATTTCT CGCACGCGAA CCTGCACGGC ATCGACGATC GCGCCGCCCG CTGGGACGGC GCGTGCAAGA CAGGCGCTTG CGCGACCGAT CCCACACTCG CGCTGGCCGA ACGATGGACC GCGCCCGAAC GATGA
|
Protein sequence | MSDTVHAALA ALTDTRTLSG VDLSDADLSG LDLSGCTLHR VILRGANLSA AQLDATRWLH CDLTGARVDG ATLGESSWHA VALRGASLRA TTGDAFAMTD ADLGGATLTD ALWARATFER VDFSAAQCAR AKLLRCEAAD CRFERTDFSS AELERFSAMR ADLSSARFDA TRLTNALLCE ADLRGQRFAR CDLTMTHLNG ATLAGSDFSG TSLVQTMFFA ADLEGATLAG ARGRHVRFAD ATLVGARLAE AVFDECDFAR ARLSSANARG LRARMSLFSH ADCAGATLAG GHFVYCDFSH ATLSRADCTD ADFSHANLHG IDDRAARWDG ACKTGACATD PTLALAERWT APER
|
| |