Gene Information |
Locus tag | BURPS668_A0407 |
Symbol | |
ID | 4886709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 372485 |
End bp | 373705 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640130348 |
Product | hypothetical protein |
Protein accession | YP_001061413 |
Protein GI | 126443337 |
COG category | [S] Function unknown |
COG ID | [COG4461] Uncharacterized protein conserved in bacteria, putative lipoprotein |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCCGG TGAGTTTGTG GCCGCTGCCG CGTTCACCAT GGCTACCGCC GCGCCGCGCC GCGCCGCGCC GCGCCGCGCC ACGCTATGCC AAGCCAAGCC AAGCCAAGCC AAGCCGCCAA TGTCGTCGCT GCTACCGCTA CTGCCACCGC CACGGCATCG ACATCGACAT CGACTCTGGC AACGGCAACG GCAACGGCAA CGGCAACAGC AACAGCAACG GCAACAGCAA CGGCAACAGC AACAGCAACA GCAACAGCAA CAGCAACAGC AACAGCAACG GCAACGGCAA CAGCAACAGC AATAGTTCCC GCCACCGCTA TCATCCCCAC CTTCGCCGCA CACCATATGA ACGTTCATCG AATTACCCAC TCGAAGTACC GCAAGGAGGA AGCATGTCGA CGAACATGAA ACGACTGATG ACCGCGGCGC TCGGCGCCGC ATTGGCGTTC GGCGCGCTGT CCGCGCGCGC CGCGAGCTTC GACTGCGCGC ACGCCGCGAA CGCCGCCGAG CGCGCGATCT GCGGGACGCC CGCGCTCGGC GAGTTGGACG TTCGAATGGC CGCGTACTAC GAAATACTGC AGAACGCGCG GCCGGCCGAC GAGGGCATGG CGTATCGCGA GTTCCGCGAC GCGCTGCGCG ACGAGCAGCA GCGCTGGCGG CAGCGCACGC GCGATGCGTG CGGTGCGCGG ATCGATTGCC TGACGAACGC CTATACCGCG CGGATCGCCG CGCTGCGCGG CGTCGCCGCC GAGCGGCTCG TGCTGCGGAT GACGGGTGGG AGCGCGGCGT CGGCAGGCGC CGCCGACGCG ACGTACGCGA TCGAAGGCGA GTCGATCACG CTCGCGAACG GCGAATCGGT GCGTCCGGCC GCGCCGGGCT CGGCGATGAA GCGCGTGACG ACGCTCGTCG CGCGGAGCGC TGTGGCGACG ATCGCCGGCC GTCCGGTCGA GGCCGTCCTG CTGAGCGACG ATCCGGGCGG CAGCGGCCGG TTCCTGTATG TGGCGACCGC GCAGCCGGGC GGCGGCGCGC CGGCGGTGCT GCTCGGCGAT CGGGTAAAGC CCGTGTCGGT GTCGATCGAG CGCGCGGCGA CGGGCGGCGC GGTTGTCGTC GTCGAATATC TGGATCGTCC GGAAGGCGCG CCGTTCGCGC AGGCGCCGAC GATCAAGATC GTCCGGCGCT TCGCGCTGGA GCAAGGCCGG CTCGTCGAGC AGCGCGGGTA G
Protein sequence | MFPVSLWPLP RSPWLPPRRA APRRAAPRYA KPSQAKPSRQ CRRCYRYCHR HGIDIDIDSG NGNGNGNGNS NSNGNSNGNS NSNSNSNSNS NSNGNGNSNS NSSRHRYHPH LRRTPYERSS NYPLEVPQGG SMSTNMKRLM TAALGAALAF GALSARAASF DCAHAANAAE RAICGTPALG ELDVRMAAYY EILQNARPAD EGMAYREFRD ALRDEQQRWR QRTRDACGAR IDCLTNAYTA RIAALRGVAA ERLVLRMTGG SAASAGAADA TYAIEGESIT LANGESVRPA APGSAMKRVT TLVARSAVAT IAGRPVEAVL LSDDPGGSGR FLYVATAQPG GGAPAVLLGD RVKPVSVSIE RAATGGAVVV VEYLDRPEGA PFAQAPTIKI VRRFALEQGR LVEQRG
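The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence ends in TAG). The helper names are hypothetical and are not part of any database tool.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 372485
END_BP = 373705
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assertion failed for another record, the likely causes would be a different coordinate convention, a partial CDS, or a spliced gene, none of which apply here according to the fields above.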