Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0172 |
Symbol | |
ID | 4886520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 159528 |
End bp | 161408 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640130113 |
Product | hypothetical protein |
Protein accession | YP_001061178 |
Protein GI | 126442678 |
COG category | [S] Function unknown |
COG ID | [COG3519] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03359] type VI secretion protein, VC_A0110 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCGC AGTTTCTCGA TCACTACAAC CGCGAATTGA CCTACATGCG GGAGTTGTCC GCGGAATTCG CGGCGCAACA CCCGAAGATC GCGCGCCGAC TCGGCATGCA AGGCATCGAA GTGGCGGACC CGTACGTAGA ACGGCTCATC GAGGCGTTCT GCTTCATGTC CGCGCGCACG CAACTCAAGC TCGAGGCGGA GTTCCCCCGC TTCACGCAAC GCCTGCTCGA AGTCGCCTAT CCGAATTACG TCGCGCCGAC GCCGTCGATG GCCGTCGCGC GCCTGCGTCC AAGCCTGCGG GAAGGCGACT TCAGCAAGGG GTTCAAGGTT CCGCGTCACA GCATGCTGCG CTCGTCGATC CCGCCCGGCG AGCAGACCGC ATGCGAATTC CGCACCGGCC AGGACATCAC GCTCTGGCCC ATCGAGATCG CCGGCGCGAC GCTGACCGCC GTGCCGCCCG ATCTTCCGGA TCTTCAGCGC AGCCTGCTGC CGCACACGAA GCTGCGCGGC GCACTCCGGC TGCGCGTGCG CACCGTCGGC GAAATCAGGT TTTCGCAAAT TACCGGACTC GATCGCCTCT CGCTGTATAT CGGCGGCGAC GAACGCATCG CCTCCCATCT TTTCGAGCTG ATTCACGCGA GCAGCGTCGC GTCGGTCGTG CGCGCGCCGG GCGCCGCGCG CAGCGAAGGC GCCGTCGTCG CGAAGAATGC GGTCGACTTC GAAGGCCTGT CGCCCGATCA AAGCCTGTTG CCGCTCGTCT GGAATACGTT CCACGGGCAC AATCTGCTGC ACGAATACTT CACCTGCCGT CAGCGCTTCT ATTTCTTCGC GCTCACGCAA TTGAACGCCG GCTTGTCGCG CATCGACGGA AAGGAAGCCG AAATCGTGCT GTTGCTCGAC CGCTTGCCCG ACGAGCTCGT CACGCACGTC GAGGCCGCGC GCTTTCTGCT GTTTTGCGCC CCCATCGTCA ATCTGTTCCC GAAGCGGACC GACCGGGTGG AAATCAATCG CGCCCAAACG GCTTTCCACC TGATACCGGA CCGCACCCGC CCGCTCGATT ACGAAGTATT CTCGGTGTCG CGCGTGTTCG GGCAAAAAGC CGAGACGTCG ACGGAAGTCA CGTTCAATCC GCTCTATCAG ACGCTGCATA GCGACATCGG CAATTACGGC CGGTATTTCT CGATCCTGCG CGAACCCAGA ACGACATCGA CCAACGCGCG TAAATATGGA ACGCGCACGC CATACGTCGG AACGGAAGTC TACGTGTCGC TCGTCGACCA GGCCGAAGCG CCGTACGCCG ACGACATTCG TTACCTGTCC GTCGACGCAT GGGTCACCAA CCGCGATCTG CCCCGACTGA TCCCGCGCAA CGGCGTCAAC GATCTGACGA TGCAAGACTC CGTGCCGATC GAGGGCGTGA GCCTCGTCCA TCCGCCGAGC GCGCCGCGCG AGCCGTACGC GACCGGCGAA ACCGCGTGGC GGCTGATTCG CCAGCTCAGC TTCAACTACA TGCCGCTTGC CGAGCTCGAC CACCGCGACG GCGGGCAAGC ACTGCGCAAC ATGCTGCGCC TCTTCGTCGG CACGAGCGAG CGCGAACAGG CCACGCAGAT CGACAGCCTG GTCGGCGCGC GCACGGAGCC TGTCGTCCGC CGTCTGCCCG GCCACGGATT ACTGGTTTAC GGGCGCGGCG TCCGGTGCGA GCTGACCGTC GACGAAAGCG GCTTCTCCGG GTTGAGTCCA TACCTGTTCG GTCTCGTGCT CGAGCAATAC CTCACGCGCC ACGTCTCGAT CAATGTGTTC ACCGAGACGG AGCTTCGCTC GATGCAACGC GGCCTCGTCA CGCGCTGGAA ACCGCGCATG GGCGGAAGGG GCGCGGTATG A
|
Protein sequence | MDPQFLDHYN RELTYMRELS AEFAAQHPKI ARRLGMQGIE VADPYVERLI EAFCFMSART QLKLEAEFPR FTQRLLEVAY PNYVAPTPSM AVARLRPSLR EGDFSKGFKV PRHSMLRSSI PPGEQTACEF RTGQDITLWP IEIAGATLTA VPPDLPDLQR SLLPHTKLRG ALRLRVRTVG EIRFSQITGL DRLSLYIGGD ERIASHLFEL IHASSVASVV RAPGAARSEG AVVAKNAVDF EGLSPDQSLL PLVWNTFHGH NLLHEYFTCR QRFYFFALTQ LNAGLSRIDG KEAEIVLLLD RLPDELVTHV EAARFLLFCA PIVNLFPKRT DRVEINRAQT AFHLIPDRTR PLDYEVFSVS RVFGQKAETS TEVTFNPLYQ TLHSDIGNYG RYFSILREPR TTSTNARKYG TRTPYVGTEV YVSLVDQAEA PYADDIRYLS VDAWVTNRDL PRLIPRNGVN DLTMQDSVPI EGVSLVHPPS APREPYATGE TAWRLIRQLS FNYMPLAELD HRDGGQALRN MLRLFVGTSE REQATQIDSL VGARTEPVVR RLPGHGLLVY GRGVRCELTV DESGFSGLSP YLFGLVLEQY LTRHVSINVF TETELRSMQR GLVTRWKPRM GGRGAV
|
| |