Gene BURPS1106A_A2841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2841 
Symbol 
ID4904249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2788268 
End bp2789584 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content73% 
IMG OID640145944 
Producthypothetical protein 
Protein accessionYP_001076870 
Protein GI126458045 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.548037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACTT CTTCCGATTC GTTCTCGGCC GGCTCGGGCG GATTCGTGCC GCCGAATCCG 
GGCGGCGCGC ACCCGGCCGC CGCGCCGGCG GCGGGTGCGG CCGCGTTTCA GCCGAGGCCG
GGCCGCTGGG CGGCGAGCGG CACGAATCCG CTCGTCGCGG CCGCGAACCC GCTGCTGAAC
CTCGTGCCGC AGATCCGTTC GACCGTCCAT CATCCGAATC CCGCGTGGCT GCGCGAGCAT
CTCGTCGTCG AGATCCGCCA GTTCGAGGAG CGCGCGCAGC AGGCGGGCGT CGCCTCCGAG
GCGATCATCG GCGCGCGCTA CTGCCTGTGC ACCGCGCTCG ACGAGGCCGC CGCGCTGACG
CCGTGGGGCG GCAGCGTGTG GTCGTCGCAT AGCCTGCTCG TGTCGTTCCA CAACGAGACG
TGGGGCGGCG AGAAGTTCTT CCATCTGCTC GAGCGGCTGT CGCAGCAGCC GCGCCAGCAT
CTCGACCTGC TCGAGCTGCT GTACTTCTGC CTCGCGCTCG GCTTCGAGGG GCGCTATCGC
GTGCTCGACA ACGGCCGCGC GCAGCTCGAC GCGGTGCGCC GCCAGCTCGC GCAGACGATC
CGCTCGGTGC GCGGCGAATT CGATCCGGCG CTCTCGCCGC ATTGGCGCGA CGTCGTCACG
CGCGACGTCA CGCGGCGCTT CACGGTGCCG CTGTGGGTGT GCGTCGCGCT CGCGCTGCTC
GTGGGCTTCG GCGTGTTCGC GGGGCTGCGC ATCGCGCTCG CCGGCCATTC GGATCGGCTG
TTCGCGTCGA TCGACGCGCT GCACGTGCCG AAGCTGCAGC CGGCGCCGCC CGCGCCGCAT
CCGGCGCCCG CGCCGCGCGT CGCGAAGTTC CTCGAGCCGG AGATCGCCGC GGGGCTCGTG
AGCGTGCGCG ACGAGGCCGA CCGCAGCGTG ATCGTGCTGC GCGGCGACGG CCTGTTCGGC
TCCGGCTCGA CGTCGGTGAT CGATCGCTAC ATGCCGGTGC TCACGCGCGT GGCCGACGCG
CTGAACCAGG TGCAGGGCAA CGTGCGCGTG AGCGGCTACA CCGACGACAC GCCGGTGCAC
ACCGCGCGCT TCGCGTCGAA CTGGGATTTG TCGCGCGAGC GCGCGCAGGC GGTCCGCAGC
CTGATCGCCG CGCGGCTCGA CCGCCCCGAG CGGATCACCG CCGAAGGGCG CGGCACGCTC
GATCCGGTCG CGCCGAACGA TTCGCCCGCG AACCGCGCGC GCAACCGGCG CGTCGAGATC
ACGCTGATGC TCGCGCCCGG CAGCGACGCC GCGCGCGCGA CGAAGGAGGC GCCCTGA
 
Protein sequence
MNTSSDSFSA GSGGFVPPNP GGAHPAAAPA AGAAAFQPRP GRWAASGTNP LVAAANPLLN 
LVPQIRSTVH HPNPAWLREH LVVEIRQFEE RAQQAGVASE AIIGARYCLC TALDEAAALT
PWGGSVWSSH SLLVSFHNET WGGEKFFHLL ERLSQQPRQH LDLLELLYFC LALGFEGRYR
VLDNGRAQLD AVRRQLAQTI RSVRGEFDPA LSPHWRDVVT RDVTRRFTVP LWVCVALALL
VGFGVFAGLR IALAGHSDRL FASIDALHVP KLQPAPPAPH PAPAPRVAKF LEPEIAAGLV
SVRDEADRSV IVLRGDGLFG SGSTSVIDRY MPVLTRVADA LNQVQGNVRV SGYTDDTPVH
TARFASNWDL SRERAQAVRS LIAARLDRPE RITAEGRGTL DPVAPNDSPA NRARNRRVEI
TLMLAPGSDA ARATKEAP