Gene BURPS1106A_2618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2618 
Symbol 
ID4902551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2579919 
End bp2581061 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content68% 
IMG OID640135845 
Productputative lipoprotein 
Protein accessionYP_001066871 
Protein GI126454691 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3317] Uncharacterized lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATT CCGCCTTTTC CTCCCGCGCG ATTCAGGTCT CGGTGCTGGC GCTCGCGCTG 
GCCGCGCTCG CCGGCTGCGA CACGCTGAAC GACTATCTCG CGCCCGACCG GGTCAACTAC
AAGTCCACCG GCTCGGCGCC GCCGCTGCAG GTGCCGCAGG ATCTGACGGC GATGCCGCTC
AGCCCGTCGT ACGTCGCGCC GCCGACCAAT TCGGGCCTCG GTTCCGCGCC GACGCGCGCG
GTGACGGCCG CGGGCAACGC GACGGAAGGG CAGCCGAGCG CGCAGGATCC GTTCGGGATG
CACGTCGAGC GCGACGGCGA CCGCCGCTGG CTCGTCGTCG ACGGCCGCAC GCCCGAGCAG
CTCTGGCCGC AGTTGAAGGA GTTCTGGCAG GACAACGGCT TCGCACTGAA GACGGACGCG
CCGTCGACGG GCATCATGGC GACCGACTGG GCGGAGAACC GCGCGAACAT TCCCGACGAC
TGGTTCCGCC GCACGATCGG CCGTGTGATC GATTTCGCCT ATTCGTCGGG CACGCGCGAT
CGGTTCCGTA CGCTCGTCAC GCGCACGGCG GACGGCAACA CGGACATCTC GATCACGCAC
AGCGCGATGG AGGAGAAGCT GACGGGCGCG CAGGGCGGCA CGTCGTCGCG CTGGGAAGAG
CGTCCGCGCA ACCCGGTGCT CGAGGCCGTG TTCCTCGCGA AGCTGATGGA GAAGTTCGGT
TTGACCGACG CGCAGGCGAA GCAACTGCTC GCCGACGCGC GTCCAGCCAC CGCGCCCGCG
ACCGTCGTCG GCTCCGGTGC CGCGTCGACG CTCGATCTGG CCGAATCGTT CGACCGGGCG
TGGCTGCGTG TCGGGCTCGC GCTCGACCGC ACGAACTTCA CGGTCGACAA CCGGGATCGC
GCGAAGGGCG TCTATTACGT GCGTTACGCG AACTCGATGG AGGAGCTCAA GCGCGACGGC
CTGTTCGGCA AGCTGTTCTA CGGCGGCCCG ACGGCGGCGA AGCCGGGCAA GGAATTCCTC
GTCAACGTGC GCGCGCAAGG CGACGCGAAG ACGCAAGTGG CCGTCATCGA TGCAAACGGC
CAGATCGACA CCTCTTCCGA TGCGCAGCGG ATCATCTCGC TGCTGCACGC GCAGTTGAAC
TAA
 
Protein sequence
MKHSAFSSRA IQVSVLALAL AALAGCDTLN DYLAPDRVNY KSTGSAPPLQ VPQDLTAMPL 
SPSYVAPPTN SGLGSAPTRA VTAAGNATEG QPSAQDPFGM HVERDGDRRW LVVDGRTPEQ
LWPQLKEFWQ DNGFALKTDA PSTGIMATDW AENRANIPDD WFRRTIGRVI DFAYSSGTRD
RFRTLVTRTA DGNTDISITH SAMEEKLTGA QGGTSSRWEE RPRNPVLEAV FLAKLMEKFG
LTDAQAKQLL ADARPATAPA TVVGSGAAST LDLAESFDRA WLRVGLALDR TNFTVDNRDR
AKGVYYVRYA NSMEELKRDG LFGKLFYGGP TAAKPGKEFL VNVRAQGDAK TQVAVIDANG
QIDTSSDAQR IISLLHAQLN