Gene BURPS668_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3032 
Symbol 
ID4883039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2973617 
End bp2974837 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content68% 
IMG OID640128960 
Productputative lipoprotein 
Protein accessionYP_001060045 
Protein GI126441291 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.924018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCA ACTCCTCGCT GTCCATCCTG ATAGCGGCCG CCTGCATTCA GGCATTCGCG 
GCGACGGCCT CGCTCGCGCA GGGCCCCGCG CACCCGCCGT CGTACGTCGA AGGCACCCGC
GTGCCGAAAG GCTTCGCGCG GCCGCCGTTC CACACGAATC CCGCACGCTT CTCGGCCACC
ACCGTCTCGG GCCTCGCGCC CGCCACCGTG CGGCACGCGT ACGGCTTCGA CTCGATCGCG
AACCAGGGCG ACGGCATGGT CGTCGCGATC GTCGACGCAT ACGACGACCC GAAGATCGAA
TCCGATCTCG GCGTGTTCAG CAAGAATTTC TCGCTGCCGC CCTGCACGAC GTCGAACGGC
TGCTTCAAGA AGCTCTACGC GAGCGGCAGC AAGCCGAGCC CCAACGCCGG CTGGGCGCTC
GAGATGTCGC TCGATGTCGA ATGGGTGCAT GCGATCGCGC CGAAGGCGAA GATCGTGCTC
GTCGAGGCGG CGTCGAACAG CTTCAACGAT CTGATGACCG CGGTCGATGT CGCCGTCGGG
GCCGGCGCGT CGGTCGTGTC GATGAGCTTC GGCGGCAGCG AATTCAGTTC CGAGACGAGT
TTCGACAGCC ACTTCGGCGC ACCGTCGAAC GTCACGTTCG TCGCATCGTC CGGCGACAGC
GGCAACGGCA CCGAGTATCC GGCGGCGTCG CCGTACGTCG TCGCCGTCGG CGGCACGACG
CTGTCGGCCG ACGCGTCCGG CAACTACGTC GGCGAAACCG CATGGAGCGG CAGCGGCGGC
GGCGTCAGCG CGTACGAACT GGAGCCGGTG GGCCAGACGC TCTGGCCGAT TCCGTACGCC
GGCCAACGCG GCGTGCCCGA CGTCGCGTAC GACGCGAATC CGAATTCCGG CTTCGCGGTG
TACGATTCCG TCACCTATCA GGGGCAATCG GGCTGGTTCG TCGTCGGCGG CACGAGCGCC
GGCGCGCCGC AATGGGCGGC GCTCTTCGCG ATCGCGAACT CGATGCGCAC CGCGGCCGGC
AAGGCGAAGC TCGCCGGCGC GTACAACCAG CTCTATACGG TCGGCAAGAC CGCGTACGGC
AGCGACTATC ACGACGTCAC GTCGGGCACC AACGGCAGTT GCGGGATGAT TTGCACCGCG
AGCGGCGGCT ACGATTACGT GACGGGCCTG GGCTCGCCGC AGGCGCTCAA CCTGGTTCAG
GCGCTCGTCG CGCAACCCTG A
 
Protein sequence
MKSNSSLSIL IAAACIQAFA ATASLAQGPA HPPSYVEGTR VPKGFARPPF HTNPARFSAT 
TVSGLAPATV RHAYGFDSIA NQGDGMVVAI VDAYDDPKIE SDLGVFSKNF SLPPCTTSNG
CFKKLYASGS KPSPNAGWAL EMSLDVEWVH AIAPKAKIVL VEAASNSFND LMTAVDVAVG
AGASVVSMSF GGSEFSSETS FDSHFGAPSN VTFVASSGDS GNGTEYPAAS PYVVAVGGTT
LSADASGNYV GETAWSGSGG GVSAYELEPV GQTLWPIPYA GQRGVPDVAY DANPNSGFAV
YDSVTYQGQS GWFVVGGTSA GAPQWAALFA IANSMRTAAG KAKLAGAYNQ LYTVGKTAYG
SDYHDVTSGT NGSCGMICTA SGGYDYVTGL GSPQALNLVQ ALVAQP