Gene BURPS668_A2544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2544 
Symbol 
ID4887937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2454523 
End bp2456472 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content68% 
IMG OID640132480 
Producthypothetical protein 
Protein accessionYP_001063536 
Protein GI126444793 
COG category[S] Function unknown 
COG ID[COG4655] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.7801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCACT GGCGCGATAC GCGCGCAGCC GGCGACATCG CGCGATGTCG GCCGGCCGGA 
TGCGCCGCGG ACGACCACAC AGGCCCGATG ACGCCGCTCG ACCGCGCATC GCGTCGACAA
CGACACGCAA CGCCGACGAA GCACTACGAG GTTGGAATGA ACGAGCCGAC ACGCCAGACC
ACCGGGGACC TTTCCGAGCG TCGCGGGCCG AATCGACGGC GCCGCGCGCC GCGCTCGCCG
GCGCGCCAGC GCGGCTCGCT CGCCATCATC GCGGCAATCG CGATCGGCGT CGTGATCGCC
GCGCTCGGCG CGGTCGACCT CGGCAATCTG TTCTATCAGC GCCGCGCGCT GCAAAGCATC
GCCGACCTCG CCGCGCTCGC GGCCGCGCAG ACGATGGACG ACGGCTGCGC GAAGCCGGCC
GCCACCGCGC AATCGGCCGC GCTCGGCAAC GGCTTCGACA GCACCGCGTC GGGACAATCG
ATGACGGTCG TCTGCGGCAG ATGGGACGTG AAGGACAACG TCGGCCCGAG CTTCTTCGCG
GGTTCGGCAT CGGGCGCCGC GGCCGGCAGC GACGCGCAGC TCAACGCGGT TCAAGTGACG
GTCACGCGCG CCGTGCCTTA CTACTTCCTC GGCGCGCAGC GCACGATCGC GGCGACCAGC
ACCGCAGAGG CGACCAATGT CGGCGCCTAC TCGATCGGCA CGACGCTCGC GCAACTGCAA
GGCGGCGTCG TGAACGCGCT GCTCAACGGG CTGCTCGGCA CGAATCTGAA TCTGTCGGTG
TTGTCGTATC AAGGCCTTGC CAATGCGCGA ATCAGGATCA AGGACCTGAT GGCCGCCGCG
AACGTCGGCA CCGTGAGCGC GCTGCTGAGC ACGCAGACGA CCGTCCCGCA GCTCGCGAAC
TGGATGCTGA GCGCGCTGTC GCAGACCTCG GTCGCGAATG CCGACTTGCA GACGAGCATC
GGCGCGCTAC AGACGATCGT CAGCGCGAAC ATTCCGGGCG GCCGGACTTT CACGATCGGC
AACACCGCGA ATTCGGCGGG CATCTTCTCG ATCGGCCTGT CCAATCCGCA GGCCGCGCTC
GACGCGACAT TCAGCCCGTT CGACGCACTT CTCGTCGCGG CCGAGATCGC GACCGGGCAA
ACGGCGTTCT CGCTCGCGAA CGGGCTGAAC ATCGGCGGGT TGAACGCGAA TCTGCAAGTG
CAGATCATCC AGCCGCCCGT GCTCGGCATC GGCGAAGCGG GCATCGACCC CGTCACGAAA
ACGTGGCGCA CGATCGCACG CACCGCGCAG GTGCGACTCT ATCTGAACAT CGGACTCGGC
ACGGCGAACC TGCCGCTCGG GCTGCTCGGC GCGCTCCTGC CGGTGCAGGT GAATCTGCCG
CTATCGCTGC AGATCGCGCC GGGCCAGGCG TGGCTGCAAT CGGCGAGCTG CACGGCGTCG
CCGTCGACTT GCGCCTCGGC CATCGGCGTG CAGACGGGCC TCACGAATCT GTGCATCGGC
GACACGCCGG CCAACATGTC CGCGTCGCTG CCGTTCACCT GCTCGACGCC CGCGACGCTC
GTCAATGTCG CGAACCTCGT GACGATCAAG TCGCTCGTGT CGTTCCCGGC CGACGTGCCC
GCGAGCCAGA CGCCGACGCT CACGTTCTAC GGCACGACGG GCGGCTATCA GAGCACGAAC
TCGAACGGCG TCGGCAGCGT GCTCGGCAAT GCGCTGTCCG GCCTCGGCGC ATCGCTGCAG
CAGACGCAGA TCTCGCTGTT CGGCATCAGC CTGCCGCTCG GCCCGATCCA GACCGCGCTC
AATGCGTTCC TGGGCGGCGT GCTACCGCCG CTGCTGTCGG GGCTCGACGC CGCGATCGTG
CCGCTGCTGC AACTGCTAGG CGTGCAGGTC GGCGAAAGCA CGATTCACGA CATGTCGCTG
ACTTGCGGGG TGTCGCAGCT CGTCTATTGA
 
Protein sequence
MMHWRDTRAA GDIARCRPAG CAADDHTGPM TPLDRASRRQ RHATPTKHYE VGMNEPTRQT 
TGDLSERRGP NRRRRAPRSP ARQRGSLAII AAIAIGVVIA ALGAVDLGNL FYQRRALQSI
ADLAALAAAQ TMDDGCAKPA ATAQSAALGN GFDSTASGQS MTVVCGRWDV KDNVGPSFFA
GSASGAAAGS DAQLNAVQVT VTRAVPYYFL GAQRTIAATS TAEATNVGAY SIGTTLAQLQ
GGVVNALLNG LLGTNLNLSV LSYQGLANAR IRIKDLMAAA NVGTVSALLS TQTTVPQLAN
WMLSALSQTS VANADLQTSI GALQTIVSAN IPGGRTFTIG NTANSAGIFS IGLSNPQAAL
DATFSPFDAL LVAAEIATGQ TAFSLANGLN IGGLNANLQV QIIQPPVLGI GEAGIDPVTK
TWRTIARTAQ VRLYLNIGLG TANLPLGLLG ALLPVQVNLP LSLQIAPGQA WLQSASCTAS
PSTCASAIGV QTGLTNLCIG DTPANMSASL PFTCSTPATL VNVANLVTIK SLVSFPADVP
ASQTPTLTFY GTTGGYQSTN SNGVGSVLGN ALSGLGASLQ QTQISLFGIS LPLGPIQTAL
NAFLGGVLPP LLSGLDAAIV PLLQLLGVQV GESTIHDMSL TCGVSQLVY