Gene BURPS668_A2699 details

Gene Information

Locus tag: BURPS668_A2699
Symbol:
ID: 4886306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2583918
End bp: 2585252
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 67%
IMG OID: 640132635
Product: hypothetical protein
Protein accession: YP_001063691
Protein GI: 126445074
COG category: [S] Function unknown
COG ID: [COG2733] Predicted membrane protein
TIGRFAM ID:
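
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record.
length = gene_length_bp(2583918, 2585252)
print(length)                      # 1335
print(protein_length_aa(length))   # 444
```

Both results agree with the Gene Length and Protein Length fields.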


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGACG AGACGAAACC GAGACGAAAC CGATACGACC GGATCGCGAT GCTCGACGAC 
AAGGAACGGG AACTCAGAAA CAGCAAGCGG CGCGCGCTCG CGCTGCTGCT CGCCGCGGCG
GGCGTGTTCG CGGCGACGCT GTTCGCGCCG CGCGGCTTCT GGATCGACGG CGTCAAGGCG
GTGGCGGAGG CGTCGATGGT GGGCGCGCTC GCCGACTGGT TCGCGGTGGT CGCGCTGTTT
CGCCGCGTGC CGATTCCGTT CGTGTCGCGG CACACGGAGA TCATTCCGCA GAACAAGGAC
AAGATCGCCG ACAATCTCGC CGCGTTCGTG CACGAGAAGT TTCTCGATCC GGCGTCGATC
GTCGCGCTGA TCAAGCGGCA CGATCCGGCC GCCCGGCTCG CGCAGTGGCT CGCGACGCCG
CGCAACGCGA ACGTGCTGGG CGGCTACGCG GCGCGCCTCG TCGCGTTCGG GCTCGACATG
ACCGACGACG CGCGAATCCA GTCGTTCGTG AAGGATGCGT TCCATGCGGT GCTCGTGCGG
ATCGACCTGT CGCAATCGGC GGGCGCGATC CTCGACACGC TGACGAAGGA CGGCCGCCAC
CAGGCGCTCC TCGACGACGG CATCGCGCAG ATCGTCGGAT TCCTTCGCGA TCCCGACAAT
CGCGCGTCGA TCGCGGCGTA CATCGTCGAC TGGCTCAAAT ACCAGTTCCC GAAGATGGAG
AAGCTGCTGC CGACGAACTG GCTCGGCGAG CACGGCGCGG AGCTGATCTC GAACGTCGTC
ACGCGGGTGC TCACGCAGAT CGCCGAAGAC CCGGAGCACC GGCTGCGGCG CAGCTTCGAC
GACGCGGCGG CGCGCCTCGT CACGCGGCTG AAGAGCGATC CGGCGTTCAT CGCGAAGGGC
GAGGAGATCA AGCGCTACCT GCGCGACGGC GACGCGTTCA ACCGTTACGT GAAGGACATG
TGGGACCAAC TGCGCGCATG GCTGAAGGCC GATCTCGCGC GCGACGATTC GGTTGTCCAC
CGGCGCGCGA CGGCGCTCGG CGGCTGGCTC GGCGAGCGTC TCGCGCAGAG CCCGGAGCTG
CGCGATTCGA TGAACGAGCA CGTCGAGCGC GCGGCGAGCG AGATGGCGCC CGAGTTCGCC
GAATTCCTGA CGCGGCACAT CAGCGACACC GTGAAGAACT GGGACGCGCG CGAGATGTCG
CGGCAGATCG AGCTGAACAT CGGCAAGGAC CTGCAGTACA TCCGGATCAA CGGCACGCTG
GTCGGCGGCT TGATCGGGCT CGGGCTGTAT GCGGTGTCGA GCGCCGCGCG GTGGGCGGGC
GCGCTGCCTT ACTGA
 
Protein sequence
MHDETKPRRN RYDRIAMLDD KERELRNSKR RALALLLAAA GVFAATLFAP RGFWIDGVKA 
VAEASMVGAL ADWFAVVALF RRVPIPFVSR HTEIIPQNKD KIADNLAAFV HEKFLDPASI
VALIKRHDPA ARLAQWLATP RNANVLGGYA ARLVAFGLDM TDDARIQSFV KDAFHAVLVR
IDLSQSAGAI LDTLTKDGRH QALLDDGIAQ IVGFLRDPDN RASIAAYIVD WLKYQFPKME
KLLPTNWLGE HGAELISNVV TRVLTQIAED PEHRLRRSFD DAAARLVTRL KSDPAFIAKG
EEIKRYLRDG DAFNRYVKDM WDQLRAWLKA DLARDDSVVH RRATALGGWL GERLAQSPEL
RDSMNEHVER AASEMAPEFA EFLTRHISDT VKNWDAREMS RQIELNIGKD LQYIRINGTL
VGGLIGLGLY AVSSAARWAG ALPY
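
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence, and the same cleaned sequence can be used to compute a GC fraction. This is a minimal sketch, not the database's own method. It assumes translation table 11 assigns amino acids as the standard genetic code does (the two differ only in which start codons are permitted), and it handles no alternative start codon because this gene begins with ATG. The helper names `clean`, `gc_fraction` and `translate` are illustrative.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGCATGACG AGACGAAACC GAGACGAAAC CGATACGACC GGATCGCGAT GCTCGACGAC"
print(translate(first_row))            # MHDETKPRRNRYDRIAMLDD

# Last row of the gene sequence, which ends in the stop codon TGA.
print(translate("GCGCTGCCTT ACTGA"))   # ALPY
```

The two outputs match the first 20 and last 4 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows comparison with the complete protein sequence and the GC content field.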