Gene BURPS1106A_A2554 details


Gene Information

Locus tag: BURPS1106A_A2554
Symbol:
ID: 4903890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2511113
End bp: 2512447
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 68%
IMG OID: 640145657
Product: hypothetical protein
Protein accession: YP_001076584
Protein GI: 126457814
COG category: [S] Function unknown
COG ID: [COG2733] Predicted membrane protein
TIGRFAM ID:
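
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its coordinate convention) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2511113, 2512447)
print(length)                  # 1335
print(protein_length(length))  # 444
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.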


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.137947
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGACG AGACGAAACC GAGACGAAAC CGATACGACC GGATCGCGAT GCTCGACGAC 
AAGGAACGGG AACTCAGGAA CAGCAAGCGG CGCGCGCTCG CGCTGCTGCT CGCCGCGGCG
GGCGTGTTCG CGGCGACGCT GTTCGCGCCG CGCGGCTTCT GGAGCGACGG CGTCAAGGCG
GTGGCGGAGG CGTCGATGGT GGGCGCGCTC GCCGACTGGT TCGCGGTGGT CGCGCTGTTT
CGCCGCGTGC CGATTCCGTT CGTGTCGCGG CACACGGAGA TCATTCCGCA GAACAAGGAC
AAGATCGCCG ACAATCTCGC CGCGTTCGTG CACGAGAAGT TTCTCGATCC GGCGTCGATC
GTCGCGCTGA TCAAGCGGCA CGATCCGGCC GCCCGGCTCG CGCAGTGGCT CGCGACGCCG
CGCAACGCGA ACGTGCTGGG CGGCTACGCG GCGCGCCTCG TCGCGTTCGG GCTCGACATG
ACCGACGACG CGCGAATCCA GTCGTTCGTG AAGGACGCGT TCCATGCGGT GCTCGAGCGG
ATCGACCTGT CGCAATCGGC GGGCGCGATC CTCGACACGC TGACGAAGGA CGGCCGCCAC
CAGGCGCTCC TCGACGACGG CATCGCGCAG ATCGTCGGAT TCCTTCGCGA TCCCGACAAT
CGCGCGTCGA TCGCGGCGTA CATCGTCGAC TGGCTCAAAT ACCAGTTCCC GAAGATGGAG
AAGCTGCTGC CGACGAACTG GCTAGGCGAG CACGGCGCGG AGCTGATCTC GAACGTCGTC
ACGCGGGTGC TCACGCAGAT CGCCGAAGAC CCGGAGCACC GGCTGCGGCG CAGCTTCGAC
GACGCGGCGG CGCGCCTCGT CACGCGGCTG AAGAGCGATC CGGCGTTCAT CGCGAAGGGC
GAGGAGATCA AGCGCTACCT GCGCGACGGC GACGCGTTCA ACCGCTACGT GAAGGACATG
TGGGACCAGC TGCGCGCATG GCTGAAGGCC GATCTCGCGC GCGACGATTC GGTTGTCCAC
CGGCGCGCGA CGGCGCTCGG CGGCTGGCTC GGCGAGCGTC TCGCGCAGAG CCCGGAGCTG
CGCGATTCGA TGAACGAGCA CGTCGAGCGC GCGGCGAGCG AGATGGCGCC CGAGTTCGCC
GAATTCCTGA CGCGGCACAT CAGCGACACC GTGAAGAACT GGGACGCGCG CGAGATGTCG
CGGCAGATCG AGCTGAACAT CGGCAAGGAC CTGCAGTACA TCCGGATCAA CGGCACGCTG
GTCGGCGGCT TGATCGGGCT CGGGCTGTAT GCGGTGTCGA GCGCCGCGCG GTGGGCGGGC
GCGCTGCCTT ACTGA
 
Protein sequence
MHDETKPRRN RYDRIAMLDD KERELRNSKR RALALLLAAA GVFAATLFAP RGFWSDGVKA 
VAEASMVGAL ADWFAVVALF RRVPIPFVSR HTEIIPQNKD KIADNLAAFV HEKFLDPASI
VALIKRHDPA ARLAQWLATP RNANVLGGYA ARLVAFGLDM TDDARIQSFV KDAFHAVLER
IDLSQSAGAI LDTLTKDGRH QALLDDGIAQ IVGFLRDPDN RASIAAYIVD WLKYQFPKME
KLLPTNWLGE HGAELISNVV TRVLTQIAED PEHRLRRSFD DAAARLVTRL KSDPAFIAKG
EEIKRYLRDG DAFNRYVKDM WDQLRAWLKA DLARDDSVVH RRATALGGWL GERLAQSPEL
RDSMNEHVER AASEMAPEFA EFLTRHISDT VKNWDAREMS RQIELNIGKD LQYIRINGTL
VGGLIGLGLY AVSSAARWAG ALPY
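
The sequence blocks above are printed in space-separated groups of 10, so they need to be cleaned before any computation. The following is a minimal Python sketch of that cleanup, plus a GC-content calculation and a translation of the coding sequence. It assumes the codon assignments of translation table 11 match the standard genetic code (the two tables differ in the set of permitted start codons, which does not matter here because this gene begins with ATG). The names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses the same codon-to-residue mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq_block.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
first_line = "ATGCATGACG AGACGAAACC GAGACGAAAC CGATACGACC GGATCGCGAT GCTCGACGAC"
last_line = "GCGCTGCCTT ACTGA"
print(translate(clean(first_line)))  # MHDETKPRRNRYDRIAMLDD
print(translate(clean(last_line)))   # ALPY
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` would give a value to compare against the GC content field.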