Gene BURPS1106A_A1982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1982 
Symbol 
ID4903883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1951623 
End bp1952807 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content74% 
IMG OID640145088 
ProductApbE family protein 
Protein accessionYP_001076016 
Protein GI126457068 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.63389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAAGA CGTCTATTGA ATGGTCGCCC GGCGCGCGGC TGAATCGCTG CCGCGCAAGC 
GGCGCGACGA TGGGCACGCG CTACGGCGCG CAGTTCTACG CGCCGCCGAC GGCCGACGCG
AGGGCGATCG CGGCCGCCCT CGACGCGGCG GTGCGGGCGG TCGACGCGCA GATGTCGAAC
TGGAAGGCCG ATTCGGATCT GTCGCGGCTC AATCGCGCGA CGCCCGGAAG CTGGACGCCG
ATCTGCGCGA ACCTCGCCGC GGTGCTCGTG CGCGCGCGGG AAATCGGCCG CGAGACGGAC
AACGCGTTCA ACATCGGCGT CGGCACGCTC GTCGATCGAT GGGGATTCGG GCCGGGCGCG
GCCGCGAACC GACAAGCGGA CAACGAATGG GCGGCGAATC GACAGGCGGC CGGCCGACAC
ACGGTTGATC GACGTACGGT TGATCGACAC ACGGCGGACC GACACACGGC GGACCGACAA
ACGAAGGACG GGCGCACGCC GGACCGCCAG CCGGCGCGCC CGGCCGACCC CGCGAACGGG
TTGTCGGGCG CGATCGACGC GCGCCGCCGC GCGTCGATCC TGCGCGGCCC CGTGCCGTCG
CCGTGCCGCC CGATCGACGA ACTGCTCGAA GTCGATGTCG CGCGGGGCCG GGCGCGCCGG
CTCGCGGACG TCGCCTTCGA CCTGTGCGGG ATCGCGAAGG GCTTCGGCGT GGACGAGCTT
GCGCGCGTGC TCGATCGCCA CGGCATCGGC GCATGGCTCG TCGGCATCGA CGGCGAGCTG
CGCGCGCGCG GATGCAAGCC GGACGGCTCG CCGTGGGCGA TCGCGCTCGA AGCGCCCGAC
TACGGCCGGC GCGGCGCGAT GGGCGCGATC GATCTCGTCG ACGCGGCCGT CGCGACCTCC
GGCGATTACC GGCATTGGGC CGACTTCGGC GGCGAACGCC TCTCGCATAC GATGGACCCG
CGCGCCGGCG CGCCGCTGCG CGGCGACATC GCCTCGGTCA CGGTCGTCGC GCCGACCTGC
ACCGACGCGG ACGCGTACGC CACCGCGTTG ATGGTGCTCG GCGCGCAGGC GGGATGCGCG
CACGCCGAAC GCCACGGGCT CGACGCGCTG TTCGTCGTGC GCGACGGCGA CGCGCTGCGC
ACGATCGGCT GCGGCGCTTT CGCGGACGCG GGGCCGGCGG GCTGA
 
Protein sequence
MSKTSIEWSP GARLNRCRAS GATMGTRYGA QFYAPPTADA RAIAAALDAA VRAVDAQMSN 
WKADSDLSRL NRATPGSWTP ICANLAAVLV RAREIGRETD NAFNIGVGTL VDRWGFGPGA
AANRQADNEW AANRQAAGRH TVDRRTVDRH TADRHTADRQ TKDGRTPDRQ PARPADPANG
LSGAIDARRR ASILRGPVPS PCRPIDELLE VDVARGRARR LADVAFDLCG IAKGFGVDEL
ARVLDRHGIG AWLVGIDGEL RARGCKPDGS PWAIALEAPD YGRRGAMGAI DLVDAAVATS
GDYRHWADFG GERLSHTMDP RAGAPLRGDI ASVTVVAPTC TDADAYATAL MVLGAQAGCA
HAERHGLDAL FVVRDGDALR TIGCGAFADA GPAG