Gene BURPS668_A2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2079 
Symbol 
ID4886291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2017357 
End bp2018541 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content73% 
IMG OID640132017 
ProductApbE family protein 
Protein accessionYP_001063074 
Protein GI126445269 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAAGA CGTCTATTGA ATGGTCGCCC GGCGCGCGGC TGAATCGCTG CCGCGCAAGC 
GGCGCGACGA TGGGCACGCG CTACGGCGCG CAGTTCTACG CGCCGCCGAC GGCCGACGCG
AGGGCGATCG CGGCCGCCCT CGACGCGGCG GTGCGGGCGG TCGACGCGCA GATGTCGAAC
TGGAAGGCCG ATTCGGATCT GTCGCGGCTC AATCGCGCGA CGCCCGGAAG CTGGACGCCG
ATCTGCGCGA ACCTCGCCGC GGTGCTCGTG CGCGCGCGGG AAATCGGCCG CGAGACGGAC
AACGCGTTCA ACATCGGCGT CGGCACGCTC GTCGATCGAT GGGGATTCGG GCCGGGCGCG
GCCGCGAACC GACAAGCGGA CAACGAATGG GCGGCGAATC GACAGGCGGC CGGCCGACAT
ACGGTTGATC GACGTACGGT TGCTCGACAC ATGGTTGATC GACACACGGC GGACCGGCAA
ACGAAGGACG GGCGCACGCC GTGCCGCCAG CCGGCGCGCC CGGCCGGCCC CGCGAACGGG
TTGTCGGGCG CGATCGACGC GCGCCGCCGC GCGTCGATCC TGCGCGGCCC CGTGCCGTCG
CCGTGCCGCC CGATCGACGA ACTGCTCGAA GTCGATGTCG CGCGGGGCCG GGCGCGCCGG
CTCGCGGACG TCGCCTTCGA CCTGTGCGGG ATCGCGAAGG GCTTCGGCGT GGACGAGCTT
GCGCGCGTGC TCGATCGCCA CGACATCGGC GCATGGCTCG TCGGCATCGA CGGCGAACTG
CGCGCGCGCG GATGCAAGCC GGACGGCTCG CCGTGGGCGA TCGCGCTCGA AGCGCCCGAC
TACGACCGGC GCGGCGCGAT GGGCGCGATC GATCTCGTCG ACGCGGCCGT CGCGACCTCC
GGCGATTACC GGCATTGGGC CGACTTCGGC GGCGAACGCC TCTCGCATAC GATGGACCCG
CGCGCCGGCG CGCCGCTGCG CGGCGACATC GCCTCGGTCA CGGTCGTCGC GCCGACCTGC
ACCGACGCGG ACGCGTACGC CACCGCGTTG ATGGTGCTCG GCGCGCAGGC GGGATGCGCG
CACGCCGAAC GCCACGGACT CGACGCGCTG TTCGTCGTGC GCGACGGCGA CGCGCTGCGC
ACGATCGGCT GCGGCGCTTT CGCGGACGCG GGGCCGGCGG GCTGA
 
Protein sequence
MSKTSIEWSP GARLNRCRAS GATMGTRYGA QFYAPPTADA RAIAAALDAA VRAVDAQMSN 
WKADSDLSRL NRATPGSWTP ICANLAAVLV RAREIGRETD NAFNIGVGTL VDRWGFGPGA
AANRQADNEW AANRQAAGRH TVDRRTVARH MVDRHTADRQ TKDGRTPCRQ PARPAGPANG
LSGAIDARRR ASILRGPVPS PCRPIDELLE VDVARGRARR LADVAFDLCG IAKGFGVDEL
ARVLDRHDIG AWLVGIDGEL RARGCKPDGS PWAIALEAPD YDRRGAMGAI DLVDAAVATS
GDYRHWADFG GERLSHTMDP RAGAPLRGDI ASVTVVAPTC TDADAYATAL MVLGAQAGCA
HAERHGLDAL FVVRDGDALR TIGCGAFADA GPAG