Gene BURPS1106A_A0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0741 
Symbol 
ID4905538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp730367 
End bp731338 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content63% 
IMG OID640143847 
Productdipeptidase family protein 
Protein accessionYP_001074777 
Protein GI126457459 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGC TGCACCAGGA CAGCATCATC ATCGATGGCC TGAACATTTC GAAGTTCGAA 
CGCTCGGTGT TCGAAGACAT GCAAAAGGGC GGCGTGACGG CCGCGAACTG CACGGTGTCC
GTGTGGGAGA ACTTCACGAA GACGGTCGAC AACATCGCGC TGATGAAAAA GCAGATTCGC
GAGAACGGCG AACTGCTGAC GCTCGTGCGC ACGACGGACG ACATCCTCCG CGCGAAGCGG
GAAGGCCGCA CGGGCGTGAT CCTCGGCTTC CAGAACGCGC ACGCGTTCGA GGACAACCTG
GGCTATGTCG AGGCGTTCGC CGACATGGGC GTGCGCGTCG TGCAGCTTTG CTACAACACG
CAGAACCTCG TCGGCACCGG CTGCTACGAG CGCGACGGCG GGCTGTCGGA TTTCGGCCGC
GAGGTGATCA CCGAGATGAA CCGCGTCGGG ATCATGGTCG ACTTGTCGCA CGTCGGCGGC
AACACGTCGT CGGAGGCGAT CGCGTTCTCG AAGAAACCCG TGTGCTACTC GCACTGCCTG
CCGTCGGGTC TCAAGGCGCA TCCGCGCAAC AAGAGCGACG CGCAACTGAA GGAGATCGCG
GACGCGGGCG GCTTCGTCGG GGTGACGATG TTCGCGCCGT TCCTGAAGCG CGGGATCGAC
GCGACGATCG ACGATTACAT CGAGGCGATC GGCTACGTCG TGAACCTGAT CGGCGAGGAC
GCGGTCGGCA TCGGCACCGA TTTCACGCAG GGCTACAGCG TCGATTTCTT CGATTGGCTC
ACGCACGACA AGGGCCGCTA CCGCCGGCTC ACGAATTTCG GCAAGGTCGT GAATCCTGAA
GGCATCCGAA CGATCGGCGA ATTCCCGAAC CTGACGGCCG CGATGGAGCG CGCGGGATGG
AAGGCGTCGC GCATCCGCAA GATCATGGGC GAAAACTGGG TGCGCGTGTT CAAGGAGGTC
TGGGGCGCGT AA
 
Protein sequence
MSTLHQDSII IDGLNISKFE RSVFEDMQKG GVTAANCTVS VWENFTKTVD NIALMKKQIR 
ENGELLTLVR TTDDILRAKR EGRTGVILGF QNAHAFEDNL GYVEAFADMG VRVVQLCYNT
QNLVGTGCYE RDGGLSDFGR EVITEMNRVG IMVDLSHVGG NTSSEAIAFS KKPVCYSHCL
PSGLKAHPRN KSDAQLKEIA DAGGFVGVTM FAPFLKRGID ATIDDYIEAI GYVVNLIGED
AVGIGTDFTQ GYSVDFFDWL THDKGRYRRL TNFGKVVNPE GIRTIGEFPN LTAAMERAGW
KASRIRKIMG ENWVRVFKEV WGA