Gene BURPS1106A_0436 details

Gene Information

Locus tag: BURPS1106A_0436
Symbol:
ID: 4901983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 401220
End bp: 402539
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 72%
IMG OID: 640133666
Product: hypothetical protein
Protein accession: YP_001064719
Protein GI: 126451918
COG category:
COG ID:
TIGRFAM ID:
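
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 401220
end_bp = 402539
gene_length_bp = 1320
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # True
print(gene_length_bp % 3 == 0)                        # True
print(expected_protein_length == protein_length_aa)   # True
```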


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTTGT CGTCCATCGT TTCGTCGAAC CCGCGCGGCG CGGCGCGCCG CGTGCGCCGC 
GCCGCGTGCG TTCTCGCGCT CGTGCTGGCG CTGCACTGGC TCGCCGCGCT GTGGCTCGTG
CGGTTTCGCG AGCCGTTCAG GCCCGTCGAG CCCGACCACG TGCCGGTTCA GGTCGAATTG
CTCAAGCCGC AGCCGATCGA GCGCGCGCCC GCGCCGGAGA AGCCCGCCGC CGATCGGCCG
CGGGCCGCGC CGAAGCGGGC GGCCCGCGCG TCCGCGCCGC CCGCGCATGC GCCGCGCGCC
TCGGCGCCCG TATCGAGCGC CGCTGAATCC TCCACCGAAT CCTCCGCTGA ATCGCCCGCC
GCCGCCTCCG GCACCGAACC GGCAAGCGCG GCGGGCGGGC AGGCGGCCGG CGCGACGAGC
GGCGCGGCCG CCGGCGCATC GGGCGCGAGC GCGCCGCCCG GCGAAGCAGC GCAGGGCATG
AAATTCGCGC TGCCGCCGTC CGCCGATCTG CAATACGACA CGTTCTACAA CGGCATGCAA
AACATGCCCG GCACGATCCA CTGGCGCACC GACGGCGGCG GCTATTCGTT ATACGTATCG
ATGCCCGTGC CGTTCGTCGG CCCGTACACA TACGAAAGCC GCGGCCGCGT CGACGCGTTC
GGCGTCGCGC CTGCGCGCTA TGTCGAGACG CGCGGCCGGC GGCCGCCCGA TTTCGCGATC
TTCAATCGGC AGACGAAGCA AATCGTGTTC ACCGGCACGC CGAACTCGCT CGCGCTGCCC
GACGGCGCGC AGGACCGCTT CAGCATGCTG ATGCAACTCG CGGGCCTCGT CGGCGGCGAT
CCCGACGCGT ATCGCCCGGG CGTCACGCGC GAGTTCTTCG TCGTCGATCG CGACAGCGGC
GAGACGTGGC CGATCACGAC GATCGGCGAC GAGACGATCT CGACGGGCAT GGGCTCGCTC
GATGCTAGGC ATTTCATGCG GCTGCCGCGC CGCGCGGGCG ACACGCGCCG CATCGACATC
TGGCTCGCGC CGTCGCTCGG CTGGCTGCCC GTGCGGATGG TTCAGACGGA ACCGAACGGC
GCGCAGATCG AATTGCTGCT GCACCGGCGC ACGAACGCAA ACGGCGATGC GGACGTACAC
TCGGATACGA GCGCAAACAC GGACGCAGAC GCGAACGCCG ACGGCGGCGC GGCGCCGGCC
GCCGCCCCGG CATCCGCGTC CGGCGCGCCG TTGAACGCGA ACGAGCCGGT AAATTCGACC
GAGAAGGCCG ACGGATCGCC GCGCCCGCCG CCGGCCGATC CCGGAGAACC GCAACCTTGA
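
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence laid out in 10-base groups like the one above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and the record does not state how its own value was calculated or rounded.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used only as a short example input.
first_row = "ATGCCGTTGT CGTCCATCGT TTCGTCGAAC CCGCGCGGCG CGGCGCGCCG CGTGCGCCGC"
print(f"{gc_fraction(first_row):.2f}")
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the value for the whole gene.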
 
Protein sequence
MPLSSIVSSN PRGAARRVRR AACVLALVLA LHWLAALWLV RFREPFRPVE PDHVPVQVEL 
LKPQPIERAP APEKPAADRP RAAPKRAARA SAPPAHAPRA SAPVSSAAES STESSAESPA
AASGTEPASA AGGQAAGATS GAAAGASGAS APPGEAAQGM KFALPPSADL QYDTFYNGMQ
NMPGTIHWRT DGGGYSLYVS MPVPFVGPYT YESRGRVDAF GVAPARYVET RGRRPPDFAI
FNRQTKQIVF TGTPNSLALP DGAQDRFSML MQLAGLVGGD PDAYRPGVTR EFFVVDRDSG
ETWPITTIGD ETISTGMGSL DARHFMRLPR RAGDTRRIDI WLAPSLGWLP VRMVQTEPNG
AQIELLLHRR TNANGDADVH SDTSANTDAD ANADGGAAPA AAPASASGAP LNANEPVNST
EKADGSPRPP PADPGEPQP
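
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows that mapping under stated assumptions: translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs in which start codons are permitted, so the standard codon table is used here. `translate` is a hypothetical helper; it does not handle alternative start codons or ambiguous bases (an unknown codon raises `KeyError`).

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGCCGTTGT CGTCCATCGT TTCGTCGAAC"))  # MPLSSIVSSN
```

The printed residues match the first group of the protein sequence above.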