Gene BURPS668_A3068 details


Gene Information

Locus tag: BURPS668_A3068
Symbol:
ID: 4888656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2911238
End bp: 2912773
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 72%
IMG OID: 640133004
Product: MlrC domain-containing protein
Protein accession: YP_001064059
Protein GI: 126443475
COG category: [S] Function unknown
COG ID: [COG5476] Uncharacterized conserved protein
TIGRFAM ID:
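
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2911238
end_bp = 2912773
gene_length_bp = 1536
protein_length_aa = 511

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder)                                     # 0
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1536 bp is 512 codons, of which 511 encode amino acids.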


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.295091
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTTCG CCATACAGGA AAGCGACATG AACATTCTGA TTGCAGGCTT TCAGCACGAA 
ACCAACACGT TCGCACCGAC GCGCGCGTCG TATCAGAGCT TCGTTCGCGG CGAAGGCTTT
CCGCCGCTCG CGCGCGGCGC CGGCGTGCTG TCGCTGCGCG ACGTCAACGT GCCGATCGGC
GGCTTCATCC GGGCCGCGCA GGCGAGCGGG CACGCGCTGC TGCCTGTCGT GTGGGCGGGC
GCGTGCCCGT CGGCGCACGT GACGAGCGGC GCGTTCGAGC GGATCGGCGG CGAGATCGTC
GCCGCGGTCC AGGCGGGCGG CTTCGACGCG ATCTATCTCG ACCTGCACGG CGCGATGGTC
ACCGAGCAGT TCGACGACGG CGAAGGCGAG CTCCTCGCGC GCGTGCGGCG GATCGTCGGC
GAGCGGATGC CGATCGTCGT GTCGCTCGAT CTGCATGCGA ACGTCACCGC GCGAATGGCC
GCGCATGCGA GCGCGCTCGT TGCCTACCGC ACGTATCCGC ACGTCGACAT GGCGCAAACC
GGCGAGCGTG CCGCGCGGGT GCTCGAGCGG CTCGCGGCCG AGGCCCGGCC GCTGCATTGC
GCGATACGCC GGCTGCCGTT CCTGATTCCG GTCAACGGCA TGTGCACGCA CGCGGAGCCG
GCGAGCGGCG CGTACCGGCT GCTCGCGCAG CTCGAGCGGG ACGGCGTCGT ATCGATGTCG
TTCGCCCCCG GCTTTCCGGC CGCCGATTTC CCGGAGTGCG GGCCGACCGT ATGGGCGCAC
GCGTTCGAGG CCGACGCGGC GCAGCGCGCG GCCGACGCGC TGTTCGCGAA GCTCGTCGGC
GACGAGGCGC GCTGGAGCGT GCCGTTGCTC GCGCCCGACG CGGCCGCCGC CGAGGCGATC
CGCCTGAGCC GCACGGCGAC CAGGCCCGTC GTCATCGCCG ATACGCAGGA CAACCCCGGC
GCGGGCGGCG ACGCCGATAC GATGGGCATG GTGCGCGCGC TGCTGCGCAG CGGCGCGCGG
GACGCGGCGG TGGGCGTGAT CTGGGATCCC GACGCCGCGG CCGCCGCGCA CCGCGCGGGC
GTCGGCGCGC GCATCGGCCT GCGTCTGGGC GGCCGCTCGC GCGTGCGGGG CGACGCGCCG
CTCGATGCCG AATTCGAAGT CGAGCATCTG TCCGACGGCC GTTTCCGGTT CGACGGCCCG
ATGTTCAACG GCGCGCACGG CGAGCTCGGG CCGGTGGCCT GCCTGCGGAT CGACGGCGTG
CGGATCGCGG TGAGCACGAA CAAGATGCAG ACGTTCGAGC GCAACCAGTT CCGCGTGGCG
GGCATCGAGC CCGAGCGCAC GAAGATCGTC GTGAACAAGA GCTCGGTGCA TTTTCGCGCG
GACTTCGAGG CGATCGCCGA TGCGATCCTC GTCGCGAAAT CGCCGGGGCC GATGGCCGCC
GATCCCGCGG ATCTCGCGTG GGCGCGTCTC GATCCCGACA TCCGCGTTCG GCCGAACGGA
CCGACCTTGA GGTCGCTTCG CGCAATGGCG CGTTAG
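
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; `gc_percent` is a hypothetical helper name, and the example uses only the first two blocks of the gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence above.
first_blocks = "ATGCCGTTCG CCATACAGGA"
print(gc_percent(first_blocks))  # 55.0
```

Passing the full gene sequence to the same function gives the value to compare against the reported GC content.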
 
Protein sequence
MPFAIQESDM NILIAGFQHE TNTFAPTRAS YQSFVRGEGF PPLARGAGVL SLRDVNVPIG 
GFIRAAQASG HALLPVVWAG ACPSAHVTSG AFERIGGEIV AAVQAGGFDA IYLDLHGAMV
TEQFDDGEGE LLARVRRIVG ERMPIVVSLD LHANVTARMA AHASALVAYR TYPHVDMAQT
GERAARVLER LAAEARPLHC AIRRLPFLIP VNGMCTHAEP ASGAYRLLAQ LERDGVVSMS
FAPGFPAADF PECGPTVWAH AFEADAAQRA ADALFAKLVG DEARWSVPLL APDAAAAEAI
RLSRTATRPV VIADTQDNPG AGGDADTMGM VRALLRSGAR DAAVGVIWDP DAAAAAHRAG
VGARIGLRLG GRSRVRGDAP LDAEFEVEHL SDGRFRFDGP MFNGAHGELG PVACLRIDGV
RIAVSTNKMQ TFERNQFRVA GIEPERTKIV VNKSSVHFRA DFEAIADAIL VAKSPGPMAA
DPADLAWARL DPDIRVRPNG PTLRSLRAMA R
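
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the sequence is read in frame from its first base and contains only the letters A, C, G and T. The codon table is written out by hand in the usual TCAG ordering; its amino acid assignments are those of the standard genetic code, which translation table 11 shares for internal codons. Alternative start codons are not handled, and `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate("ATGCCGTTCG CCATACAGGA AAGCGACATG"))  # MPFAIQESDM
```

The output matches the first ten residues of the protein sequence; the same call on the full gene sequence can be compared against the whole protein.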