Gene BURPS668_A3102 details


Gene Information

Locus tag: BURPS668_A3102
Symbol:
ID: 4886150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2941333
End bp: 2942889
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 72%
IMG OID: 640133038
Product: MlrC domain-containing protein
Protein accession: YP_001064093
Protein GI: 126443428
COG category: [S] Function unknown
COG ID: [COG5476] Uncharacterized conserved protein
TIGRFAM ID:
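
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2941333
end_bp = 2942889
gene_length_bp = 1557
protein_length_aa = 518

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp, codons, coding_codons)  # 1557 519 518
```

Under these assumptions the span equals the listed gene length, and the number of coding codons equals the listed protein length.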


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.328821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCGCC GTCGCATCAT GAAGATCCTG GTCGCCGGCT TTCGGCACGA GTCGAACACG 
TTCGCACCCA GCAAGGCGAC CTACGCGAGC TTCGCGGCGG ACGGCGGCCG CTATCCGCTG
TCGCGCGGCG CCGAGATCGG CCGGCTCAAG CGGATGAACC TGCCCGTCGC GGGCGCGCTC
GCCGCGCTCG CCGACGCCGG CCACGTCGCG CTGCCCGCGG TCTGGGCCGA TGCGACGCCG
TCGGGCCGCG TCGAATCGGT GGCGTTCGAG CGGATCGCGG GCGAGATCGT CGATGCCGCG
AAGCGCTACG ACGCCGACGG CGTCTACGTC GATCTGCACG GCGCGATGGC GACCGAGCGC
TACGACGACG GCGAGGGCGA GCTGCTGCGC CGGCTGCGCG AGACGGTGGG CGCGCGCGTG
CCGATCGTCG CGTCGCTCGA TCTGCACGCG AACGTCACGC AGCGCATGCT CGACAGCGCG
GACGGGCTCG TCACGTACCG CACGTATCCG CACGTCGACA TGGCCGATAC CGGGCGCCGC
GCGGTCGCGC TGCTCGACAC GCTGCTCGGC AGGCGCGGCC GCCACCGCCA TTTCCGCAGC
GCGCGGCGCG TGCCGTTCCT GATCCCGGTG AACGCGATGT GCACGTCGCT CGAGCCGTCG
AAGAGCCTGT TCAGGCTGCT CGAGCGGCTC GAGACGGGCG CCGTGCGCTC GCTGTCGTTC
GCGCCGGGCT TTCCGGCCGC GGACTTCCCG GAATGCGGGC CGACGATCTG GGGCTACGGC
GCGGACCCCG TCCAGCTCGC GCGCGCGGTC GACGCGCTGT ACGAGCACGT CGTGTCGACC
GAGGCGCAGT GGTCGGTGCC GTTCATGTCG GCGGACGACG CGGTGACCGA GGCGATCCGG
ATCGCGCGCG GCGCGGACAA GCCGGTCGTG ATCGCCGACA CGCAGGACAA CCCGGGCGCG
GGCGGCGGCT CGAACACGAC GGGGCTGTTG CGCGCGCTCG TGCGGCACCG CGCGCCCGAT
GCGGCGCTCG GGCTGTTCTT CGATCCGGCG GCCGCGTGCG CCGCGCATGC GGCAGGCCGC
GGCGCGACGG TCGAGATCAC GCTCGGCGCG GACAGCGGGC TGCCGTTTAC CGGGACGTTT
CGCGTCGAAT CGCTGTCGAA CGGCCGCTGC CATTGCAACG GCCCGATGCT CAAGGGCGCG
ACGTTCGAGC TCGGCCCGAC CGCGTGCCTG CGGATCGGCG ACGTGCGCGT CGTCGTCACG
TCGGCGCGCG TGCAGATGAC CGACCGGAGC TTCTATCGGA TCGCGGGCAT CGCGCCCGAG
ACGATGAAGA TCCTGGTCAA CAAGAGCTCC GTTCATTTTC GGGCGGATTT CGATGCGATC
GCAGATTGCG TGCTGATCGC GAAAGCGGGC GGCTGGATGG CCGCCGACCC GGCCGATCTG
CGCTGGACGT CGCTTGCCGA CGGGATACGC ACGAGCCCGT GCGGCTCGCC GTTCTTCGGC
TGCGGCGGGC GGCGCGCGCC GCATGCGGAC GGGATCGCGG GAGAGATGCG GATATAG
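
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as the GC content reported in this record. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence (six blocks of 10 bases).
first_line = "ATGGAGCGCC GTCGCATCAT GAAGATCCTG GTCGCCGGCT TTCGGCACGA GTCGAACACG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence instead of the first line would give the value to compare against the GC content field; a single 60-base line need not match the whole-gene figure.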
 
Protein sequence
MERRRIMKIL VAGFRHESNT FAPSKATYAS FAADGGRYPL SRGAEIGRLK RMNLPVAGAL 
AALADAGHVA LPAVWADATP SGRVESVAFE RIAGEIVDAA KRYDADGVYV DLHGAMATER
YDDGEGELLR RLRETVGARV PIVASLDLHA NVTQRMLDSA DGLVTYRTYP HVDMADTGRR
AVALLDTLLG RRGRHRHFRS ARRVPFLIPV NAMCTSLEPS KSLFRLLERL ETGAVRSLSF
APGFPAADFP ECGPTIWGYG ADPVQLARAV DALYEHVVST EAQWSVPFMS ADDAVTEAIR
IARGADKPVV IADTQDNPGA GGGSNTTGLL RALVRHRAPD AALGLFFDPA AACAAHAAGR
GATVEITLGA DSGLPFTGTF RVESLSNGRC HCNGPMLKGA TFELGPTACL RIGDVRVVVT
SARVQMTDRS FYRIAGIAPE TMKILVNKSS VHFRADFDAI ADCVLIAKAG GWMAADPADL
RWTSLADGIR TSPCGSPFFG CGGRRAPHAD GIAGEMRI
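
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method. It builds the codon table from the compact 64-letter amino acid string in TCAG order, which, to the best of my knowledge, is the same for NCBI translation tables 1 and 11 (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGGAGCGCC GTCGCATCAT GAAGATCCTG"))  # MERRRIMKIL
```

The result matches the first block of the protein sequence. Translating the last 15 bases of the gene, `GAGATGCG GATATAG`, gives `EMRI` followed by the TAG stop codon, which agrees with the end of the listed protein.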